# EAA Fusion Head for Gemma (LoRA) + w2v-bert-2.0 + emotion2vec

This repo hosts the **fusion head** weights and code for the Emotion-Aware Audio LLM.

- LoRA adapter lives at: **marccgrau/eaa-gemma3-270m-adapter**
- Upstream encoders: `facebook/w2v-bert-2.0` (semantic) and `iic/emotion2vec_base` (acoustic, via FunASR)
- LLM: `google/gemma-3-270m`
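The fusion head consumes frame-level features from both encoders. A minimal extraction sketch, assuming 16 kHz mono audio and the encoders' default frame-level outputs; `audio.wav`, the resampling step, and the FunASR `granularity`/`extract_embedding` settings are assumptions, not taken from this repo's training pipeline:

```python
import torch
import torchaudio
from transformers import AutoFeatureExtractor, Wav2Vec2BertModel
from funasr import AutoModel as FunASRAutoModel

# Load and resample to 16 kHz mono (assumed input format)
wav, sr = torchaudio.load("audio.wav")  # illustrative path
wav = torchaudio.functional.resample(wav, sr, 16000).mean(0)

# Semantic encoder: w2v-bert-2.0 frame-level hidden states (dim 1024)
fe = AutoFeatureExtractor.from_pretrained("facebook/w2v-bert-2.0")
w2v = Wav2Vec2BertModel.from_pretrained("facebook/w2v-bert-2.0").eval()
inputs = fe(wav.numpy(), sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    sem_feats = w2v(**inputs).last_hidden_state  # (1, T_sem, 1024)

# Acoustic encoder: emotion2vec via FunASR, frame-level embeddings (dim 768)
e2v = FunASRAutoModel(model="iic/emotion2vec_base")
res = e2v.generate("audio.wav", granularity="frame", extract_embedding=True)
ac_feats = torch.tensor(res[0]["feats"]).unsqueeze(0)  # (1, T_ac, 768)
```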
## Files

- `fusion_head.pt`: PyTorch state_dict of the fusion/regression head
- `eaa_config.json`: minimal config (model IDs, feature dims, hyperparameters)
- `modeling_eaa.py`: the fusion architecture (dual cross-attention + pooling + [REG] head)
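For orientation, `eaa_config.json` carries only what the quickload below reads. A hypothetical example; the field names match the loading code, while the numeric values are illustrative (with `d_sem`/`d_ac` matching the encoders' published hidden sizes) rather than read from this repo:

```json
{
  "gemma_id": "google/gemma-3-270m",
  "adapter_repo": "marccgrau/eaa-gemma3-270m-adapter",
  "d_sem": 1024,
  "d_ac": 768,
  "llm_hidden": 640,
  "fusion_dim": 512,
  "num_audio_tokens": 8
}
```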
## Quickload (Python)

```python
import json, os, sys
import torch
from huggingface_hub import hf_hub_download

REPO = "marccgrau/eaa-gemma3-270m-w2vbert-emotion2vec"

# Download artifacts: config, fusion architecture, head weights
cfg_path = hf_hub_download(repo_id=REPO, filename="eaa_config.json")
mod_path = hf_hub_download(repo_id=REPO, filename="modeling_eaa.py")
sd_path = hf_hub_download(repo_id=REPO, filename="fusion_head.pt")

with open(cfg_path) as f:
    cfg = json.load(f)

# Make the downloaded module importable before using it
sys.path.insert(0, os.path.dirname(mod_path))
from modeling_eaa import EAAEmotionRegressor

# Recreate Gemma and load the LoRA adapter on top
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

tok = AutoTokenizer.from_pretrained(cfg["gemma_id"], trust_remote_code=True)
llm_base = AutoModelForCausalLM.from_pretrained(
    cfg["gemma_id"], trust_remote_code=True, torch_dtype=torch.float16
).cuda()
llm = PeftModel.from_pretrained(llm_base, cfg["adapter_repo"]).eval()

# Build the fusion head and load its weights
head = EAAEmotionRegressor(
    d_sem=cfg["d_sem"], d_ac=cfg["d_ac"], llm_hidden=cfg["llm_hidden"],
    fusion_dim=cfg["fusion_dim"], num_audio_tokens=cfg["num_audio_tokens"],
).cuda().eval()
head.load_state_dict(torch.load(sd_path, map_location="cpu"))

# Now pass (sem_feats, ac_feats) and input_ids to head.forward(..., llm=llm)
```
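Putting it together, a hedged inference sketch; the exact `forward` signature and the prompt wording are assumptions (check `modeling_eaa.py` for the real argument names), and the shape of the output depends on the regression targets the head was trained on:

```python
# sem_feats / ac_feats as produced by the encoder sketch above
prompt_ids = tok("Describe the speaker's emotion.",  # illustrative prompt, not from this repo
                 return_tensors="pt").input_ids.cuda()

with torch.no_grad():
    preds = head(
        sem_feats.cuda(),   # semantic features from w2v-bert-2.0
        ac_feats.cuda(),    # acoustic features from emotion2vec
        input_ids=prompt_ids,
        llm=llm,            # Gemma base + LoRA adapter
    )
print(preds)  # regression outputs from the [REG] head
```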