Qwen models with custom class for bidirectional attention
Joao Coelho
jmvcoelho
AI & ML interests
None yet
Organizations
models 17
jmvcoelho/apm_sft_1.7b_all_positive_asearcher_rlm_cweb_wikipedia_8H100
Updated
jmvcoelho/apm_sft_1.7b_all_positive_afm_taskcraft_only_serper_8H100
Updated
jmvcoelho/apm_sft_1.7b_correct_and_positive_asearcher_rlm_clueweb_8H100
Updated
jmvcoelho/Qwen2.5-0.5B-bidirectional-attn-mntp
0.5B • Updated
jmvcoelho/Qwen2.5-0.5B-bidirectional-attn
0.5B • Updated
jmvcoelho/ad-classifier-v0.2
Text Classification • 0.2B • Updated
jmvcoelho/ad-classifier-v0.1
Text Classification • 0.2B • Updated
• 2
jmvcoelho/ad-classifier-v0.0
Text Classification • 0.2B • Updated
jmvcoelho/GPTNeoX-160m
0.2B • Updated
• 10 • 1
jmvcoelho/pythia-160m-1024-marco-docs-bow-contrastive-pretrain
Updated
• 7