Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model • Paper • 2504.08685 • Published Apr 11, 2025 • 130
Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 10 items • Updated 13 days ago • 558
Retentive Network: A Successor to Transformer for Large Language Models • Paper • 2307.08621 • Published Jul 17, 2023 • 173