New Trends in Machine Translation using Large Language Models: Case Examples with ChatGPT Paper • 2305.01181 • Published May 2, 2023 • 1
GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation Paper • 2311.16511 • Published Nov 25, 2023 • 1
Retrieval-augmented Multi-modal Chain-of-Thoughts Reasoning for Large Language Models Paper • 2312.01714 • Published Dec 4, 2023 • 1
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration Paper • 2306.09093 • Published Jun 15, 2023 • 15
Beyond Probabilities: Unveiling the Misalignment in Evaluating Large Language Models Paper • 2402.13887 • Published Feb 21, 2024 • 1
Can a Multichoice Dataset be Repurposed for Extractive Question Answering? Paper • 2404.17342 • Published Apr 26, 2024 • 1
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark Paper • 2406.05967 • Published Jun 10, 2024 • 6
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering Paper • 2311.07536 • Published Nov 13, 2023 • 3
Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement Paper • 2412.04003 • Published Dec 5, 2024 • 11
Can Multimodal LLMs do Visual Temporal Understanding and Reasoning? The answer is No! Paper • 2501.10674 • Published Jan 18, 2025 • 1
Towards Widening The Distillation Bottleneck for Reasoning Models Paper • 2503.01461 • Published Mar 3, 2025
The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks Paper • 2504.15521 • Published Apr 22, 2025 • 64
New Trends for Modern Machine Translation with Large Reasoning Models Paper • 2503.10351 • Published Mar 13, 2025 • 25
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published Nov 21, 2024 • 61