Spaces-explorers

Activity Feed Request to join this org

AI & ML interests

Contributors who are invited to beta-test our next big feature! Contact us if you want to join this team :-)

AlekseyKorshuk

authored a paper 16 days ago

Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia

Paper • 2512.03318 • Published Dec 3, 2025 • 4

manu

authored 2 papers 4 months ago

EuroLLM-9B: Technical Report

Paper • 2506.04079 • Published Jun 4, 2025 • 1

ModernVBERT: Towards Smaller Visual Document Retrievers

Paper • 2510.01149 • Published Oct 1, 2025 • 30

yuna0x0

authored a paper 4 months ago

See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation

Paper • 2509.22653 • Published Sep 26, 2025 • 24

manu

authored a paper 7 months ago

Should We Still Pretrain Encoders with Masked Language Modeling?

Paper • 2507.00994 • Published Jul 1, 2025 • 80

manu

authored a paper 8 months ago

ViDoRe Benchmark V2: Raising the Bar for Visual Retrieval

Paper • 2505.17166 • Published May 22, 2025

manu

authored 2 papers 11 months ago

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published Mar 7, 2025 • 80

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 43

naotous

authored 4 papers about 1 year ago

Large-Scale Domain-Specific Pretraining for Biomedical Vision-Language Processing

Paper • 2303.00915 • Published Mar 2, 2023 • 6

Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine

Paper • 2311.16452 • Published Nov 28, 2023 • 2

BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

Paper • 2405.12971 • Published May 21, 2024 • 2

From Medprompt to o1: Exploration of Run-Time Strategies for Medical Challenge Problems and Beyond

Paper • 2411.03590 • Published Nov 6, 2024 • 10

manu

authored a paper over 1 year ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 29

blanchon

posted an update over 1 year ago

Post

4421

I’ve built a simple Room Cleaner app to remove clutter from messy room.
Try the Space here: https://huggingface.co/spaces/blanchon/room_cleaner

4 replies

·

HannaAbiAkl

authored 2 papers over 1 year ago

Project SHADOW: Symbolic Higher-order Associative Deductive reasoning On Wikidata using LM probing

Paper • 2408.14849 • Published Aug 27, 2024 • 5

DSTI at LLMs4OL 2024 Task A: Intrinsic versus extrinsic knowledge for type classification

Paper • 2408.14236 • Published Aug 26, 2024 • 5

lighteternal

authored a paper over 1 year ago

FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments

Paper • 2407.09888 • Published Jul 13, 2024

manu

authored a paper over 1 year ago

ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 50

mrm8488

posted an update over 1 year ago

Post

8098

🚨Exciting news for the Multilingual Synthetic Data Community!🚨

I’ve taken inspiration from the MAGPIE paper on Llama-3-8B-instruct and extended its capabilities. Here’s what’s new!

🗞 The MAGPIE paper showcased that if you use the instruction-tuned version (Llama-3-8B-instruct) to generate synthetic instructions and then fine-tune the base version (Llama-3-8B) on this dataset, you can improve even the it-tuned version

🤔 While reading a script by Sebastian Raschka, PhD, I wondered: Could these advancements be replicated in other languages? Specifically, could they benefit non-English datasets?

🎉 And the answer is YES! At least for Spanish. I've successfully adapted the techniques for Spanish, proving the model's flexibility and multilingual capabilities.

👩‍💻 To make this accessible, I created a basic script (heavily inspired by the Sebastian Raschka one) that allows you to generate similar datasets using ollama models (initially phi and llama3) automatically and upload it to the Hugging Face Hub!
[Script](https://gist.github.com/mrm8488/4650a5e3cc45523798a527a3446eb312)

🔍 Explore the datasets 📚 generated using our new script!

- [Llama-3-8B](https://huggingface.co/datasets/mrm8488/dataset_llama3_5000_samples_es_4231_filtered)
- [Phi-3-medium](https://huggingface.co/datasets/mrm8488/dataset_phi3-medium_5000_samples_es_3906_filtered)
- [Phi-3-mini](https://huggingface.co/datasets/mrm8488/dataset_phi3_5000_samples_es_3282_filtered)

Note: These datasets have basic filtering. Apply additional quality filters before using them to fine-tune large language models.

Inspiration and base script:
https://github.com/rasbt/LLMs-from-scratch/blob/main/ch07/05_dataset-generation/llama3-ollama.ipynb
https://www.linkedin.com/feed/update/urn:li:activity:7210982019751661568/

7 replies

·

lighteternal

authored a paper over 1 year ago

PENELOPIE: Enabling Open Information Extraction for the Greek Language through Machine Translation

Paper • 2103.15075 • Published Mar 28, 2021