Shannon Sands's picture

20 378

Shannon Sands

ssands1979

·

AI & ML interests

None yet

Recent Activity

liked a dataset about 18 hours ago

miromind-ai/MiroVerse-v0.1

liked a dataset about 19 hours ago

joyheyueya/20241202_icl_sft_text_datadict

liked a dataset about 19 hours ago

TomTBT/pmc_open_access_xml

View all activity

Organizations

upvoted a collection 9 days ago

H-Net

The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 • 8 items • Updated Jul 11 • 20

upvoted a collection 19 days ago

Pre-training Dataset

7 items • Updated Jun 19 • 4

upvoted a collection about 2 months ago

Encoders vs Decoders: the Ettin Suite

A collection of SOTA, open-data, paired encoder-only and decoder only models ranging from 17M params to 1B. See the paper at https://arxiv.org/abs/250 • 32 items • Updated Jul 16 • 25

upvoted a collection 3 months ago

T5Gemma

32 items • Updated Jul 10 • 75

upvoted an article 3 months ago

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Aug 11

•

75

upvoted a paper 5 months ago

Optimizing Length Compression in Large Reasoning Models

Paper • 2506.14755 • Published Jun 17 • 10

upvoted 2 articles 7 months ago

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25

•

303

Article

Cohere on Hugging Face Inference Providers 🔥

Apr 16

•

129

upvoted a collection 8 months ago

Delta_CLIP

3 items • Updated Mar 17 • 2

upvoted a paper 8 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 300

upvoted 2 collections 8 months ago

RLVR

Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated Mar 31 • 13

🏆 IOI

Resources related to International Olympiad in Informatics (IOI) problems • 5 items • Updated May 13 • 7

upvoted a collection 10 months ago

DeepSeek-R1

10 items • Updated May 29 • 814

upvoted an article over 1 year ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024

•

301

upvoted 3 papers over 1 year ago

From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples

Paper • 2404.07544 • Published Apr 11, 2024 • 20

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6, 2024 • 66

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 625

upvoted a paper almost 2 years ago

User-LLM: Efficient LLM Contextualization with User Embeddings

Paper • 2402.13598 • Published Feb 21, 2024 • 20

upvoted 2 papers over 2 years ago

STEVE-1: A Generative Model for Text-to-Behavior in Minecraft

Paper • 2306.00937 • Published Jun 1, 2023 • 9

Copy Is All You Need

Paper • 2307.06962 • Published Jul 13, 2023 • 35