Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2411.12946

Papers by GovTech 📝

LionGuard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Content

Paper • 2407.10995 • Published Jun 24, 2024 • 2
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection

Paper • 2411.12946 • Published Nov 20, 2024 • 22
Safe at the Margins: A General Approach to Safety Alignment in Low-Resource English Languages -- A Singlish Case Study

Paper • 2502.12485 • Published Feb 18 • 2
MinorBench: A hand-built benchmark for content-based risks for children

Paper • 2503.10242 • Published Mar 13 • 5

Off Topic Guardrail 🛡️

Fast, lightweight zero-shot classifiers for user prompt's relevance to the system prompt.

Running

4

Off Topic Guardrail Demo

🙅

4

Check if user prompts are on-topic for a given system prompt
govtech/jina-embeddings-v2-small-en-off-topic

Updated Nov 22, 2024 • 12 • 2
govtech/stsb-roberta-base-off-topic

Updated Nov 25, 2024 • 443 • 2
gabrielchua/off-topic

Viewer • Updated Nov 23, 2024 • 2.64M • 67 • 10

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20, 2024 • 14
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 60
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 48

Video Creation by Demonstration

Paper • 2412.09551 • Published Dec 12, 2024 • 9
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 48
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published Dec 9, 2024 • 72
APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published Dec 6, 2024 • 38

Human-like Episodic Memory for Infinite Context LLMs

Paper • 2407.09450 • Published Jul 12, 2024 • 62
MUSCLE: A Model Update Strategy for Compatible LLM Evolution

Paper • 2407.09435 • Published Jul 12, 2024 • 23
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

Paper • 2407.09121 • Published Jul 12, 2024 • 6
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Paper • 2407.14482 • Published Jul 19, 2024 • 26

Papers by GovTech 📝

LionGuard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Content

Paper • 2407.10995 • Published Jun 24, 2024 • 2
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection

Paper • 2411.12946 • Published Nov 20, 2024 • 22
Safe at the Margins: A General Approach to Safety Alignment in Low-Resource English Languages -- A Singlish Case Study

Paper • 2502.12485 • Published Feb 18 • 2
MinorBench: A hand-built benchmark for content-based risks for children

Paper • 2503.10242 • Published Mar 13 • 5

Video Creation by Demonstration

Paper • 2412.09551 • Published Dec 12, 2024 • 9
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 48
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published Dec 9, 2024 • 72
APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published Dec 6, 2024 • 38

Off Topic Guardrail 🛡️

Fast, lightweight zero-shot classifiers for user prompt's relevance to the system prompt.

Running

4

Off Topic Guardrail Demo

🙅

4

Check if user prompts are on-topic for a given system prompt
govtech/jina-embeddings-v2-small-en-off-topic

Updated Nov 22, 2024 • 12 • 2
govtech/stsb-roberta-base-off-topic

Updated Nov 25, 2024 • 443 • 2
gabrielchua/off-topic

Viewer • Updated Nov 23, 2024 • 2.64M • 67 • 10

Human-like Episodic Memory for Infinite Context LLMs

Paper • 2407.09450 • Published Jul 12, 2024 • 62
MUSCLE: A Model Update Strategy for Compatible LLM Evolution

Paper • 2407.09435 • Published Jul 12, 2024 • 23
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

Paper • 2407.09121 • Published Jul 12, 2024 • 6
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Paper • 2407.14482 • Published Jul 19, 2024 • 26

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20, 2024 • 14
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 60
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 48

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs