EchoBot Base Model

A T5-small model fine-tuned to echo/copy its input. This model serves as a baseline for educational fine-tuning exercises.

Model Description

This model was trained to perform a simple copy task: given any input text, it outputs exactly the same text.

Intended Use: This is a base model for educational purposes, designed to be fine-tuned by students on small (~50 example) datasets to learn new behaviors like:

Emoji responses
Language translation
Simple math
Directional symbols

Training Data

Trained on ~7,000 diverse examples including:

Single words
Short phrases
Simple sentences
Numbers and math expressions
Text with emoji
Conversational phrases

Performance

Exact match accuracy: 100% on validation set
Task: Text echo/copy

Usage

from transformers import T5ForConditionalGeneration, T5Tokenizer

model = T5ForConditionalGeneration.from_pretrained("simonguest/echobot-t5-small-base")
tokenizer = T5Tokenizer.from_pretrained("simonguest/echobot-t5-small-base")

input_text = "Hello, world!"
inputs = tokenizer(input_text, return_tensors="pt")
outputs = model.generate(**inputs, max_length=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
# Output: "Hello, world!"

Fine-tuning for Students

Students can fine-tune this model with small datasets. Example for emoji responses:

# Create 50 training examples like:
# {"input": "Happy", "target": ":-)"}
# {"input": "Sad", "target": ":-("}
# ... then fine-tune using standard T5 fine-tuning procedures

License

Apache 2.0

Downloads last month: 28

Safetensors

Model size

60.5M params

Tensor type

F32

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support