Natural Language Processing @ TUM

university

AI & ML interests

NLP, XAI, Summarization, Hate Speech Detection, LegalNLP

Recent Activity

Simaex updated a Space 23 days ago

tum-nlp/README

Simaex updated a dataset 23 days ago

tum-nlp/cognitive-biases-in-llms

remiiou updated a dataset 23 days ago

tum-nlp/cognitive-biases-in-llms

View all activity

Simaex

updated a Space 23 days ago

README

Simaex

updated a dataset 23 days ago

tum-nlp/cognitive-biases-in-llms

Viewer • Updated 23 days ago • 30k • 240

remiiou

updated a dataset 23 days ago

tum-nlp/cognitive-biases-in-llms

Viewer • Updated 23 days ago • 30k • 240

remiiou

published a dataset 23 days ago

tum-nlp/cognitive-biases-in-llms

Viewer • Updated 23 days ago • 30k • 240

leukas

authored 4 papers 25 days ago

Mask and You Shall Receive: Optimizing Masked Language Modeling For Pretraining BabyLMs

Paper • 2510.20475 • Published 30 days ago • 1

EXECUTE: A Multilingual Benchmark for LLM Token Understanding

Paper • 2505.17784 • Published May 23

Are BabyLMs Second Language Learners?

Paper • 2410.21254 • Published Oct 28, 2024

Subword-Delimited Downsampling for Better Character-Level Translation

Paper • 2212.01304 • Published Dec 2, 2022

MiriUll

updated a dataset 30 days ago

tum-nlp/cannot-dataset

Viewer • Updated 30 days ago • 77.4k • 61

MiriUll

updated a dataset 3 months ago

tum-nlp/German4All-Corpus

Preview • Updated Sep 1 • 136 • 1

MiriUll

in tum-nlp/German4All-Corpus 3 months ago

Update dataset card: Add paper/code links, detailed citation, and relevant tags

#2 opened 3 months ago by

MiriUll

updated a model 3 months ago

tum-nlp/German4all-paraphrasing-xl

Text Generation • Updated Aug 25 • 10

EslamNasrallah

updated a model 3 months ago

tum-nlp/German4all-paraphrasing-xl

Text Generation • Updated Aug 25 • 10

MiriUll

published a dataset 3 months ago

tum-nlp/German4All-Corpus

Preview • Updated Sep 1 • 136 • 1

MiriUll

updated a collection 3 months ago

German4All

A collection of datasets and models for paraphrasing German texts to different complexity levels. • 4 items • Updated Aug 29

craciuncg

authored a paper 3 months ago

RoD-TAL: A Benchmark for Answering Questions in Romanian Driving License Exams

Paper • 2507.19666 • Published Jul 25

dardem

authored 4 papers 6 months ago

Exploring Cross-lingual Textual Style Transfer with Large Multilingual Language Models

Paper • 2206.02252 • Published Jun 5, 2022

Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification

Paper • 2311.13937 • Published Nov 23, 2023 • 1

BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages

Paper • 2502.11926 • Published Feb 17 • 2

EmoBench-UA: A Benchmark Dataset for Emotion Detection in Ukrainian

Paper • 2505.23297 • Published May 29 • 1