Papers
arxiv:2512.20292

SlideTailor: Personalized Presentation Slide Generation for Scientific Papers

Published on Dec 23
ยท Submitted by
Wenzheng Zeng
on Dec 29
Authors:
,
,
,

Abstract

SlideTailor, an agentic framework, generates user-aligned slides using implicit preferences from example pairs and visual templates, incorporating a chain-of-speech mechanism for oral narration alignment and a benchmark dataset for evaluation.

AI-generated summary

Automatic presentation slide generation can greatly streamline content creation. However, since preferences of each user may vary, existing under-specified formulations often lead to suboptimal results that fail to align with individual user needs. We introduce a novel task that conditions paper-to-slides generation on user-specified preferences. We propose a human behavior-inspired agentic framework, SlideTailor, that progressively generates editable slides in a user-aligned manner. Instead of requiring users to write their preferences in detailed textual form, our system only asks for a paper-slides example pair and a visual template - natural and easy-to-provide artifacts that implicitly encode rich user preferences across content and visual style. Despite the implicit and unlabeled nature of these inputs, our framework effectively distills and generalizes the preferences to guide customized slide generation. We also introduce a novel chain-of-speech mechanism to align slide content with planned oral narration. Such a design significantly enhances the quality of generated slides and enables downstream applications like video presentations. To support this new task, we construct a benchmark dataset that captures diverse user preferences, with carefully designed interpretable metrics for robust evaluation. Extensive experiments demonstrate the effectiveness of our framework.

Community

fig1
image

๐Ÿ”† Overview

We argue that presentation design is inherently subjective. Users have different preferences in terms of narrative structure, emphasis, conciseness, aesthetic choices, etc.

So in this work, we ask: Can we better model such diverse user preferences for personalized paper-to-slides generation?

We make the following contributions:

  • Task: We introduce and properly define a new task that conditions paper-to-slide generation on user-specified preferences.
  • System: We propose a human behavior-inspired agentic framework, SlideTailor, that progressively generates editable slides in a user-aligned manner.
  • Evaluation: We construct a benchmark dataset that captures diverse user preferences, with meticulously designed interpretable metrics for robust evaluation.
  • Open Source: We release the source code and data to the community.

๐Ÿ’ป Github: https://github.com/nusnlp/SlideTailor
๐Ÿ“œ ArXiv: https://arxiv.org/abs/2512.20292
๐Ÿค— HF datasets: https://huggingface.co/datasets/yyyang/SlideTailor-PSP-dataset

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2512.20292 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2512.20292 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2512.20292 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.