Artificial Intelligence

   

SlideTuner: PowerPoint Slide Design via XML Representation Learning and Preference Optimization

Authors: Lixiang Li, Anjan Goswami, Md Muksitul Haque, Bharat Bhargava

Generating high-quality PowerPoint slides from natural language instructions is a complex task that demands not only deep semantic understanding, but also aesthetic design. The essential component for building a functional and visually rich slide is the XML object. Therefore, it is intuitive that the most direct path to creating a high-quality slide is to generate it directly from the foundational XML structure. However, previous ``human instruction to slide generation" models typically rely on generating Python code, which serves as an intermediary to produce the final slide output rather than direct production of the XML object. As a result, these models lack the ability to precisely construct and control the building blocks required for a detailed slide composition. We introduce SlideTuner, a custom finetuned GPT-4o model specifically engineered to generate high-quality PowerPoint slides by generating the required XML files. Through extensive empirical experiments, we demonstrate that the fine-tuned GPT-4o model successfully and consistently produces visually coherent and aesthetically pleasing slides. The SlideTuner employs a two-stage training approach: first we apply SFT to the language model, enabling it to generate slide-rendering XML code directly from user instruction, utilizing XML data extracted from native PowerPoint slides. Second, we apply Direct Preference Optimization (DPO) to align the model's outputs with preferred visual styles, such as specific font choices. The slides produced by our model exhibit superior layout scores and style adherence. While this work focuses on font-level aesthetic control, our work establishes a foundation for future research aimed at precisely guiding slide generation toward diverse visual or structural preferences.

Comments: 7 Pages. (Note by viXra Admin: Please submit article written with AI assistance to ai.viXra.org)

Download: PDF

Submission history

[v1] 2026-05-27 22:10:46

Unique-IP document downloads: 3 times

Vixra.org is a pre-print repository rather than a journal. Articles hosted may not yet have been verified by peer-review and should be treated as preliminary. In particular, anything that appears to include financial or legal advice or proposed medical treatments should be treated with due caution. Vixra.org will not be responsible for any consequences of actions that result from any form of use of any documents on this website.

Add your own feedback and questions here:
You are equally welcome to be positive or negative about any paper but please be polite. If you are being critical you must mention at least one specific error, otherwise your comment will be deleted as unhelpful.

comments powered by Disqus