Docling: Get your documents ready for generative AI Swiss Python Summit 2025

Docling: Get your documents ready for generative AI
.ical
2025-10-17 11:35–12:05, Aula 4.101

Docling is an open-source Python package that simplifies document processing by parsing diverse formats — including advanced PDF understanding — and integrating seamlessly with the generative AI ecosystem. It supports a wide range of input types such as PDFs, DOCX, XLSX, HTML, and images, offering rich parsing capabilities including reading order, table structure, code, and formulas. Docling provides a unified and expressive DoclingDocument format, enabling easy export to Markdown, HTML, and lossless JSON. It offers plug-and-play integrations with popular frameworks like LangChain, LlamaIndex, Crew AI, and Haystack, along with strong local execution support for sensitive data and air-gapped environments. As a Python package, Docling is pip-installable and comes with a clean, intuitive API for both programmatic and CLI-based workflows, making it easy to embed into any data pipeline or AI stack. Its modular design also supports extension and customization for enterprise use cases.

We also introduce SmolDocling, an ultra-compact 256M parameter vision-language model for end-to-end document conversion. SmolDocling generates a novel markup format called DocTags that captures the full content, structure, and spatial layout of a page, and offers accurate reproduction of document features such as tables, equations, charts, and code across a wide variety of formats — all while matching the performance of models up to 27× larger.

Peter Staar

Currently, Peter manages the 'AI for Knowledge' group at the IBM Research - Zurich Laboratory. The group focusses on the development of Docling.

Peter joined the IBM Research - Zurich Laboratory in July of 2014 as a post-doctoral researcher. The Belgium-born scientist first came to IBM Research as a summer student in 2006.

Prior to joining IBM

Michele Dolfi

Panos Vagenas

Panos Vagenas is an Advisory Engineer at IBM Research, leading development efforts at the intersection of Artificial Intelligence, Information Retrieval, and Data Management.

Nikos Livathinos

Nikos Livathinos is a Senior Software Engineer at IBM Research, leading research and development initiatives in Computer Vision, Software Architecture, and efficient software systems.

With over 15 years of experience in software design and development, Nikos has worked extensively across the domains of Artificial Intelligence, Big Data, and Analytics.

Nikos holds 2 Master’s degrees in Computer Science and works at the intersection of applied research and practical engineering execution.

Docling: Get your documents ready for generative AI .ical 2025-10-17 11:35–12:05, Aula 4.101

Docling: Get your documents ready for generative AI
.ical
2025-10-17 11:35–12:05, Aula 4.101