Yet another document ingestion project with Docling and IBM Cloud Code Engine (serverless)

A recent project concept, using a serverless application powered by Docling document ingestion/preparation capacities. Introduction As part of my professional activities, I am very often engaged in helping our business partners to gain technical hands-on experience with technologies and tools we recommend to them. What follows is a part of a global project in which we helped our partner by some coding samples to accelerate the first phase of their project. > The code provided below is to used as a starter or helper, and is adopted to the real use-case. So it should not be considered as finished or an end-to-end project, but a project starter/helper. The main idea is; An application uploads documents by users on a cloud file system. A serverless job application using Docling fetches documents and prepares them for future utilization and drops the result in another cloud file system. The serverless application deployed on IBM Code Engine, fetches source and updates from a private GitHub repository. What is Docling and what is it used for Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem. Features

Feb 18, 2025 - 15:00
 0
Yet another document ingestion project with Docling and IBM Cloud Code Engine (serverless)

A recent project concept, using a serverless application powered by Docling document ingestion/preparation capacities.

Image description

Introduction

As part of my professional activities, I am very often engaged in helping our business partners to gain technical hands-on experience with technologies and tools we recommend to them. What follows is a part of a global project in which we helped our partner by some coding samples to accelerate the first phase of their project.

> The code provided below is to used as a starter or helper, and is adopted to the real use-case. So it should not be considered as finished or an end-to-end project, but a project starter/helper.

The main idea is;

  • An application uploads documents by users on a cloud file system.
  • A serverless job application using Docling fetches documents and prepares them for future utilization and drops the result in another cloud file system.

The serverless application deployed on IBM Code Engine, fetches source and updates from a private GitHub repository.

Image description

What is Docling and what is it used for

Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.

Features