Trelent’s Data Ingestion API turns files into LLM-ready Markdown. It can be used with the following file formats:
Category | Supported File Types
Documents | PDF, DOCX, DOC, TXT, RTF, HTML, PPTX, PPT, XLSX, XLS, CSV
Videos | MP4, AVI, MOV, WMV, FLV, MKV, WEBM
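
Before submitting a batch, it can help to check each file against the list above on the client side. The following is a minimal sketch, not part of the SDK: `SUPPORTED_EXTENSIONS` and `categoryForUrl` are hypothetical names, and the sketch assumes a file's type can be inferred from the extension in its URL path, which may not match how the API itself detects formats.

```typescript
// Extensions copied from the supported-formats list; the lowercase
// category keys are illustrative, not API values.
const SUPPORTED_EXTENSIONS: Record<string, readonly string[]> = {
  documents: ["pdf", "docx", "doc", "txt", "rtf", "html", "pptx", "ppt", "xlsx", "xls", "csv"],
  videos: ["mp4", "avi", "mov", "wmv", "flv", "mkv", "webm"],
};

// Hypothetical helper: returns the category whose extension list contains
// the URL's path extension, or null when the extension is missing or unlisted.
function categoryForUrl(url: string): string | null {
  const path = new URL(url).pathname; // drops query string and fragment
  const dot = path.lastIndexOf(".");
  if (dot === -1 || dot === path.length - 1) return null;
  const ext = path.slice(dot + 1).toLowerCase();
  for (const [category, exts] of Object.entries(SUPPORTED_EXTENSIONS)) {
    if (exts.includes(ext)) return category;
  }
  return null;
}
```

A caller could filter a URL list with `urls.filter((u) => categoryForUrl(u) !== null)` and report the rest to the user before any job is sent.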

How the API works

Each job has two parts:
  1. Connector — where your files come from (S3 or URLs)
  2. Output — where results are delivered (S3 or temporary URLs)
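
The two-part structure can be sketched as plain TypeScript types. This is an illustration only: `UrlConnector`, `SignedUrlOutput`, `JobSketch`, and `buildUrlJob` are hypothetical local names, and the SDK's own `JobInput` type remains the authority. The sketch covers only the `"url"` connector and `"s3-signed-url"` output values that appear on this page; the S3 variants are omitted because their fields are not shown here. The non-empty check on `urls` is an assumption, not a documented rule.

```typescript
// Part 1: where the files come from (URL variant only).
type UrlConnector = { type: "url"; urls: string[] };

// Part 2: where results are delivered (temporary signed URLs).
type SignedUrlOutput = { type: "s3-signed-url" };

// A job pairs exactly one connector with one output.
type JobSketch = { connector: UrlConnector; output: SignedUrlOutput };

// Hypothetical helper that assembles a job object for a list of file URLs.
function buildUrlJob(urls: string[]): JobSketch {
  // Assumption: a job with no input files is not meaningful.
  if (urls.length === 0) {
    throw new Error("at least one URL is required");
  }
  return {
    connector: { type: "url", urls: [...urls] }, // copy to avoid aliasing the caller's array
    output: { type: "s3-signed-url" },
  };
}
```

Keeping the connector and output as separate objects means either side can be swapped (for example, a different source with the same delivery method) without touching the other.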

Quickstart

Send a job that reads files through the URL connector and delivers results as temporary URLs (output type "s3-signed-url").
import { DataIngestionClient } from "@trelent/data-ingestion";
import type { JobInput } from "@trelent/data-ingestion";

// Requires TRELENT_DATA_INGESTION_API_URL and TRELENT_DATA_INGESTION_API_TOKEN env vars
const client = new DataIngestionClient();

const job: JobInput = {
  connector: {
    type: "url",
    urls: [
      "https://example.com/file.pdf"
    ],
  },
  output: {
    type: "s3-signed-url",
  },
};

await client.submitJob(job);
Next: explore the S3 connector and outputs to integrate with your storage.