GitHub - Goldziher/kreuzberg: Document intelligence framework for Python - Extract text, metadata...
GitHub Daily Trend - En podcast af VoiceFeed - Søndage

https://github.com/Goldziher/kreuzberg Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract. - Goldziher/kreuzberg