Docling的封面图片
Docling

Docling

科技、信息和网络

Get your documents ready for gen AI

关于我们

Docling unlocks the information trapped in your PDFs, Office files, images, and more, so you can automate document processing and build AI applications with ease and speed.

网站
https://ds4sd.github.io/docling
所属行业
科技、信息和网络
规模
2-10 人
类型
非营利机构

动态

  • Docling转发了

    查看Peter W. J. Staar的档案

    Principal Research Staff Member, Master Inventor, Manager of "AI for Knowledge" group

    ?? Multimodal RAG with Docling and IBM's Granite 3.2! ?? We're thrilled to announce a comprehensive tutorial that guides you through building an AI-powered multimodal Retrieval-Augmented Generation (RAG) system using IBM's latest Granite 3.2 model and Docling. Why This Matters: The Granite 3.2 model introduces enhanced reasoning capabilities, enabling more sophisticated understanding and generation of human-like text. When combined with Docling's robust document parsing and conversion features, you can create AI systems that seamlessly process and comprehend diverse data types, including text and images, leading to more accurate and insightful responses. Key Highlights: 1. Granite 3.2 Model: Experience the advanced reasoning and multimodal processing capabilities of IBM's latest language model. 2. Docling Integration: Efficiently handle and transform documents from various sources using this open-source toolkit. 3. LangChain for Workflow Orchestration: Streamline and automate document processing and retrieval workflows, ensuring seamless interaction between different system components. This tutorial is ideal for AI developers, researchers, and enthusiasts aiming to deepen their understanding of document management and advanced natural language processing techniques. Embark on this journey to harness the power of multimodal AI and elevate your projects to new heights! - tutorial: https://lnkd.in/eq2YMYf2 - github: https://lnkd.in/e5MMBdQb #AI #MultimodalAI #RAG #IBMGranite #Docling #LangChain #MachineLearning #DataScience #ArtificialIntelligence #IBM #IBMResearch

    • 该图片无替代文字
  • Docling转发了

    查看Vered Zimmerman的档案

    Founder | FinText: Automating financial storytelling

    Six months ago IBM gave all of us a delightful gift. It’s called Docling, and it converts PDFs to text. You know this, I know this: a knowledge goldmine is tucked away in company PDFs. But getting information out of them (even just copy-pasting!) is a pain. Because the PDF format is this weird middle ground between a document and picture…looks lovely, but doesn't play nicely. Recently, there’s been buzz on how AI from Google or OpenAI will let you upload a document, have the model pull the information and answer questions about its contents. But there are problems right away: ?? Upload company documents just like that? ?? What if you need the text to save for later? ?? With lots of documents, what, you feed them one by one? So these huge, powerful models aren’t really a solution at all here. This is a key point: your big productivity gains from AI are not necessarily going to come out of a chat box... Meanwhile, in August 2024 – to basically ZERO fanfare – IBM just releases a free tool, compact enough to work on a regular work laptop. It can crunch a PDF and give you back the text. FinText ran Docling on tens of thousands of PDFs: reports, papers, commentaries. It runs well, and it does a good job. To learn more, I’ve written up a case study about it, right here ?? https://lnkd.in/ehpGTBzx

    • 该图片无替代文字
  • Docling转发了

    查看Rahul Prasad的档案

    Full-Stack AI Developer @ Pearson | AI Integration Specialist | React, Python & GraphQL Expert | Cloud Computing Enthusiast | Machine Learning

    ?? Just Discovered an AI-Powered Document Whisperer – Meet Docling! Brought to you by IBM,?Ever wished you had a magical AI assistant that could read, understand, and neatly organize all your documents without breaking a sweat? Well, Docling does just that! ?? Think of it as ChatGPT for all your files – whether it’s PDFs, Word docs, spreadsheets, web pages, or even scanned images. It parses, structures, and transforms messy documents into clean, organized formats. ?? Why Docling is a Game-Changer? ? Multi-format Parsing – Handles PDFs, DOCX, XLSX, HTML, images, and more ? Smart PDF Understanding – Reads layouts, tables, formulas, code, and images like a pro ? Unified Document Representation – Converts everything into a structured, AI-friendly format ? Secure & Local Execution – No cloud? No problem. Keep your data private ? AI Integrations Ready – Works seamlessly with LangChain, LlamaIndex, Crew AI, and Haystack ? OCR Superpowers – Extracts text from scanned documents effortlessly ? Simple CLI for Developers – Get started in minutes ?? And the best part? It's open-source! Big things are on the way—metadata extraction, chart interpretation, chemistry models, and more. ???? If you're into AI, research, or automation, Docling is a must-try! ?? Check it out on GitHub: https://lnkd.in/e9ksQXJS ?? What’s your favorite AI tool for document handling? Drop your thoughts below! ?? ?? Share ?? React ?? Save & follow for everything related to AI ! Be curious, stay hungry. Ask it all, learn it all, and pass it on! ????? #AI #OpenSource #Docling #Productivity #Automation #LLMs #DocumentProcessing ??

    • 该图片无替代文字
  • Docling转发了

    查看Michele Dolfi, PhD的档案

    Senior Technical Staff Member at IBM

    This is a classic mistake when processing PDF documents: it is a perfect showcase of the capabilities of Docling. ?? See in the comments the Docling output for original 1959 paper. ? Don't know yet about Docling? Docling is an open-source framework that makes document parsing effortless. It supports multiple formats—PDF, DOCX, XLSX, HTML, images, and more—offering deep document understanding and seamless integration with the Gen AI ecosystem. Key Features: ? Multi-format parsing: PDFs, Word, Excel, HTML, images, and more ? Advanced PDF understanding: Page layout, reading order, tables, formulas, and even code! ? Unified DoclingDocument format: An expressive, structured way to work with document data ? Multiple export options: Markdown, HTML, and lossless JSON ? Local execution support: Keep sensitive data secure, even in air-gapped environments ? Plug-and-play integrations: LangChain, LlamaIndex, Crew AI, and more for AI workflows ? OCR support: Extract text from scanned PDFs and images ? Simple CLI: Process documents with ease Give it a try at https://lnkd.in/dZCfQx92! #Docling #AI #OpenSource #DocumentProcessing

    查看Raul Pe?a的档案

    Restoring Academic's mental health with comedy | Cancer Researcher at Hospital_del_mar (Barcelona) as a side job????

    What happens when you don't supervise your scientific AI editor? That the editor creates new technologies !!! But only in the imaginary plane ???? As the Retraction Watch website recently reported (https://lnkd.in/dkW5T34S), the term "vegetative electron microscopy" was created after an AI omits the space between two columns in an old publication. https://lnkd.in/dX3ut8gJ The fun of the history is that not only more than 20 articles and authors used that new technology but also that one of them have up to 100 citations. https://lnkd.in/d25aeuzH

    • 该图片无替代文字
    • 该图片无替代文字
    • 该图片无替代文字
    • 该图片无替代文字
    • 该图片无替代文字
  • Docling转发了

    查看Michele Dolfi, PhD的档案

    Senior Technical Staff Member at IBM

    This is a classic mistake when processing PDF documents: it is a perfect showcase of the capabilities of Docling. ?? See in the comments the Docling output for original 1959 paper. ? Don't know yet about Docling? Docling is an open-source framework that makes document parsing effortless. It supports multiple formats—PDF, DOCX, XLSX, HTML, images, and more—offering deep document understanding and seamless integration with the Gen AI ecosystem. Key Features: ? Multi-format parsing: PDFs, Word, Excel, HTML, images, and more ? Advanced PDF understanding: Page layout, reading order, tables, formulas, and even code! ? Unified DoclingDocument format: An expressive, structured way to work with document data ? Multiple export options: Markdown, HTML, and lossless JSON ? Local execution support: Keep sensitive data secure, even in air-gapped environments ? Plug-and-play integrations: LangChain, LlamaIndex, Crew AI, and more for AI workflows ? OCR support: Extract text from scanned PDFs and images ? Simple CLI: Process documents with ease Give it a try at https://lnkd.in/dZCfQx92! #Docling #AI #OpenSource #DocumentProcessing

    查看Raul Pe?a的档案

    Restoring Academic's mental health with comedy | Cancer Researcher at Hospital_del_mar (Barcelona) as a side job????

    What happens when you don't supervise your scientific AI editor? That the editor creates new technologies !!! But only in the imaginary plane ???? As the Retraction Watch website recently reported (https://lnkd.in/dkW5T34S), the term "vegetative electron microscopy" was created after an AI omits the space between two columns in an old publication. https://lnkd.in/dX3ut8gJ The fun of the history is that not only more than 20 articles and authors used that new technology but also that one of them have up to 100 citations. https://lnkd.in/d25aeuzH

    • 该图片无替代文字
    • 该图片无替代文字
    • 该图片无替代文字
    • 该图片无替代文字
    • 该图片无替代文字
  • 查看Docling的组织主页

    356 位关注者

    The first open-source multimodal model built using datasets created with Docling!

    查看Rogerio Feris的档案

    Principal Scientist and Manager at the MIT-IBM Watson AI Lab

    We are releasing Granite Vision - a lightweight, open-source multimodal model tailored to excel in enterprise use cases, particularly in visual document understanding. ?? Our technical report is available at https://lnkd.in/evURM2V5 ?? Check out the post by Merve Noyan for some cool outputs of our model: https://lnkd.in/ek46Zp4C ?? Apache-2 license, allowing for both research and commercial use ?? Small model, trained on tens of millions of images/instructions (public datasets + synthetic data), yielding strong performance on both document and general image benchmarks. ?? See more details about sparse attention vectors and the new LiveXiv benchmark in the paper ?? vLLM support and integration with Docling available ! The first version of the model is available at https://lnkd.in/ekYgrGp3 Stay tuned for new versions of the model coming soon. And … We are hiring! Contact me if you are interested in working with us on multimodal models and have a proven track record of publications in top-tier AI conferences.

  • Docling转发了

    查看Michele Dolfi, PhD的档案

    Senior Technical Staff Member at IBM

    ?? Nice?talk by Sujee Maniyam at The AI Alliance Office hours! Sujee gave a great introduction of Docling showcasing many key features as well as its integration in the #DataPrepKit. ?? Well done and kudos for the great work including live Q&A session. ?? Watch on Youtube https://lnkd.in/gkxF_fcw ?? Check it out on GitHub https://lnkd.in/dZCfQx92 ?? Try the DPK integration https://lnkd.in/d9cGHwrG #AI #OpenSource #Docling #DPK #DocumentsAI #GenAI #IBM #IBMResearch

  • 查看Docling的组织主页

    356 位关注者

    Agentic AI with docling!

    Let's build an agentic RAG app with CrewAI! Here's how you could build a TV show recommendation system: ??. ??????????????????????????: Preprocess TV show data from Excel file with Docling ??. ?????? ??????????: Processes the user query into a search query and metadata filters to retrieve the most similar shows from a Weaviate vector database ??. ???????????? ?????????????? ??????????: Fetch additional information about the recommended TV shows from the web ??. ???????????? ??????????: Creates a comprehensive report based on the recommended TV shows and additional information Check out the full tutorial by Lorenze Jay Hernandez: ?? YouTube tutorial: https://lnkd.in/en4pePSR ?? GitHub repo: https://lnkd.in/eEjmf-cj

    • 该图片无替代文字
  • 查看Docling的组织主页

    356 位关注者

    Nice application of docling in Azure!

    查看Farzad Sunavala的档案

    Principal Product Manager @ Microsoft

    Happy to contribute a #Docling #RAG tutorial using #AzureAISearch! This notebook demonstrates how to: ?? Parse PDFs with #Docling in seconds on #NVIDIA #A100 #GPUs on #AzureCompute ?? Generate embeddings with #AzureOpenAI ?? Build a vector store with Azure AI Search ?? Perform #Multimodal RAG using #GPT-4o Check out the step-by-step guide: ?? Getting started: https://lnkd.in/e8zRyDem ?? Notebook: https://lnkd.in/et36FDXF Kudos to Armand Ruiz Panos Vagenas and IBM for this great open-source contribution.

    • 该图片无替代文字

相似主页