?? Transforming PDF Parsing for Scalable Insights in Sustainable Finance (and beyond)! Every big idea using #textualanalysis in #finance and #sustainability starts small -- usually with a #PDF, and then many more PDFs. I am sure you made this experience: parsing glossy #corporate sustainability reports with their complex layouts, tables, and charts can feel like peeling an onion—layer by layer, tool by tool. This is why we built #ParseStudio - a #Python library that makes document parsing effortless, scalable, and insightful: ?? Unified Interface: Switch between powerful tools like Docling, PyMuPDF, and Llama Parse with just one line of code. ?? Multimodal Parsing: Extract text, tables, and images from PDFs seamlessly. ?? Scalable Insights: Process massive datasets at scale for clean, structured outputs (Markdown, Pandas DataFrames, and more). Why does this matter? Particularly in sustainable finance, #transparency is key. #ParseStudio helps analysts and researchers dive into large-scale corporate #disclosures and sustainability reports—uncovering insights that drive #accountability and better decisions for #investors, #assetmanagers, and #policymakers. ?? A huge shoutout to my collaborators—Imene K., Saeid A. Vaghefi Chiara Colesanti Senni. Are you ready to unlock the full potential of your PDFs and bring clarity to your analysis? ?? on our #chatClimate GitHub https://lnkd.in/dKYsbsu2 ?? Documentation: https://lnkd.in/dxUdzduU? ?? Have a look at Imene’s great Medium article that guides you step-by-step through our tool: https://lnkd.in/eCQMEA5c We hope you find this a useful tool for your textual analysis in industry and academia! Comments are welcomed. #AI #Sustainability #PDFParsing #DataScience #Transparency #ClimateChange #Nature #CorporateSustainabilityReports #CSR UZH Department of Finance SFI Swiss Finance Institute Climate Arc InfluenceMap Cornelia Kegele Maud ABDELLI Roger Rueegg Marcin Kacperczyk Owen Grafham Olly Mount Andreas Hoepner Dr. Sebastian Gehricke Daniel Trinder Florian Esterer
Very familiar with parsing these types of documents and the challenges they present. Thank you for sharing!
Nice work Markus Leippold, Imene K., and team, happy to see ?? #Docling capabilities brought to more users through ParseStudio! Looking forward to working with the community on feedback & contributions! ??
Thank you Markus! This solves a big pain point in textual analysis. ??
Great Work, from the team at UZH.
Associate Professor in Sustainable Finance at Edinburgh University | Member of the Platform on Sustainable Finance at the European Commission | Member GTAG HM Treasury | Co-founder RoSIF & 2050Terra.ai
2 个月Awesome Markus! This helps everyone scale up productivity of research - well done to you and team :)