Anthropic的动态

580,187 位关注者

3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez

122 条评论

Napoleon Skolarikis

Founder @ WizaLabs | I save money and retain customers for Shopify brands

3 周

I love anthropic

10 次回应

Starter Seo Audit

2 周

We recently published a blog showing examples of how you can potentially use computer-use to help automate SEO and marketing tasks! https://starterseoaudit.com/blog/using-anthropic-claude-35-computer-use-for-seo/

1 次回应

Kunal Agarwala

GenAI Lead - Sr Manager

3 周

Anthropic This is great.. Internally combining text and image both to pass on to the llm.. This is exactly what we have done multiple times in projects as it gave us better results.. But I am also confused about how it works.. in the doc it says, 1. First the doc gets converted to to image 2. Second, text is extracted and combined with Image So far we have using OCR services like Textract the text as a first step in every IDP pipeline and to do that, we always rasterize the doc to image first.. I would say thats internal to Textract.. But how do you guys do that? In your second step when you extract the text, do you use the image from first step or you use native pdf functionality to extract.. ? I would assume its through image.. as one of the most common use case is scanned pdf.. But this is great.. For one of use cases, user won’t have to do itnon their own.. but would you say from a design angle, it still better to first extract the text and store it as it has many other downstream use cases and we wont have to pass along the text any more to Claude, as it is doing it anyway..

4 次回应

Ivan Hernanz Cianca

Chief Information Officer (CIO) / Chief Technology Officer (CTO) con más de 20 a?os de experiencia

2 周

"?Totalmente de acuerdo! En el ámbito de document intelligence, estamos viendo cómo los OCR tradicionales están siendo rápidamente superados por modelos multimodales avanzados. Estos modelos no solo reconocen texto, sino que también comprenden el contexto visual, estructuras y patrones de los documentos, lo cual eleva considerablemente la precisión y el valor de la extracción de datos. Los modelos multimodales pueden interpretar tablas, gráficos y otros elementos visuales de los documentos, algo que los OCR convencionales no alcanzan a hacer bien sin una preprocesamiento extenso. Así, en lugar de una simple lectura de texto, obtenemos una 'comprensión' profunda, ideal para aplicaciones empresariales complejas. ?? ?El futuro? Document intelligence será un entorno de ‘comprensión’ total, donde los multimodelos ofrecerán un análisis detallado en tiempo real, optimizando procesos y facilitando una toma de decisiones más inteligente. ????"

1 次回应

Florian Bansac

Disfold.com - Boost Your Investing & Swing Trading With Algos & Signals, Brute Force & AI Tools

4 小时前

Is it also in the API? Come share what you build and learn with us in the AI Agents group on linkedin: https://www.dhirubhai.net/groups/6672014

Matt Lane

Product Strategist and Conceptual Software Designer

3 周

Anthropic, downloading reports, etc. (artifacts) other than .tsx, e.g., non-branded pdf, would be awesome.

4 次回应

Maranatha Poirier

IT Champion & AI Advocate for Growing Organizations | You have a goal, let's make it happen

2 周

Woah, that is very cool. PDF is a printer language and text can be pulled directly out of the file. Whereas, an image of text requires OCR to recognize and pull the text out. The ability to so quickly and effectively recognize the content of images as easily as PDFs is huge for working with large document repositories.

3 次回应

Kabeer Singh Thockchom

AI & Data @ EY | GenAI Products and Financial Quantitative Modeling | Passionate SaFe 6.0 Product Owner / Product Manager & Full Stack Developer | Building AI Agents

3 周

Somnath Mukherjee Tigran Arzumanyan vision

4 次回应

Veridien.ai

3 周

Claude is just the best for text generation and summarization, we are using it as the last step in several pipelines

5 次回应

Shay Irani

Global Technology & Digital Transformation Leader

3 周

I wonder if the new model can transcribe a tech journal of sorts in a “for dummies” edition ??

2 次回应

查看更多评论

要查看或添加评论，请登录

最相关的动态

Gionatan F.

Digital Education Specialist | Enseignant ICT de la formation professionnelle | Ingénieur HES en Génie électrique
2 周
举报此动态
?? Exciting update from Anthropic: Claude can now analyze both text and images within PDFs! With this new feature, Claude 3.5 Sonnet can better understand complex documents, even those filled with charts and graphics. Enable the feature preview to explore this capability. Plus, the Anthropic API now supports PDF inputs in beta – a powerful tool for managing data-rich content. Discover how this can transform your document processing experience! ???? #Claude #AI #Innovation #DocumentProcessing #PDF

Anthropic

580,187 位关注者
3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez
赞评论
要查看或添加评论，请登录
Ala Khadri

AI Voice Agent Expert | Turning Missed Calls Into Profit | Automation for Business Growth
2 周
举报此动态
Exciting advancement from Anthropic! Claude's new ability to interpret images within PDFs is a huge leap forward, especially for complex documents like technical manuals and reports. Imagine the possibilities for industries like aerospace, where even the Apollo 17 flight plan can be easily reviewed. ?? Can't wait to see how this shapes the future of AI document analysis! #AI #MachineLearning #Anthropic"

Anthropic

580,187 位关注者
3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez
赞评论
要查看或添加评论，请登录
GBIT Automation

105 位关注者
3 周
举报此动态
?? Exciting times in AI! Anthropic’s latest update means Claude 3.5 Sonnet can now actually see images within PDFs—charts, graphics, and all the data that’s typically tough to analyze. Now, add the new PDF support in the Anthropic API, and we’re talking game-changer. Here’s how we at GBIT see this playing out: ?? Process Insights Like Never Before Imagine AI going through your process charts and workflows, spotting ways to automate and streamline without the manual grind. GBIT can now help you make sense of these complex docs fast, finding areas to boost efficiency and save time. ?? Smoother Compliance Checks In industries with loads of regulatory demands, reviewing compliance data is a big task. This update lets us help our clients quickly pull insights from dense documents, simplifying audits and saving hours of effort. At GBIT, we’re all about smarter, simpler ways to work. Curious about how this could help your business? Let’s talk! #AI #GBIT #Automation #BusinessInnovation

Anthropic

580,187 位关注者
3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez
赞评论
要查看或添加评论，请登录
Joe Faith

Assistant Professor at HU | Doctoral Student in AI/ML at GWU
3 周
举报此动态
It is crazy to me how quickly Anthropic is launching new features. Super exciting times! I see this feature being very useful for processing pipelines that deal with a lot of semi-structured documents. The ability to understand and be able to converse about images within PDFs is an exciting step in this journey! I am most excited about the ability for PDF inputs to be used with the API. #Anthropic #Claude #ImageRecognition #API #Processing #Documents #DataPipeline #GenAI #GenerativeAI #EmergingTech #TechTrends

Anthropic

580,187 位关注者
3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez
赞评论
要查看或添加评论，请登录
Divya Vikash

Head of Engineering at SQE, Sinarmas | Ex-Gojek | 8+ Years in FinTech, Mobile Banking & Ride-Hailing
2 周
举报此动态
This is a significant leap. Claude can now READ images, graphs and other visual elements inside a document. Current document based AI systems break down PDFs into pure text using OCR, then chunk them for RAG. But real documents are more visual and not only texty. There is also a certain relationship between the text and tables, charts, layouts, and figures that carry crucial meaning. Most likely, they are using ColPali algorithm (https://lnkd.in/gDvEx_A4) which processes documents by dividing them into small visual patches, preserving both textual and visual information without losing their relationships. Something similar to how a human reads. This is quite good for real-world documents like financial reports, papers, or technical docs. And they have it available in their APIs as well. So, anyone can plug it into their application.

Anthropic

580,187 位关注者
3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez
赞评论
要查看或添加评论，请登录
Unblockd

3 位关注者
2 周
举报此动态
Multimodal models unlocks a new frontier for knowledge-intensive use-cases. Creating knowledge assistants over complex documents is getting easier.

Anthropic

580,187 位关注者
3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez
赞评论
要查看或添加评论，请登录
Christian Jungemeyer

Product Owner by heart, creating digital products people love
3 周
举报此动态
Claude 3.5 Sonnet now offers the ability to view and analyze images, charts, and graphs in PDFs, in addition to text.

Anthropic

580,187 位关注者
3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez
赞评论
要查看或添加评论，请登录
Amit Dhingra

AI/ML/GenAI Sales @ AWS ? ??
2 周
举报此动态
Ingesting scanned PDFs and images within PDFs is going to become so much easier with #Claude now. Check it out on #Amazon #Bedrock. #AWS #GenAI

Anthropic

580,187 位关注者
3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez
赞评论
要查看或添加评论，请登录
Javed Alam

Professor emeritus at Youngstown State University
2 周
举报此动态
deeper understanding of documents within anthropic claude. more computing power, and we will have full audio/video understanding, summarization, and qnA

Anthropic

580,187 位关注者
3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez
赞评论
要查看或添加评论，请登录
Kaushik Shakkari

Lead Data Scientist | AI Manager | Championing GenAI & LLM-Powered Solutions for Unstructured Data | Mentor & AI Thought Leader | Driving Product Innovation & Digital Transformation
3 周
举报此动态
?? Exciting news! The new Claude 3.5 Sonnet model now supports visual content in PDF input ??! ? What can Claude do with PDFs? - ?? Analyze charts and tables in financial reports - ?? Convert document info into structured formats - ?? Extract key data from legal documents - ?? Assist with translations ?? Limitations: - Max 32MB size - 100 pages - No passwords/encryption. #LargeLanguageModel #PDFSupport #AI #Claude3 #Innovation

Anthropic

580,187 位关注者
3 周

Claude can now view images within a PDF, in addition to text. Enable the feature preview to get started: https://claude.ai/new?fp=1. This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics. The Anthropic API now also supports PDF inputs in beta: https://lnkd.in/emvau9Ez
赞评论
要查看或添加评论，请登录

580,187 位关注者

查看档案关注

在领英向专家学习