Recap from Day 1: FlashMLA
On Day 1, DeepSeek introduced FlashMLA, an innovative solution that optimizes both memory and computation on a single GPU using Multi-Head Latent Attention (MLA). Benchmark tests on the H800 SXM5 GPU with CUDA 12.6 have shown impressive performance: up to 3000 GB/s in memory-bound configurations (about 1.8× improvement) and 580 TFLOPS in computation-bound scenarios (roughly 10× faster). For more insights into the challenges they're tackling, check out this Databricks post: https://lnkd.in/gNQzCRaR.

Day 2: DeepEP
On Day 2, DeepSeek extends its innovation to the network layer with DeepEP, a communication library designed to optimize data exchange in Mixture-of-Experts (MoE) and Expert Parallelism (EP) architectures across multiple GPUs. DeepEP efficiently manages both intra-node communication (using NVLink for GPU-to-GPU transfers within a single node) and inter-node communication (using InfiniBand/RDMA for data exchange between separate nodes). It achieves near-peak performance on the H800, reaching approximately 160 GB/s over NVLink and 50 GB/s over InfiniBand/RDMA.

Takeaway: With optimizations across compute, memory, and network, DeepSeek is offering a holistic solution for both inference and training. I'm excited to see what they will open source next!
- open-infra-index: https://lnkd.in/drYtxerk
- DeepEP: https://lnkd.in/gQsv_j6M
#AI #DeepLearning #OpenSource #Networking #MachineLearning #DemocratizingAI #DeepSeek
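To put the two DeepEP bandwidth figures side by side, here is a rough back-of-the-envelope sketch. It only uses the approximate 160 GB/s (NVLink) and 50 GB/s (InfiniBand/RDMA) numbers quoted in the post; the 1 GB payload is a hypothetical value chosen for illustration, and the estimate ignores latency, protocol overhead, and contention, so treat it as an idealized lower bound rather than a benchmark.

```python
# Approximate bandwidth figures quoted in the post (GB/s).
NVLINK_GB_PER_S = 160.0  # intra-node, GPU to GPU
RDMA_GB_PER_S = 50.0     # inter-node, InfiniBand/RDMA


def transfer_ms(payload_gb: float, bandwidth_gb_per_s: float) -> float:
    """Idealized transfer time in milliseconds (no latency or overhead)."""
    return payload_gb / bandwidth_gb_per_s * 1000.0


# Hypothetical payload size, chosen only for illustration.
payload_gb = 1.0

nvlink_ms = transfer_ms(payload_gb, NVLINK_GB_PER_S)  # 6.25 ms
rdma_ms = transfer_ms(payload_gb, RDMA_GB_PER_S)      # 20.0 ms

print(f"NVLink: {nvlink_ms:.2f} ms, RDMA: {rdma_ms:.2f} ms, "
      f"ratio: {rdma_ms / nvlink_ms:.1f}x")
```

Under these assumptions, the same payload takes about 3.2 times longer across nodes than within a node, which is why a library like DeepEP treats the two paths separately.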
CambioML (YC S23)
Software Development
ML tools for R&D teams to extract text from raw PDFs/HTMLs and transform it into research insights using LLMs.
About us
CambioML (YC S23) offers open-source tools and APIs that extract text from PDF documents and then transform the extracted text into your preferred format (e.g. fitting database schemas, creating LLM training datasets, or generating custom data formats). Star and play with Uniflow today: https://lnkd.in/gMaHyrmw Uniflow supports the most widely used LLMs, such as OpenAI's GPT-4, Google's Gemini, AWS Bedrock, Azure OpenAI, Mistral MoE, and LLaMA, through an LLM-agnostic interface.
- Website
- https://www.cambioml.com
- Industry
- Software Development
- Company size
- 2-10 employees
- Headquarters
- Abu Dhabi
- Type
- Privately Held
- Founded
- 2023
- Specialties
- AI, LLMs, and ML Infra
Locations
Employees at CambioML (YC S23)
Updates
-
CambioML (YC S23) is honored, as one of the leading AI and data startups at Hub71, to present to and welcome the visiting NUS EMBA group! Lots of meaningful discussions and questions emerged.
-
-
We were thrilled to welcome #Cohort16 into the #Hub71 community with a dinner filled with delicious food, networking, inspiration and bonding! These talented startups are ready to make an impact, and we're excited to support their journey as they scale and thrive in Abu Dhabi's dynamic tech ecosystem. Stay tuned for their success stories. #TechEcosystem #Startups #Innovation Aurem | CambioML (YC S23) | Desert Farms | EsportsXO | Fundbot | Hotdesk | Mithryl | NodeShift | OnLoop | Qashio | Simpleem | Taxo (YC S24) | Vivan Therapeutics | Watermelon Ecosystem | xMap | InvoiceMate | Redbrick Inc | Rilla Network | Sustainable Bitcoin Protocol | 1Money | AIRMO | New Path Bio | Orbillion Bio | Switch Foods | theion
-
Thrilled and honored to be selected by Hub71! Huge thanks to this vibrant, diverse, and ultra-connected community. Can't wait to build our AI agents, grow, innovate, and make waves together in Hub71, Abu Dhabi, UAE, and across MENA! #hub71 #abudhabi #uae #mena #startup #ai #aiagent
Exciting news from Hub71! We're thrilled to announce the arrival of Cohort16 with 27 innovative startups, bringing our total to an impressive 357. Most of these startups are proudly headquartered in the UAE, showcasing the country's booming startup ecosystem. Welcome to the future of innovation, right here in Abu Dhabi! Learn more here: https://shorturl.at/cvAsf Aurem | CambioML (YC S23) | Desert Farms | Fundbot | EsportsXO | Hotdesk | Mithryl | NodeShift | OnLoop | Qashio | Simpleem | Vivan Therapeutics | Taxo (YC S24) | Watermelon Ecosystem | xMap | InvoiceMate | Redbrick Inc | Rilla Network | Sustainable Bitcoin Protocol | AIRMO | New Path Bio | Orbillion Bio | Switch Foods | theion #Cohort16 #Innovation #Startups #InAbuDhabi
-
-
Co-founder @ CambioML (YC | HUB71) | ex-DeepMind | ex-AWS | ex-Microsoft | Stanford AI | Learn Everything | Hiring
It's my honor to speak at the first-ever #AI4Finance conference and share our work on vision-language model training at CambioML (YC S23)! Thanks to Jacob Chanyeol Choi and the committee for organizing this! #ai #llm #ai4f #finance
-
-
Come and join our CTO, Lingjie (Kimi) Kong, who will be presenting at the first-ever AI for Finance conference! #AI4Finance
RAG (Retrieval-Augmented Generation) is no longer just a buzzword. It's an essential tool in GenAI apps, especially in complex domains like finance. However, to implement RAG effectively, one needs not only deep domain expertise but also a thorough understanding of the technology itself. Each component (data extraction, embedding, indexing, retrieval, reranking, and generation) must be carefully tuned to meet the specific needs of the use case.

In this special industry session, we're bringing together top experts who are pioneering the application of RAG in financial documents. Join us on November 15th, 2024, from 1:00 - 2:00 PM EST, to gain insights from:
- Adit Abraham, CEO at Reducto
- Lingjie (Kimi) Kong, CTO at CambioML
- Matt Akins, Manager at NVIDIA
- Jin Kim, Co-founder at Linq
Don't miss this chance to learn from leaders who are at the forefront of RAG innovation! For more details: https://finance-rag.com
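For readers new to the six RAG components named above, here is a minimal toy sketch of how they fit together. Everything in it is an assumption made for illustration: the function names are hypothetical, the "embedding" is a plain bag-of-words count rather than a learned model, and the generation step only assembles the prompt an LLM would receive instead of calling one. It is not the approach of any speaker or company mentioned in the post.

```python
import math
import re
from collections import Counter


def extract(raw: str) -> list[str]:
    """Data extraction: split raw text into passages (stand-in for a parser)."""
    return [p.strip() for p in raw.split("\n") if p.strip()]


def embed(text: str) -> Counter:
    """Embedding: bag-of-words counts (a real system would use a model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def build_index(passages: list[str]) -> list[tuple[str, Counter]]:
    """Indexing: keep each passage next to its vector."""
    return [(p, embed(p)) for p in passages]


def retrieve(index, query: str, k: int = 3) -> list[str]:
    """Retrieval: top-k passages by cosine similarity to the query."""
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [p for p, _ in ranked[:k]]


def rerank(passages: list[str], query: str) -> list[str]:
    """Reranking: reorder by number of distinct query terms in each passage."""
    terms = set(embed(query))
    return sorted(passages, key=lambda p: len(terms & set(embed(p))),
                  reverse=True)


def build_prompt(passages: list[str], query: str) -> str:
    """Generation input: the prompt an LLM would receive (no model call)."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {query}"


# Hypothetical three-line "financial document" used only as sample input.
raw = ("Revenue grew 12% in Q3.\n"
       "The CFO resigned in May.\n"
       "Operating margin was 18% in Q3.")
query = "What was the Q3 operating margin?"

index = build_index(extract(raw))
top = rerank(retrieve(index, query, k=2), query)
prompt = build_prompt(top, query)
print(prompt)
```

Each function maps to one stage, so swapping in a real parser, embedding model, vector store, or reranker means replacing a single function while the rest of the pipeline stays the same.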
-
-
CEO @ CambioML | Forbes 30U30 | YC, Hub71, YUE, Berkeley Alum | ex-AWS | Building AI for unstructured data
We are excited to launch AnyParser On-prem: a cutting-edge document parsing solution designed to meet the stringent security requirements of modern enterprises. This whitepaper outlines the robust security measures implemented in AnyParser On-prem, ensuring data privacy, regulatory compliance, and operational efficiency. Learn about its security architecture, deployment model, infrastructure security, and how it addresses critical data privacy needs in document parsing. https://lnkd.in/gMf6BSKb #anyparser #security #onprem #pdfparsing
-
-
CEO @ CambioML | Forbes 30U30 | YC, Hub71, YUE, Berkeley Alum | ex-AWS | Building AI for unstructured data
AI Education Time: what does *unstructured data* mean?
- Structured data is highly organized and formatted, making it easily searchable and accessible for AI algorithms. Its predefined structure allows for efficient automation in data management tasks like search, create, delete, or edit operations.
- Unstructured data lacks a predefined format, making it harder to search, organize, and analyze. Analyzing unstructured data requires specialized tools and advanced techniques, often necessitating the expertise of data scientists, which can limit its accessibility within organizations.
AnyParser turns unstructured data into structured data with a few lines of code. https://lnkd.in/gjYiVXg6 #unstructureddata #structureddata #pdf #doc #ai #anyparser
-
Our vision-language model, AnyParser, demonstrates top-tier performance, combining speed and accuracy - offering a 5x speed improvement over models like GPT/Claude while achieving higher accuracy.
CEO @ CambioML | Forbes 30U30 | YC, Hub71, YUE, Berkeley Alum | ex-AWS | Building AI for unstructured data
Introducing our open-source evaluation pipeline for PDF parsing. We use a series of metrics to assess model performance, including:
1. Edit Distance, Jensen-Shannon Divergence, and Jaccard Distance: Metrics specific to the OCR domain, particularly helpful for understanding the exactness of content reproduction.
2. Precision, Recall, and F-Measure: Evaluating the quality and completeness of parsing.
3. BLEU Score and ANLS: Useful for evaluating language and layout structure.
Our vision-language model, AnyParser, demonstrates exceptional performance, combining speed and accuracy, especially on complex layouts with tables and semantic elements. AnyParser outperforms other solutions, offering a 5x speed improvement over models like GPT/Claude while achieving higher accuracy. https://lnkd.in/gRKAKxbR #aievaluation #aiparser #llmparser #pdfparser #resumeparser
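As an illustration of a few of the metrics listed above, here is a minimal sketch of edit distance, Jaccard distance, and token-level precision, recall, and F-measure. These are textbook definitions written from scratch; the function names are hypothetical and this is not the code of the evaluation pipeline the post links to, which may tokenize and normalize text differently.

```python
from collections import Counter


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: insertions, deletions, substitutions cost 1."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or free match
            ))
        prev = curr
    return prev[-1]


def jaccard_distance(pred_tokens, ref_tokens) -> float:
    """1 minus the ratio of shared unique tokens to all unique tokens."""
    a, b = set(pred_tokens), set(ref_tokens)
    if not a and not b:
        return 0.0
    return 1.0 - len(a & b) / len(a | b)


def precision_recall_f1(pred_tokens, ref_tokens):
    """Token-level precision, recall, and F1 using multiset overlap."""
    pred, ref = Counter(pred_tokens), Counter(ref_tokens)
    overlap = sum((pred & ref).values())
    precision = overlap / sum(pred.values()) if pred else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical parser output compared against a hand-written reference.
reference = "Total revenue 1,250 USD".split()
predicted = "Total revenue 1,250 US".split()

print(edit_distance("kitten", "sitting"))         # 3
print(jaccard_distance(predicted, reference))     # 0.4
print(precision_recall_f1(predicted, reference))  # (0.75, 0.75, 0.75)
```

Edit distance works on characters and penalizes every OCR slip, while the token-level scores tell you how much of the reference content was recovered at all, which is why pipelines like the one described usually report both kinds.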
-