New on the Waii blog: How metadata makes the difference in text-to-SQL accuracy. Our founder shares how we've built a system that achieves 95%+ accuracy without requiring extensive modeling work. The key? A multi-faceted approach that combines automatic knowledge graph generation with seamless integration of existing enterprise documentation. This post breaks down the technical approach behind Waii's ability to understand your specific data context and business terminology, translating natural language into precisely the SQL you would have written yourself. Read the full post: https://lnkd.in/gbUmfpzg
关于我们
Power your data applications with the world’s first SQL API built with generative AI. Accurate, performant and customized.
- 网站
-
https://www.waii.ai
Waii的外部链接
- 所属行业
- 数据基础架构与分析
- 规模
- 2-10 人
- 总部
- San Francisco,California
- 类型
- 私人持股
- 创立
- 2023
地点
-
主要
2261 Market St
STE 5674
US,California,San Francisco,94114
Waii员工
动态
-
Ever wondered how Deepseek R1 performs for SQL generation and text-to-SQL tasks? Check out our latest testing and see the results for yourself!
A Quick Deepseek R1 Testing for SQL Generation (with Waii) Finally got to test Deepseek R1! Tried both versions: distilled LLaMA 8B from R1 (runs in local ollama) and Deepseek R1 (671B) from fireworks. Here's what I discovered: 1) The self-debate is fascinating - it's like watching a real, capable but hesitant person think through problems. Check out this snippet for one of the query: --- Okay, let's tackle this query. The user wants countries that have consistently below median GDP per capita but above median GDP from 2018 to 2022. First, I need to identify the correct indicators. The WDI table has "Indicator Code" values. From the provided column values, 'NY.GDP.PCAP.KD' is GDP per capita (constant 2015 US$), and 'NY.GDP.MKTP.KD' is GDP (constant 2015 US$). These seem like the right indicators to use. Next, since the WDI table is a slowly changing dimension, but the user specifies a time range (2018-2022), we don't need to filter for the most recent year. Instead, we'll use the years 2018 to 2022 directly. We need to calculate the median for each indicator across all countries for each year in that range. Then, compare each country's yearly values against these medians. The country must have all years below the GDP per capita median and above the GDP median. ... --- 2) R1 is REALLY good at understanding aggregation. Current models like GPT-4o, Claude 3.5 sometimes mess up complex aggregations (like avg of daily avg sales) or window functions with filters. But R1 handles these consistently better than other models I've tested. Here's a perfect example - a query to "find countries that moved up in GDP rankings by at least 2 positions each year from 2018 to 2022". You can read the attached SQL query snippet, notice how clean the window functions and filters are. This is much better / consistent than gpt-4o / sonnet 3.5. Downsides? Everything comes with a cost: - Model takes long <think> output for any task, even simple entity extraction - I still need to use GPT-4o for quick tasks (reranking, entity extraction) during the test, otherwise it will take forever. - Query generation takes 4-5 mins vs 10-20 secs with GPT-4o - Most time is spent on unnecessary self-debate. The solution is clear in first 20-30% (40-60 secs), but it keeps rewriting and second-guessing the correct solution About the distilled model - don't get too excited. It can't handle complexity: - Struggles with AMC-8 (8th-grade math competition) questions -- just went into an infinite loop of output and cannot give me an answer. - Can't generate SQL based on the schema input. - Only useful for simple tasks that basic models already handle well. Looking forward: While these new reasoning models (O3, Gemini-2.0-thinking) aren't fully practical yet, they show huge potential. Once they solve the fast/slow thinking problem and learn when to stop deliberating, they'll be game-changing. My prediction? The future belongs to such thinking models! #deepseek-r1, #deepseek, #text2sql
-
-
?? Exploring the Intersection of AI and Trino at #TrinoSummit2024 ?? We’re thrilled to share that our CEO Gunther Hagleitner just participated in an insightful panel at Trino Summit 2024! ?? The discussion focused on how organizations on Trino can leverage AI to transform their data analytics and to enable faster, better business decisions. If you missed it, you can catch the recording and learn more by registering here: ?? https://lnkd.in/dwwCuuZG #AI #Trino #DataInnovation #Waii #TrinoSummit
-
Our friends at LinkedIn just published an insightful blog post on how they approach Text-to-SQL (UI, benchmarks, user insights all worth looking into!). ?? As organizations continue to embrace data-driven decision-making, Text-to-SQL is quickly becoming a standard need for enterprises. Whether it's enhancing your product or streamlining operations within your organization, the potential is enormous. If you're looking to implement Text-to-SQL capabilities or want to stay ahead of the curve, we'd love to help you get started! Let's chat about how we can bring this innovation to your team. ?? Visit us at https://www.waii.ai/ https://lnkd.in/gB_9X4D4
-
?? 25x SQL Accuracy with Waii.ai Join our webinar to see how our enterprise text-to-SQL system is revolutionizing analytics: ? Purpose-built SQL Compiler – Reduce query errors to <1% vs 23% error rate when using GPT/an LLM alone. ? Automatic Knowledge Graph Technology ? Trusted by Fortune 500 companies worldwide ?? Results? See the chart below – we're transforming SQL accuracy at scale! ?? Don’t miss out—register now! Webinar link in comments. #EnterpriseAI #TextToSQL #text2sql #texttosql
-
-
We had a fully packed house last night at the Text-to-SQL Mini-Summit hosted by Waii. Thank you to our presenters and panelists - Albert Chen (LinkedIn), Sameer Merchant (Palo Alto Networks), Gaurav Singh (Ridecell), Shivika K Bisen (Viasat), Anna Krause (Viasat), and Gunther Hagleitner (Waii.ai) - as well as Amazon Web Services (AWS) GenAI Loft for hosting us!
-
-
Interested in Text-to-SQL and based in San Francisco? Join us at the Text-to-SQL Mini Summit hosted by AWS GenAI Loft SF on Wednesday, November 6 where you'll hear speakers from LinkedIn, Uber, Waii.ai, and more on how we do AI-powered natural language analytics today! Attendance will be capped, so register quickly if you can! https://lu.ma/2aczq5dt
-
Perfect opportunity to learn more and see a demo of Waii.ai's Text-to-SQL if you haven't already!
Annoyed you can only do Text-to-SQL on top of one table and even then your answers are wrong and full of LLM hallucinations? See how some of the largest companies in the world today are using Waii.ai's most accurate Text-to-SQL across their 1000s/10,000s of tables for true enterprise natural language analytics. Register below for this free webinar led by ex-{Snowflake, Cloudera, and Amazon} engineering and product leaders.
此处无法显示此内容
在领英 APP 中访问此内容等
-
Our Text-to-SQL is now available on Trino! Thank you Trino Software Foundation for the shoutout!
Trino has all the data accessible. Waii has the tools to convert your text to SQL, charts and more. And now waii supports Trino - check?https://lnkd.in/gRGGvN_N?for more info and give it a whirl. Thanks to Gunther Hagleitner and Derek Chang for their work on adding Trino support.
-
Thanks for featuring us LangChain! Learn how Waii.ai + LangGraph can help you with complex joins in your Text-to-SQL journey!
Lots of companies are trying to build text-to-SQL agents to help with business visibility, but most realize it's a hard problem to do well, especially with complex joins. Waii?+ LangGraph make it possible. ?? Read the blog and get the starter code, so you can build your analytics agent too! ?? https://lnkd.in/gE8Kr9KD
-