The future of privacy-preserving generative AI is bright. AI teams can now use Gretel with #Azure AI Foundry to design and generate secure synthetic datasets tailored to their specific business needs. This integration significantly reduces costs and time compared to traditional data labeling methods, while maintaining robust privacy and compliance standards. Enterprises are already leveraging Gretel to build specialized Small Language Models (SLMs), enhance reasoning abilities in Large Language Models (LLMs), and scale data generation from limited real-world examples. “EY is leveraging the privacy-protected synthetic data to fine-tune Azure OpenAI Service models in the financial domain," said John Thompson, Global Client Technology AI Lead at EY. "Using this technology with differential privacy guarantees, we generate highly accurate synthetic datasets—within 1% of real data accuracy—that safeguard sensitive financial information and prevent PII exposure. This approach ensures model safety through privacy attack simulations and robust data quality reporting. With this integration, we can safely fine-tune models for our specific financial use cases while upholding the highest compliance and regulatory standards.” Check out the announcement for more details: https://lnkd.in/d5CMVW3y #SyntheticData #Privacy #AI
Gretel
软件开发
Palo Alto,California 20,409 位关注者
The synthetic data platform purpose-built for Generative AI
关于我们
Gretel is solving the data bottleneck problem for AI scientists, developers, and data scientists by providing them with safe, fast, and easy access to data without compromising on accuracy or privacy. Designed by developers for developers, Gretel’s APIs make it easy to generate anonymized and safe synthetic data so you can preserve privacy and innovate faster. You can learn more about synthetic data from Gretel's engineers, data scientists, and AI research team on our blog: https://gretel.ai/blog
- 网站
-
https://gretel.ai
Gretel的外部链接
- 所属行业
- 软件开发
- 规模
- 51-200 人
- 总部
- Palo Alto,California
- 类型
- 私人持股
- 创立
- 2020
- 领域
- Generative AI、Synthetic Data、Privacy、AI、Deep Learning和Agents
产品
The Developer Stack for Synthetic Data.
数据隐私管理软件
Synthetic data that’s as good, or even better than the data you have. Or don’t have. Create and share data with the best-in-class accuracy and privacy guarantees – on demand.
地点
Gretel员工
动态
-
Gretel转发了
Despite the immense potential for innovation in finance, the industry often faces hurdles due to technical complexities.?That's why we're excited to share a new development at Gretel, Synthetic Text-to-Python dataset tailored specifically for FinTech. By enabling #LLMs and #SLMs to "speak finance," we're making it possible to turn plain language requests into precise, domain-specific Python code. Imagine saying, "Create a Python script to analyze transaction patterns for fraud detection," and getting usable code instantly. This enables analysts to security teams to bring their ideas to life without needing deep coding skills. Financial systems are intricate, filled with specialized terms and strict regulations. Generic datasets often miss these nuances, making it hard for AI models to produce accurate code for financial tasks. By customizing models with our FinTech-focused dataset, we're helping them understand these complexities—from secure data handling to compliance standards—so they can generate code that's both useful and safe. Using Gretel, we've crafted a high-quality synthetic dataset that reflects real-world FinTech scenarios. We've: -- Guided LLMs to act like Python experts in finance. -- Covered various sectors like banking, fraud detection, and smart contracts. -- Ensured the code is accurate and relevant through rigorous checks. By teaching AI models to understand and generate code within the financial domain, we're lowering the barriers to AI adoption in the industry. This not only speeds up innovation but also allows more people to contribute, even without extensive coding experience.
-
Transforming natural language into domain-specific Python code is now possible with our new synthetic Text-to-Python dataset for FinTech. Imagine asking "Create a script to analyze transaction patterns for fraud detection" and instantly getting working code. Our new dataset, created with Gretel Navigator, enables LLMs to generate precise, financial-domain code without requiring deep technical expertise. Key features: 25,000 curated records covering banking, trading, compliance & more Multiple complexity levels from beginner to expert Validated Python code with top quality scores Apache 2.0 licensed for open use Learn how we built it and create custom datasets of your own. Blog: https://bit.ly/4eOaf3X Dataset: https://bit.ly/4eMPrd5 #SyntheticData #OpenData #AI #FinTech
Accelerating FinTech Innovation with Natural Language to Code
gretel.ai
-
Excited to see Gretel recognized as part of the data transformation layer for AI-powered banking. Embedding privacy at the foundation of the stack isn’t just smart for banking—it's the way forward for every industry. #SyntheticData
Over the past month we've explored how generative AI is transforming each stage of the retail and SMB banking lifecycle.?Today, we're sharing our market map of the startups building this future. From applications to underlying infrastructure, here are the companies shaping each stage: Personetics Swaystack Dimply Creating personalized banking experiences that convert Lama AI Greenlite Parcha Baselayer Streamlining onboarding and loan origination Posh.ai Kasisto, Inc. interface.ai Glia Building next-gen conversational banking Sedric.ai Domu Making compliance and collections smarter and more customer-centric Cognaize Senso Unlocking value from unstructured?data Spade Gretel Gradient Building the data transformation layer for AI-powered banking Our full analysis here:?https://lnkd.in/e3nW3aAE Foundation Capital
-
-
Gretel转发了
Former data scientist, now Account Executive; helping enterprises build with synthetic data. 313 to 973 to 909 to 310 to 503.
Heading to re:Invent post-Thanksgiving. Booth 1605, the same year that Don Quixote was published.
-
-
Gretel转发了
At Gretel we are hiring for 15+ immediate open headcounts, across everything from Applied Science, Sales, and Marketing. At this early stage in AI, it is difficult to predict where AI will go exactly, but we can be certain it will require safe, high-quality, and domain specific data. So, come help us solve these data bottleneck problems and accelerate the whole AI ecosystem. Feel free to DM me directly if you find one of the roles a particularly good fit, or even if you don't see a specific role but interested in the problems we're solving. Better data makes better AI. *No recruiters please*
Come work with us! - Gretel.ai
gretel.ai
-
Gretel转发了
Gretel's SDK integration makes it easier to fine-tune OpenAI models on Microsoft Azure with synthetic data and built-in differential privacy. Whether you're starting with your own data or generating new datasets from prompts, this integration lets you safely work with domain-specific or sensitive data. Looking forward to seeing how teams use this to unlock new possibilities! ?? https://lnkd.in/g8FxYXYv
-
-
Gretel转发了
This is one of the most significant breakthroughs we have had at Gretel since the beginning of the company. ?? Whether your data is text, structured, or unstructured- you can get SOTA privacy and accuracy. Imagine safely unlocking insights from sensitive datasets like medical transcripts, call center logs, or financial data, all while ensuring mathematically-proven privacy safeguards
Introducing Gretel Navigator Fine Tuning with Differential Privacy. Our model of choice for generating synthetic tabular datasets now features mathematically-proven safeguards for privacy — ideal for numerical, categorial, or even free-text columns. ?? ?? Discover how this breakthrough can help enterprises unlock sensitive datasets to surface powerful insights and trends without compromising utility: https://lnkd.in/d4MVmcVk #AI #Privacy #SyntheticData
Generate Complex Synthetic Tabular Data with Navigator Fine Tuning + Differential Privacy
gretel.ai
-
Today, we’re excited to announce a new measurement feature: Personally Identifiable Information (PII) Replay—the latest addition to our suite of privacy metrics that ensures synthetic data is private by design. With PII Replay, you can easily identify sensitive information in your original training data and track how often it appears in your generated synthetic data. This powerful feature empowers you to quantify privacy risks and take action to mitigate them effectively. ?? Learn more about how PII Replay ensures privacy for your data: https://bit.ly/4fFuVw3 #AI #Privacy #SyntheticData
Quantifying PII Exposure in Synthetic Data
gretel.ai
-
Gretel转发了
We're excited to announce that Gretel Navigator Fine Tuning (NavFT) now has advanced differential privacy capabilities. This means you can generate high-quality synthetic tabular data—including numerical, categorical, free-text, and event-driven fields—while ensuring mathematical privacy guarantees. * Why This Matters * - Data Privacy: Differential privacy provides formal guarantees against data leakage, crucial for handling sensitive or regulated information. - Compliance: Helps meet strict data privacy regulations without compromising data utility for analysis and sharing. - Versatility: Supports various data types with models tailored to your dataset—NavFT for mixed types, Gretel GPT for free-text, and Tabular DP for numerical and categorical data. * Balancing Privacy and Utility * Our experiments show that while DP enhances privacy, the synthetic data remains valuable for analytics and machine learning tasks. Even with stronger privacy settings, models trained on differentially private synthetic data performed comparably to those trained on non-private data. * Key Tips * - Optimal Dataset Size: Larger datasets (thousands of samples) work best with DP. - Privacy Settings: Adjust epsilon values (ε ≥ 8) to balance privacy and utility, especially for complex or smaller datasets. - Batch Size: Increasing batch size can improve results while maintaining privacy, but watch for memory constraints.
Generate Complex Synthetic Tabular Data with Navigator Fine Tuning + Differential Privacy
gretel.ai