LLM Pulse - August 16th 2024
New Releases & Updates
OpenAI responds to 'very popular demand' with launch of key LLM safety feature for developers
OpenAI today announced a key update that finally delivers the top-requested feature developers have been asking for. With the launch of its Structured Outputs in the API feature, the company is providing developers with a reliable way to ensure that the outputs of their generative artificial intelligence models match the data stored within their JavaScript Object Notation Schema files. Read More
Convin launches a 7-billion parameter LLM for Indian contact centers; expects a 3X revenue boost in 2024-25
New Delhi [India], August 13: Convin, a leading AI-powered conversation intelligence platform for call center setups, recently launched its advanced Large Language Model (LLM), with 7 billion parameters. Read More
Deepset launches studio for LLM app development with cloud and nvidia ai integrations
Deepset, the mission-critical AI company, today announced an expansion of its offerings with deepset Studio, an interactive tool that empowers product, engineering and data teams to visually architect custom AI pipelines that power agentic Read More
UAE-developed? LLM? Jais 70B aims to democratise access to AI
The latest release of a UAE-developed large language model, Jais 70B, will be able to deliver Arabic-English bilingual capabilities at an “unprecedented size and scale”, according to Inception, a company under the umbrella of G42, Abu Dhabi’s AI and cloud company. Read More
NVIDIA introduces efficient fine-tuning with nemo curator for custom LLM Datasets
In a recent post, NVIDIA introduced the NeMo Curator, a powerful tool designed to facilitate the curation of custom datasets for large language models (LLMs) and small language models (SLMs). The NeMo Curator aims to streamline pretraining and continuous training processes, as well as fine-tuning existing foundation models on domain-specific datasets, according to the NVIDIA Technical Blog. Read More
SCIENTIST.COM unveils ELISA LLM research assistant
Scientist.com has unveiled a new large language model (LLM)-powered research assistant called Elisa, named in homage to a widely used research assay and to ELIZA, an early 1960s chatbot. Elisa is the newest addition to a suite of proprietary AI-powered apps created by Scientist.com to improve and accelerate drug research. Marketplace users can now benefit from a powerful, scalable, and intuitive LLM assistant that understands the nuances of pharmaceutical research and has been trained on Scientist.com's public data sources. Read More
LLM not available in your area? Snowflake now enables cross-region inference
The regional availability of large language models (LLMs) can provide a serious competitive advantage — the faster enterprises have access, the faster they can innovate. Those who have to wait can fall behind. Read More
Datadog Inc. - Monitor your anthropic applications with datadog? LLM? observability
Anthropic is an AI research and development company focused on building reliable and safe artificial intelligence systems. Their flagship product is Claude, an advanced language model and conversational AI assistant known for its strong capabilities in natural language processing, reasoning, and task completion. Read More
Mohamed bin Zayed University for Artificial Intelligence launches large language model 'K2-65B
Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), in collaboration with Petuum and LLM360, has launched “K2-65B”, a ground-breaking open-source 65-billion parameter large language model (LLM). Read More
Research and Technology
MIT researchers use large language models to flag problems in complex systems?
The approach can detect anomalies in data recorded over time, without the need for any training. In a new study, MIT researchers found that large language models (LLMs) hold the potential to be more efficient anomaly detectors for time-series data. Importantly, these pretrained models can be deployed right out of the box. Read More
Korean research utilizes LLM to predict dementia risk
A research team at the Korean government-backed Electronics and Telecommunications Research Institute in Daejeon, a city south of Seoul, has developed a predictive AI technology for screening mild cognitive impairment. Read More
NASA and IBM Collaborate to develop INDUS? large language models? for advanced science research
A collaboration between IMPACT and IBM has produced INDUS, a comprehensive suite of large language models (LLMs) tailored for the domains of Earth science, biological and physical sciences, heliophysics, planetary sciences, and astrophysics and trained using curated scientific corpora drawn from diverse data sources. Read More
领英推荐
Experiments reveal LLMs develop their own understanding of reality as their language abilities improve
researchers from MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) have uncovered intriguing results suggesting that language models may develop their own understanding of reality as a way to improve their generative abilities. Read More
Other News
LLM to ROI: How to scale gen AI in retail
This article is a collaborative effort by Alexander Sukharevsky, Andreas Ess, Emily Reasor, and Holger Hürtgen, with Oleg Sokolov and Sergey Kondratyuk, representing views from McKinsey's Digital and Retail Practices. Read More
LLM progress is slowing — what will it mean for AI?
We used to speculate on when we would see software that could consistently pass the Turing test. Now, we have come to take for granted not only that this incredible technology exists — but that it will keep getting better and more capable.? Read More
Gartner: The LLM price war in China will accelerate the AI gravity to cloud
When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works. In recent months, Chinese generative AI (GenAI) vendors have significantly reduced the inference costs of their large language model (LLM). Read More
Health Informatics; The TRIPOD- LLM? Statement: A targeted guideline for reporting? large language models? use
Large Language Models (LLMs) are rapidly being adopted in healthcare, necessitating standardized reporting guidelines. We present TRIPOD-LLM, an extension of the TRIPOD+AI statement, addressing the unique challenges of LLMs in biomedical applications. TRIPOD-LLM provides a comprehensive checklist of 19 main items and 50 subitems, covering key aspects from title to discussion. Read More
Cybersecurity in Age of AI: Black Hat 2024’s top 3? LLM? security risks
Last week, at Black Hat 2024, one of the major cybersecurity events in the world, a few disconcerting revelations about the potential unsecure nature of GenAI and LLM implementations came to the fore – and how they can be exploited by hackers and malicious actors to steal user data and critical business intelligence. Here are the top highlights. Read More
Novel AI framework for detecting LLM "hallucinations" in medical summaries
Disclaimer: All opinions expressed by Contributors are their own and do not represent those of their employers, or BiopharmaTrend.com. Read More
Contextualising AI: Building Enterprise? LLM? applications with organizational insights
November 2022 marked a key moment in the space of generative AI with the launch of ChatGPT, catalysing widespread interest and adoption. This surge of interest spurred even enterprises to explore innovative applications of large language models (LLMs) on both structured and unstructured data, aiming to boost productivity and operational efficiency. Consequently, many tools emerged to minimise manual intervention in tasks. These tools include AI data analysts, automated insight generators, and knowledge search functionalities. Read More
Google slashes Gemini 1.5 Flash prices, igniting LLM price war
In May, Google announced the new Gemini 1.5 Flash model, optimized for speed and efficiency. The Gemini 1.5 Flash was aggressively priced ($0.35 per million input tokens and $1.05 per million output tokens) compared to other frontier Read More
DeepMind’s Gemma Scope peers under the hood of large language models
Large language models (LLMs) have become very good at generating text and code, translating languages, and writing different kinds of creative content. Read More
Dataiku expands LLM Mesh with governance layer
The vendor provides a marketplace for enterprises similar to those of the major cloud providers. However, it does not have its own LLM, but offers many others in its marketplace. Read More
Booz Allen Hamilton deploys first AI large language model in space
The generative AI large language model at the space station is intended to help astronauts address queries and resolve issues. Read More
Co-Founder of Altrosyn and DIrector at CDTECH | Inventor | Manufacturer
3 个月The proliferation of LLMs like Convin and Jais 70B indicates a shift towards domain-specific models, addressing industry needs with tailored capabilities. Open-source contributions like MBZUAI's K2-65B demonstrate the collaborative nature of LLM development, accelerating progress through shared knowledge. Given the increasing sophistication of LLMs in anomaly detection, how could these models be integrated into real-time cybersecurity systems to proactively identify and mitigate emerging threats?