LLM Pulse  - August 16th 2024

LLM Pulse - August 16th 2024

New Releases & Updates

OpenAI responds to 'very popular demand' with launch of key LLM safety feature for developers

OpenAI today announced a key update that finally delivers the top-requested feature developers have been asking for. With the launch of its Structured Outputs in the API feature, the company is providing developers with a reliable way to ensure that the outputs of their generative artificial intelligence models match the data stored within their JavaScript Object Notation Schema files. Read More

Convin launches a 7-billion parameter LLM for Indian contact centers; expects a 3X revenue boost in 2024-25

New Delhi [India], August 13: Convin, a leading AI-powered conversation intelligence platform for call center setups, recently launched its advanced Large Language Model (LLM), with 7 billion parameters. Read More

Deepset launches studio for LLM app development with cloud and nvidia ai integrations

Deepset, the mission-critical AI company, today announced an expansion of its offerings with deepset Studio, an interactive tool that empowers product, engineering and data teams to visually architect custom AI pipelines that power agentic Read More

UAE-developed? LLM? Jais 70B aims to democratise access to AI

The latest release of a UAE-developed large language model, Jais 70B, will be able to deliver Arabic-English bilingual capabilities at an “unprecedented size and scale”, according to Inception, a company under the umbrella of G42, Abu Dhabi’s AI and cloud company. Read More

NVIDIA introduces efficient fine-tuning with nemo curator for custom LLM Datasets

In a recent post, NVIDIA introduced the NeMo Curator, a powerful tool designed to facilitate the curation of custom datasets for large language models (LLMs) and small language models (SLMs). The NeMo Curator aims to streamline pretraining and continuous training processes, as well as fine-tuning existing foundation models on domain-specific datasets, according to the NVIDIA Technical Blog. Read More

SCIENTIST.COM unveils ELISA LLM research assistant

Scientist.com has unveiled a new large language model (LLM)-powered research assistant called Elisa, named in homage to a widely used research assay and to ELIZA, an early 1960s chatbot. Elisa is the newest addition to a suite of proprietary AI-powered apps created by Scientist.com to improve and accelerate drug research. Marketplace users can now benefit from a powerful, scalable, and intuitive LLM assistant that understands the nuances of pharmaceutical research and has been trained on Scientist.com's public data sources. Read More

LLM not available in your area? Snowflake now enables cross-region inference

The regional availability of large language models (LLMs) can provide a serious competitive advantage — the faster enterprises have access, the faster they can innovate. Those who have to wait can fall behind. Read More

Datadog Inc. - Monitor your anthropic applications with datadog? LLM? observability

Anthropic is an AI research and development company focused on building reliable and safe artificial intelligence systems. Their flagship product is Claude, an advanced language model and conversational AI assistant known for its strong capabilities in natural language processing, reasoning, and task completion. Read More

Mohamed bin Zayed University for Artificial Intelligence launches large language model 'K2-65B

Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), in collaboration with Petuum and LLM360, has launched “K2-65B”, a ground-breaking open-source 65-billion parameter large language model (LLM). Read More

Research and Technology

MIT researchers use large language models to flag problems in complex systems?

The approach can detect anomalies in data recorded over time, without the need for any training. In a new study, MIT researchers found that large language models (LLMs) hold the potential to be more efficient anomaly detectors for time-series data. Importantly, these pretrained models can be deployed right out of the box. Read More

Korean research utilizes LLM to predict dementia risk

A research team at the Korean government-backed Electronics and Telecommunications Research Institute in Daejeon, a city south of Seoul, has developed a predictive AI technology for screening mild cognitive impairment. Read More

NASA and IBM Collaborate to develop INDUS? large language models? for advanced science research

A collaboration between IMPACT and IBM has produced INDUS, a comprehensive suite of large language models (LLMs) tailored for the domains of Earth science, biological and physical sciences, heliophysics, planetary sciences, and astrophysics and trained using curated scientific corpora drawn from diverse data sources. Read More

Experiments reveal LLMs develop their own understanding of reality as their language abilities improve

researchers from MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) have uncovered intriguing results suggesting that language models may develop their own understanding of reality as a way to improve their generative abilities. Read More

Other News

LLM to ROI: How to scale gen AI in retail

This article is a collaborative effort by Alexander Sukharevsky, Andreas Ess, Emily Reasor, and Holger Hürtgen, with Oleg Sokolov and Sergey Kondratyuk, representing views from McKinsey's Digital and Retail Practices. Read More

LLM progress is slowing — what will it mean for AI?

We used to speculate on when we would see software that could consistently pass the Turing test. Now, we have come to take for granted not only that this incredible technology exists — but that it will keep getting better and more capable.? Read More

Gartner: The LLM price war in China will accelerate the AI gravity to cloud

When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works. In recent months, Chinese generative AI (GenAI) vendors have significantly reduced the inference costs of their large language model (LLM). Read More

Health Informatics; The TRIPOD- LLM? Statement: A targeted guideline for reporting? large language models? use

Large Language Models (LLMs) are rapidly being adopted in healthcare, necessitating standardized reporting guidelines. We present TRIPOD-LLM, an extension of the TRIPOD+AI statement, addressing the unique challenges of LLMs in biomedical applications. TRIPOD-LLM provides a comprehensive checklist of 19 main items and 50 subitems, covering key aspects from title to discussion. Read More

Cybersecurity in Age of AI: Black Hat 2024’s top 3? LLM? security risks

Last week, at Black Hat 2024, one of the major cybersecurity events in the world, a few disconcerting revelations about the potential unsecure nature of GenAI and LLM implementations came to the fore – and how they can be exploited by hackers and malicious actors to steal user data and critical business intelligence. Here are the top highlights. Read More

Novel AI framework for detecting LLM "hallucinations" in medical summaries

Disclaimer: All opinions expressed by Contributors are their own and do not represent those of their employers, or BiopharmaTrend.com. Read More

Contextualising AI: Building Enterprise? LLM? applications with organizational insights

November 2022 marked a key moment in the space of generative AI with the launch of ChatGPT, catalysing widespread interest and adoption. This surge of interest spurred even enterprises to explore innovative applications of large language models (LLMs) on both structured and unstructured data, aiming to boost productivity and operational efficiency. Consequently, many tools emerged to minimise manual intervention in tasks. These tools include AI data analysts, automated insight generators, and knowledge search functionalities. Read More

Google slashes Gemini 1.5 Flash prices, igniting LLM price war

In May, Google announced the new Gemini 1.5 Flash model, optimized for speed and efficiency. The Gemini 1.5 Flash was aggressively priced ($0.35 per million input tokens and $1.05 per million output tokens) compared to other frontier Read More

DeepMind’s Gemma Scope peers under the hood of large language models

Large language models (LLMs) have become very good at generating text and code, translating languages, and writing different kinds of creative content. Read More

Dataiku expands LLM Mesh with governance layer

The vendor provides a marketplace for enterprises similar to those of the major cloud providers. However, it does not have its own LLM, but offers many others in its marketplace. Read More

Booz Allen Hamilton deploys first AI large language model in space

The generative AI large language model at the space station is intended to help astronauts address queries and resolve issues. Read More

Godwin Josh

Co-Founder of Altrosyn and DIrector at CDTECH | Inventor | Manufacturer

3 个月

The proliferation of LLMs like Convin and Jais 70B indicates a shift towards domain-specific models, addressing industry needs with tailored capabilities. Open-source contributions like MBZUAI's K2-65B demonstrate the collaborative nature of LLM development, accelerating progress through shared knowledge. Given the increasing sophistication of LLMs in anomaly detection, how could these models be integrated into real-time cybersecurity systems to proactively identify and mitigate emerging threats?

回复

要查看或添加评论,请登录

社区洞察

其他会员也浏览了