??? Industry News in 1 Line (20th Nov 2024)
Piyush Sharma
Open Innovation @ Samsung Research India || M.Tech in AI @ IIT Jodhpur || Tech Strategy & Research
1)?Qwen-2.5 Turbo?features context length up to 1M tokens, and can generate them in?68 seconds, a 4.3x speedup. The price remains ?0.16/1M tokens.
2) Mistral’s?Le Chat?got a huge upgrade. Includes feature like web search, vision, canvas ideation, coding and my favorite image generation with Flux Pro.?All are currently free?during their beta. More details here.Try their?Le Chat?now.
3) Cerebras’?Llama-3.1 405B now runs at?969 tokens/s. With 128K context length, 16-bit weights, they are the industry’s fastest time-to-first token @ 240ms.