Data Science #35
Andriy Burkov
PhD in AI, ML at TalentNeuron, author of ?? The Hundred-Page Machine Learning Book and ?? the Machine Learning Engineering book
In this issue: alternatives to cosine similarity; understanding Gaussians; scaling to multi-terabyte datasets; coding for structured generation with LLMs; a tutorial on bayesian optimization; methods for comparing spatial patterns in raster data; and more.
The sponsor of this issue is GitNotebooks .
Frustrated with GitHub’s notebook review experience? Find out how ML teams are cutting their notebook review time in half with GitNotebooks. Quickly share feedback while staying in sync with GitHub. (Did we mention it’s free for most teams ?)
More than 60,000 subscribers are reading this newsletter. If you build a data product or provide a data service, you can become a sponsor of one of the future newsletter issues and get your business featured in the newsletter. Feel free to reach out to [email protected] for more details on sponsorships.
Enjoy the newsletter? Please help us make it bigger and better by sharing it with colleagues and friends.
Aiming to be a researcher on AI
1 周This is great
Data Scientist at Moonshot | ML & NLP | Former IRC Postgraduate Scholar
1 周?? I was impressed when I opened the newsletter email, this morning, to find so many relevant and interesting articles. Thank you for curating it every week!
Front end Architect | UI Angular | Machine Learning | Deep Learning | JavaScript | GENAI | NodeJS | Power BI | AEM | HTML5 | CSS3 | LLM | Prompts Engineering
1 周Great work Andrity
Senior Dataset Engineer @ AiDataForge | IR Images and Videos| Annotation Vendor Teledyne FLIR, RayMarine etc.
1 周Excellent work Andriy
Automation (AI). SEO (linkbuilding). At Confuse The Machine & ReachCreator
1 周I've subscribed. Your content on here is a perfect balance between hilarious and informative. Can't wait to read the newsletter.