Data Science Milan #006
Data Science Milan
The Community of Data Scientists and Machine Learning Practitioners based in the Greater Milan area.
Dear Data Science Milan Community,
Welcome back to our newsletter, bringing you another edition packed with the latest developments, inspiring projects, and invaluable insights from the world of data science!
With this episode we reach another step of the NLP, we dive deep into the transformative world of Named Entity Recognition (NER) in the Social Media Industry.
Named Entity Recognition (NER) is a game-changer in information extraction.
It involves identifying and classifying key information (entities) in text into predefined categories, essentially categorizing each word in a given text.
You can discover the power of NER in social media and how it can shape the future of a coffe shop business by use cases: Trend Analysis and Market Insights, Competitive Analysis, Targeted Marketing and Engagement.
Read the full article and elevate your business strategies with the insights gained from Named Entity Recognition.
Data?Science?Milan?events
BRIOxAlkemy: A bias detecting tool
On December 13th, 2023, Greta Coraglia and Davide Posillippo spoke about a bias detecting tool.
The aim of the collaboration between BRIO and Alkemy is to produce software applications for the analysis of bias, risk and opacity with regard to AI technologies which often rely on non-deterministic computations and are opaque in nature. They present a first tool developed within the BRIOxAlkemy collaboration for the detection and analysis of biased behaviours in AI systems, and its theoretical background. The tool is aimed at developers and data scientists who wish to test their algorithms relying on probabilistic and learning mechanisms in order to detect misbehaviours related to biases and collect data about them. They will show the tool with a live demo and explain our open source and collaborative approach to its development.
Watch the video
Machine Learning pipelines @Facile.it: how to keep models always trained
On November 25th, 2023, at Google DevFest, Cesare Bassu showed us data science pipelines at @Facile.it.
In @Facile.it they employ a suite of MLOps principles to ensure continuous model training and prevent developmental errors. They fuse MLOps Pipelines with Continuous Integration practices to automate model development and enable automatic retraining.
Empowering the Bending Spoons' platform with data science
On November 7th, 2023, Andrea Maiorana spoke about how data science works at Bending Spoons.
领英推荐
Bending Spoons is a leading tech company based in Italy which is specialized in software and app development. Andrea went through data science workflows at Bending Spoons, and then dive into the measurement and predictions related to app users metrics. There was also a poster session during networking aperitivo.
Read the article
Watch the video
Knowledge section
Here are some selected NLP resources:
-Named Entity Recognition with HuggingFace using PyTorch and W&B - The goal of this tutorial is to build a machine learning model able of performing Named Entity Recognition (NER) by fine-tuning the pretrained BERT model on the CoNLL2023 dataset, and then can accurately identify and extract named entities from text.
-Named Entity Recognition with W&B and spaCy - Another tutorial about NER from Weights & Biases platform, but using the well known spaCy package.
Be involved!
We want also to remind you that if you like and enjoy our events, you can get in touch with us at [email protected] to be involved in organizing new great online activities.
We are also very happy if you are interested in being a speaker or if you want to share your expertise or experience with the?Data?Science?Milan?community!!!
Wallboard
Would you like to become one of our sponsors and increase your popularity among the?Data?Science?community? Write here
If instead, you would like to promote a message to the wallboard, please contact us and send us your relevant announcements. We will publish them here.