Simulated and Synthetic Data Generation - The Effective Statistician Workshop ORIENTATION - Lead by Darko Medin
Darko Medin
Data Scientist and a Biostatistician. Developer of ML/AI models. Researcher in the fields of Biology and Clinical Research. Helping companies with Digital products, Artificial intelligence, Machine Learning.
In today's data-driven world ability to generate Simulated and Synthetic data is one of the most important Data Science and Statistics topics. This Fall, Effective Statistician will host the Simulated and Synthetic Data Generation Workshop for this field. This is the ORIENTATION for the workshop.
CONFERENCE CHAIRS
The Effective Statistician Conference Co-chaired by Dr. Alexander Schacht and Chantelle Cornett
WORKSHOP LEAD
International Expert Biostatistician and an AI Data Scientist - Darko Medin
DATE : November 7th, 9-10:30 am CEST
TOPICS COVERED
1. Introduction to Simulated vs Synthetic Data
Definition and Motivation: Overview of synthetic data and its importance.
Applications: Use cases in Healthcare, Statistics, Business and more.
Ethical Considerations: Privacy, data security, and ethics.
2. Simulated Data principles
Simulating distributions
Simulating conditional distributions
MCMC simulations
3. Types of Synthetic Data
Fully Synthetic Data: Creating datasets entirely from models.
Partially Synthetic Data: Mixing real and synthetic data.
Utility vs. Privacy: Balancing data utility and privacy.
4. Statistical Methods for Generating Synthetic Data
Parametric Methods: Using statistical distributions.
领英推荐
Non-Parametric Methods: Bootstrapping, kernel density estimation.
Model-Based Methods: Regression, Bayesian models, etc.
5. Advanced Techniques in Synthetic Data Generation
Machine Learning: Bayesian networks, Deep Learning, VAEs, RNNs and more.
Sequential Synthesis: Creating time-series and longitudinal data.
6. Case Studies and Practical Examples
Real-world Examples
Hands-on Session: Practical data generation exercises.
Tool Comparison: Overview of synthetic data generation tools.
7. Challenges and Future Directions
Challenges: Common pitfalls.
Research Trends: Emerging directions in synthetic data research.
8. Interactive Q&A and Discussion
Open floor for participant questions and discussions.
9. Closing and Further Resources
Summary: Recap key points.
Further Resources: Reading materials
HOW TO GET MORE INFORMATION ON JOINING
How to join? More information may be found on the Conference website : https://theeffectivestatistician.com/the-effective-statistician-conference-2024-transforming-healthcare/#programm-overview
Also for more information on the Workshops, you may contact Dr. Alexander Schacht or Reine Escalona .
by Darko Medin, Workshop Leader.