Just crossed 300,000 public AI datasets for text, audio, image, video, 3D, time-series, tabular, and more on Hugging Face! IMO public AI datasets can be more impactful than public AI models and should get more attention. Congrats to everyone who's contributing! If you haven't shared your datasets yet, why not?
Amazing milestone, Clem! Public datasets truly are the backbone of innovation. By the way, did you expect to see such huge growth and success for Hugging Face so soon? P.S. Kudos to everyone contributing!
Clem Delangue, incredible milestone. Thank you for highlighting the importance of public AI datasets.
Clem Delangue, but sadly nothing available on radical pairs or the Earth's magnetic field yet! We eagerly await...
Sir, can I publish my own datasets as well?
This is a great
Impressive achievement.
Congrats Clem big 300
Clem Delangue, totally agree!
Co-founder and CTO at Stacklok
Shameless pitch: with the open-source promptwright, you can generate synthetic datasets using a local LLM and push them directly to a Hugging Face dataset repo: https://github.com/StacklokLabs/promptwright