Engineers often ask me how to land a killer job in generative AI, whether at an innovative startup or at a big player like OpenAI or Anthropic. Here is my consolidated advice:
1. Learn the fundamentals. Dive deep into machine learning, deep learning, transformers, and language models. You need a solid grasp of the core concepts and techniques. Courses from fast.ai and DeepLearning.AI, and papers like "Attention Is All You Need," are great places to start.
2. Get your hands dirty with code. Learn PyTorch or TensorFlow. Start implementing models from scratch. Spin up GPU instances and train small language models on real datasets. Nothing beats hands-on experience.
3. Immerse yourself in the field. Read every new AI paper you can get your hands on, especially from top labs like OpenAI, DeepMind, Google AI, and Meta AI. Stay on top of the cutting edge and soak in the ideas.
4. Build a portfolio of projects. Implement influential models, try novel approaches, train models on unique datasets. Create a body of tangible, impressive work you can showcase. Open-source it on GitHub.
5. Connect with the community. Follow AI thought leaders on Twitter. Participate in forum discussions. Attend meetups and conferences. Network, learn, and make yourself known. Opportunities arise from relationships.
6. Reach out and apply. Don't wait for job postings. Proactively contact labs and companies doing fascinating generative AI work. Share your portfolio. Convey your passion and fresh ideas. Persistence breaks through.
7. Never stop learning and creating. AI moves fast. Yesterday's breakthroughs are today's basics. To stay relevant in this field, you need to continuously evolve, upskill, and push new boundaries. Let your curiosity and creativity drive you.
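To make step 2 a bit more concrete, here is a rough sketch of what "from scratch" can look like: the scaled dot-product attention at the heart of "Attention Is All You Need," written in plain Python with no framework. The function names and the toy inputs are my own choices for illustration, and this is a single head with no masking or learned weights, so treat it as a starting exercise, not a production implementation.

```python
import math


def softmax(scores):
    """Turn a list of raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Single-head attention over lists of vectors (no masking, no learned weights)."""
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # The output is the attention-weighted average of the value vectors.
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights


if __name__ == "__main__":
    # Toy inputs, made up for illustration: 2 tokens, 2-dimensional vectors.
    q = [[1.0, 0.0], [0.0, 1.0]]
    k = [[1.0, 0.0], [0.0, 1.0]]
    v = [[1.0, 2.0], [3.0, 4.0]]
    out, attn = scaled_dot_product_attention(q, k, v)
    print(out)
    print(attn)
```

Once something like this makes sense, a natural next exercise is to rewrite it in PyTorch or TensorFlow with batched tensors and compare the results against the framework's built-in attention.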
Generative AI is revolutionizing the world. With the right skills, experience, and a healthy dose of grit, you can be a part of building that future. Will it be easy? No. Worth it? Absolutely. Now go make it happen, and let me know if I can help along the way.