登录查看更多内容

Leading, and Pairing on, ML projects

Jonathan Hilgart

Machine Learning, LLMs, and Information Retrieval

发布日期: 2021年12月28日

Most machine learning projects sit squarely in the intersection between "spend two years on this and get back to me" research and overly pragmatic "let’s ship it tomorrow" engineering. Each company has a preference for where on the spectrum they are; some dedicate full research teams to optimizing the last .001%, while others use whenever open-source code they can find to classify MNIST digits and call it a day. For me, I like to be on, and lead, teams that sit in the middle as full-stack machine learning engineers. However, the difficulty with the middle is that the work required is often very wide encompassing pulling data, exploring data, modeling, building infrastructure, serving, retraining …. The list goes on. Given this, what is the best way to staff people on these types of projects? I’ve tried lots of approaches; Kanban, waterfall, research-focused, variants of Agile. They can all work.

tl;dr Staffing projects all depends on your team dynamics and the individuals that are working with you

However, I have found expectation setting through project leads (not a new concept) and project pairs (a newer concept?) to be helpful for this type of work.

Above are all the different phases of a project. As you can see, it covers a wide variety of areas.

Project leads are formally the individuals responsible for ensuring a project is seen to completion, communicating the status of a project including any changes to the project timeline, as well as leveraging data to assess the impact. These people are your “I trust you to complete this without hand-holding the entire time.” In addition, project leadership is a great opportunity for junior developers to work on smaller features. Here are some expectations I’ve found useful.

Lead Core Expectations

For all projects

Providing in-depth, opinionated engineering knowledge:

You use the right media - code snippets, architecture diagrams, whiteboard - to communicate ideas to audiences of varying technical backgrounds.
You provide API documentation

Working through the requirements for a new project:

You are responsible for gathering the requirements for the system including model performance requirements, latency expectations, architecture technologies

Ensuring your project pair has context on the work and the plan for sequencing work:

Understanding which work can be parallelized (model training, hyperparameter tuning)
Can you provide the appropriate context for the individuals you are working with

Remaining two steps ahead of anticipated questions. Leverages the correct audience, content, and tone while communicating

Email, Slack, Zoom, in-person, PowerPoint, excel you choose the communication updates appropriate for the audience

Shipping the project

You can be relied on for getting projects across the finish line and leveraging others to do so

Supporting rollout

Meet with stakeholders to discuss a rollout strategy and timeline
Be the primary point of contact for any bugs or quick updates needed during the rollout
Ensure we have a testing plan in place
Ensure we have success metrics defined

Rolling up their sleeves next to the lead, project pairs are the second brain. With the work spanning many different skillsets, tech stacks, and stakeholders, I find it useful to explicitly assign pairs on projects. Both for personal development, “hey, this technology looks cool,” removal of the bus factor, “only Peter knows how our Resnet backbone was trained,” and PR reviews, “There are 15 PRs that have been open for more than a week.” The expectations of a pair depend on the stage of a project.? Some parts of the machine learning lifecycle are well suited for pairing such as model experimentation, while others, deployment, and infrastructure, require the pair to be the primary reviewer on design documents to offer critical feedback. Generally, I’ve found that pairs spend 10-15% of the total time on the project assisting and gathering context.

领英推荐

AI Takes Control: Key Insights from Artificial…

Certiprof 1 个月前

The Rise of Self-Improving Development Environments

Saad Aslam 1 个月前

Strategic Guide for CIOs & VPs of Engineering: Best…

Sebastian Redondo 1 个月前

Pair Core Expectations

For all projects

Responsible for ensuring code is shipped in a timely manner to the production

PR reviews, design reviews?

Has relevant context for the project

Attend meetings with stakeholders
Weekly syncs with the project lead
Help write acceptance criteria for upcoming stories
An understanding of the proposed model algorithm including relevant tradeoffs
An understanding of the core data required (if applicable)
An understanding of the architecture choices made and why
Able to ship and deploy code if needed

Helps works through project sequencing

Parallelize which parts?
What comes first?
Who needs to approve?

Pair Secondary Expectations

Project dependent

Parallelizing model training with the project lead. Two brains working to find the best model > one brain.
Writing project-specific code

Examples include: data transform pipelines, infrastructure setup, integration review, micro/macro tests

Writing architecture diagrams or other documents outlining decisions made during the project

You and the project lead can discuss who should handle this

Anticipated Q&A

Q: Shouldn’t everyone know how everything works?

I think we can all agree that’d be wonderful. However, given limited time and energy, and the fact that each project is very wide, it’s useful to have someone tagging along all of the steps instead of someone popping in here and there without full context

Q: What about the rest of the team, should they have a say?

Of course, but not for all decisions. At the highest level, the team, depending on size, should be able to articulate the business problem/value for everything everyone is working on. You can accomplish this through other venues such as sharing design documents team-wide or a recurring showcase meeting. However, you also need other engineers to work on their own projects.

Q: You’re making a lot of assumptions about the communication preferences and working styles of people.

Definitely. See the tl;dr above. With that said, I do expect people I work with to be able to work with others in either a lead or pair capacity.

Thanks for reading!

要查看或添加评论，请登录

Jonathan Hilgart的更多文章

Testing Machine Learning Systems

2021年8月10日

Testing Machine Learning Systems

Testing in software development is almost a science at this point. You’ve probably heard the quip, “Write tests, some…
Real-time audio processing on a Pi

2021年7月3日

Real-time audio processing on a Pi

Over the past couple of days, I've been working on visualizing the frequency spectrum of audio. My goal was to wire up…

2 条评论
Good Ol' Algorithms

2021年5月14日

Good Ol' Algorithms

Did you know it is estimated people make 35,000 decisions per day! Most of these decisions are binary, but some of…

1 条评论
Primitive functions and the bias they introduce

2020年12月28日

Primitive functions and the bias they introduce

Noah Chomsky has a theory that humans have an innate ability to learn language. This ability to learn a language is…

3 条评论
FHIR is Fire

2020年8月1日

FHIR is Fire

You remember the last time you texted an Android phone (yes, I’m assuming you’re an iPhone user) and a green bubble…

2 条评论
How Computers Learn to See

2020年1月2日

How Computers Learn to See

Professional wide receivers have been known to catch a football with their eyes closed. Hockey goalies predict where…
Information Security & Social Inclusion

2019年6月12日

Information Security & Social Inclusion

I’m currently reading through the book Computer Security: Principles and Practice which discusses security protocols…

1 条评论
Inductive Bias for Different Machine Learning Approaches

2019年3月3日

Inductive Bias for Different Machine Learning Approaches

Source - https://www.researchgate.
How to Teach Computers to be `Intelligent`

2019年1月7日

How to Teach Computers to be `Intelligent`

Does this mean anything to you? If this were a Rorschach test, i.e.
Computational Time-Lapse

2018年10月28日

Computational Time-Lapse

Whenever you've seen a time-lapse video, it probably looked fairly 'jumpy' from one frame to the next with objects and…

See all articles

Leading, and Pairing on, ML projects

Jonathan Hilgart

Machine Learning, LLMs, and Information Retrieval

Lead Core Expectations

领英推荐

Pair Core Expectations

Pair Secondary Expectations

Anticipated Q&A

Jonathan Hilgart的更多文章

社区洞察

其他会员也浏览了

Components of MLOps

Skip the Proof of Concept

AI Technology for Engagement, Quality, Acceleration, and Productivity in Code Delivery in the IT Field

Unveiling the Crucial Role of Model Versioning and Continuous Experimentation of AI/ML Use Cases in Production

Faster, Smarter, and Cost-Efficient: How LLM Distillation is Transforming AI in SDLC

Lean Responsible AI Development - from Science Fair to Business Outcome

GAIM. Generative AI Management a new way for successful project management for gen Z

The Hidden Trap Sinking AI Projects

Mastering Continuous Integration and Deployment for Machine Learning

PRIMER 2025 RELEASED!

Lead Core Expectations

领英推荐

Pair Core Expectations

Pair Secondary Expectations

Anticipated Q&A

Jonathan Hilgart的更多文章

Testing Machine Learning Systems

Real-time audio processing on a Pi

Good Ol' Algorithms

Primitive functions and the bias they introduce

FHIR is Fire

How Computers Learn to See

Information Security & Social Inclusion

Inductive Bias for Different Machine Learning Approaches

How to Teach Computers to be `Intelligent`

Computational Time-Lapse

社区洞察

其他会员也浏览了

Components of MLOps

Skip the Proof of Concept

AI Technology for Engagement, Quality, Acceleration, and Productivity in Code Delivery in the IT Field

Unveiling the Crucial Role of Model Versioning and Continuous Experimentation of AI/ML Use Cases in Production

Faster, Smarter, and Cost-Efficient: How LLM Distillation is Transforming AI in SDLC

Lean Responsible AI Development - from Science Fair to Business Outcome

GAIM. Generative AI Management a new way for successful project management for gen Z

The Hidden Trap Sinking AI Projects

Mastering Continuous Integration and Deployment for Machine Learning

PRIMER 2025 RELEASED!