Tech Lessons for CEOs and Boards: To Build a Data Pipeline, Remove the People
Andrew Tahvildary
CTO | Tech & Strategy Advisor | Scaling Engineering Teams for Growth, IPOs & M&A | Startup Mentor/Advisor
A successful B-round startup company that I worked with in the past, had an ambitious business goal to 3x revenue for the year.
In order to achieve that business goal, we had to:?
This ambitious plan also? required us to accommodate third-party data in addition to the company's own data sources.?
So we needed to evaluate the company’s data platform to make sure it could handle the new sources of data. Was there any technical debt built up in the platform that needed to be addressed? We started by evaluating the platform data pipeline. Basically, these are automated processes that shift data from one location or format to another. A basic example of this might be collecting data in one system and then moving it to another system for further analysis. Data pipelines are an essential component of any business that performs or provides detailed, high-volume data analytics. We needed to pull apart the data pipeline to ensure it could accommodate third-party sources.?
In fast growing companies, teams sometimes make short-term decisions that may turn into technical debt. Technical debt is a term technologists and engineers use to describe software, hardware, and any other element of technology that will require further investment and modification at a later date. For example, an engineering team might choose to use Google Maps for its initial data source for maps with the understanding that later on it will become too expensive and need to be switched out for something more economically scalable. Or an infrastructure team might elect to write code that quickly solves a problem but will be harder to maintain and will require a rip-and-replace at a later date.
There were a few technology upgrades that we needed to address among various technical debt buckets. But the biggest source of product risk was “people in the middle of a data pipeline”. This is a pretty common but very serious example of technical debt — the use of people to do jobs that should have been done by technology. The “people in the middle of a data pipeline” problem impacted everything in our product and technology expansion plans.?
People-in-the-middle inject many serious risks into both data pipelines and technology products as a whole, Those include”?
领英推荐
For all these reasons, we made it a Priority-1 project to fix our data pipeline and make it entirely automated. Yes, people still need to monitor the data pipeline to make sure its working and to periodically verify the accuracy of the data. But the pipeline can now take raw data inputs from third-parties and run them through workflows using tools we built or configured to generate outputs that can be shared directly with customers — no human interaction required.?
The lesson of this story is simple. When your company needs to grow quickly and make changes to its product rapidly, technical debt tends to come due. Yes, all the decisions you made before that accrued technical debt might have helped you to get to that moment. But often a short-term decision and the associated debt becomes a major blocker to progress.?
All those things could create a bottleneck and complexity when you're trying to scale. CEOs and non-technical managers looking to build products and companies that scale should always ask three simple questions:
Every competent CTO and VP of Engineering has asked these questions and thinks about these issues constantly. CEOs and Board Members can and should work with the engineering team to track any build up on technical debt. Not all technical debt needs to be fixed immediately. Some of it can go on for many years. All technical debt, however, should be tracked and considered because at the end of the day, all debts must be paid — sooner or later.
Note: Thanks to Alex Salkever for helping edit this post
Head of Venture Portfolio | Chief Product Officer | Board Director
2 年These are great insights, ?? Andrew Tahvildary. Thank you for sharing.
Techquity.ai / Vionix Biosciences / Product + GTM Advisor (focus on Open Source, AI, and where they meet) / Author of books about Technology, AI and Society / Strong Opinions, Gently Argued
2 年So I wonder Andrew Tahvildary what do you recommend for building pipelines on data sources that frequently change their structure? Curious, b/c I know web scraping is MISERABLE. Most CEOs think you can just point a scraper at stuff and it works!