Response to McKinsey: Measuring developer productivity? Episode #62
Insights for CTOs and those who one to become CTOs

Response to McKinsey: Measuring developer productivity? Episode #62

By Stephan Schmidt

Happy ?? Tuesday,

This week’s insights

  • ?? Kent Beck : Measuring developer productivity? A response to McKinsey
  • ?? Asking 60+ LLMs a set of 20 questions
  • ?? Why startups do need strategy — despite what you’ve heard

Good reading, have a nice Tuesday ?? and a great week,

Stephan #CTO Coach and ex-CTO

?? If You Only Read One Thing

Measuring developer productivity? A response to McKinsey (15 minute read)

The McKinsey report about developer productivity I’ve linked to last time made some waves on the internet. Kent Beck, who thought about developer productivity before most current developers were born, has something to add. And there are many great points about productivity in his article. Go read it up, so you have some ammunition when being called out. The core argument against the McKinsey view is about “Input < Output < Outcome < Impact”. I use that as a categorizer of goals all the time myself and you should too. Input (what he calls effort) is the worst, only use it if you have nothing else to measure success with. Lines of code is an input metric. Features are an outcome metric. Impact is the best category. Aim here. And McKinsey is mostly about “Input” metrics, whereas SPACE and DORA are Outcome and Impact metrics. And input is easier to measure, and managers don’t know what those numbers mean, so of course they will tend towards input metrics.?MUST READ

https://tidyfirst.substack.com/p/measuring-developer-productivity

?? AI

Asking 60+ LLMs a set of 20 questions (3 minute read)

Ask the same questions to many LLMs. Not only is this very interesting to understand differences between models, but also between sizes of the same model, like Falcon 40B vs Falcon 7B. I did like “Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have? Let’s think step by step.” Especially “So the number of sisters Sally has is 6.” HA take that logic!—funniest is?“erm.. 3”?- but there quite are a lot funny ones for 9, 12, 24. But there is the right answer in the results (1) somewhere.

https://benchmarks.llmonitor.com/

A GPT-4 Capability Forecasting Challenge (12 minute read)

It is difficult to know what at this point in time, AI can do and can’t do. This small game challenges you to forecast what tasks GPT-4 can succeed with.?It helped me understand that the power of GPT-4 is nuanced. Must read

https://nicholas.carlini.com/writing/llm-forecast/question/Capital-of-Paris


Apple is reportedly spending ‘millions of dollars a day’ training AI (3 minute read)

“Those involved in its development tell The Information that Apple’s most advanced LLM, known internally as Ajax GPT, has been trained on “more than 200 billion parameters” and is more powerful than OpenAI’s GPT-3.5”?200 billion is a large model, larger than the 180B Falcon model.

https://www.theverge.com/2023/9/6/23861763/apple-ai-language-models-ajax-gpt-training-spending

Exclusive: ChatGPT traffic slips again for third month in a row (6 minute read)

Do people move to the app? Do people use IDE integrations? Do people use the GPT API? Or did the novelty wear off? The article bases its conclusion on data from Similarweb, a company that estimates web traffic.?“Worldwide desktop and mobile website visits to the ChatGPT website decreased by 3.2% to 1.43 billion in August, following approximately 10% drops from each of the previous two months.”

https://www.reuters.com/technology/chatgpt-traffic-slips-again-third-month-row-2023-09-07/

Casually running a 180B parameter LLM on M2 Ultra (1 minute read)

It is a little slow on tokens/sec, and quantized down, but seems to work nicely. It does this by using the CPU and RAM it seems. Not clear if inference (not training) will be a CPU or GPU task in the future. Or if GPUs get massively more RAM, because LLM models get bigger by the week.

https://twitter.com/ggerganov/status/1699791226780975439

Microsoft announces new Copilot Copyright Commitment for customers (10 minute read)

Startups move aside, here comes a company with deep pockets. Is copyright the factor which decides about SaaS AI success??“As customers ask whether they can use Microsoft’s Copilot services and the output they generate without worrying about copyright claims, we are providing a straightforward answer: yes, you can, and if you are challenged on copyright grounds, we will assume responsibility for the potential legal risks involved.”

https://blogs.microsoft.com/on-the-issues/2023/09/07/copilot-copyright-commitment-ai-legal-concerns/

Stories I’ve Enjoyed This Week

Why startups do need strategy — despite what you’ve heard (10 minute read)

I’d argue most of the talk in the article—“Force yourself to write down a single clear goal which is stretching but achievable in five years.”?is about visions not strategy, but the points are valid. I talk to startup CEOs and CTOs, and they have no strategy. Therefore, features are all over the place, work is chaos, and success seems random. “Why startups do need strategy” gives some good step by step advice on how tyo achieve a strategy - and as CTO you should have a tech strategy too, where is tech in 5 years?

https://sifted.eu/articles/startups-need-strategy

The end of the Googleverse (8 minute read)

Similarweb says GPT traffic drops (see other article), The Verge sees the end of Google. Google dominated the way we saw the internet - everything through the Google lens. Then through a social media lens. Is AI next? But more to you the CTO: What if SEO traffic of your company drops 90%. Can it survive? What is plan B? Not your problem, but what if marketing comes screaming?

https://www.theverge.com/23846048/google-search-memes-images-pagerank-altavista-seo-keywords

Signs that you’re working as a “first team” and why it matters (16 minute read)

Most CTOs I know neglect working with their peers on the management team.?“Working as a “first team” with your fellow leaders is a mindset and operational mode in which you prioritize supporting your peers instead of your direct reports.”?First team looks extreme when looking at it on a management level, but there are good insights for CTOs. First, you’re in a management team, you’re no longer just a techie. The VP of Marketing and the CFO are your peers. Second, create a management team of your direct reports. And do “first team” with them.?This was the most successful thing to make my CTO jobs easier, happier and more successful.

https://www.lenareinhard.com/articles/first-team-signs-why-it-matters

UK Air-Traffic Software Misread Spots on Map to Cause Outage (1 minute read)

“The UK’s worst air-traffic outage in a decade was caused by an anomaly in the airspace manager’s software system, which confused two geographical checkpoints separated by some 4,000 nautical miles. [..] The glitch triggered a shutdown of the software system run by NATS for safety reasons, according to a preliminary report”?Again a crisis happens because two problems interact. And how does your system work with bad input? And do you Fuzzing to find bad input? No? Start.

https://www.bloomberg.com/news/articles/2023-09-06/uk-air-traffic-software-misread-two-spots-on-map-causing-outage

All You Need to Become a Servant Leader (11 minute read)

I’m a fan of being a servant leader—not. Well again I am. Can you make up your mind, please? What is a servant leader? “Servant leadership is a leadership style that empowers employees based on the premise that if employees are satisfied and cared for, they will be more motivated to give their best.” Agreed. But the terminology is manager. I do agree as a manager people on your team are your biggest lever for success. And if you care for them (and respect them, and you’re loyal to them), they will give their best. But we need to stop throwing around manager and leader interchangeably. A leader is someone who leads. Doh. A leader points out where to go, and then leads people there. But yes, managers should be servant managers.

https://effyai.substack.com/p/all-you-need-to-become-a-servant

Move Over, Software Developers – In The Name Of Cybersecurity, The Government Wants To Drive (108 minute read)

Engineers neglected security for too long. This will become cumbersome: “at its essence it is no small thing that the White House proposes: legislation instructing people on how to code their software and requiring adherence to those instructions.”*

https://www.techdirt.com/2023/09/06/move-over-software-developers-in-the-name-of-cybersecurity-the-government-wants-to-drive/

Toyota Shut Down 14 Factories Due to ‘Insufficient Disk Space’ (8 minute read)

Largest car manufacturer in the world. Doh.?“The system malfunction was caused by the unavailability of some multiple servers that process parts orders. As for the circumstances, regular maintenance work was performed on August 27, the day before the malfunction occurred. During the maintenance procedure, data that had accumulated in the database was deleted and organized, and an error occurred due to insufficient disk space, causing the system to stop. Since these servers were running on the same system, a similar failure occurred in the backup function, and a switchover could not be made.”?Tell this your CEO so she is mentally prepared when you make the same mistake?I know, you won’t, until you do.

https://global.toyota/en/newsroom/corporate/39732568.html

How Universal Are Our Emotions? (23 minute read)

We assume emotions are universal. They are not. Emotions are those for which we have word. Germans have different emotions than people in the US-perceived ones. With more international teams due to remote work, keep this in mind. People on your team might have emotions you don’t have. And you have some, they can’t express.

https://www.newyorker.com/magazine/2022/08/08/how-universal-are-our-emotions


?

要查看或添加评论,请登录

Stephan ?? Schmidt的更多文章

  • Why Apple fails with AI

    Why Apple fails with AI

    Top 50 list of things for building successful products by Stephan Schmidt Happy ?? Monday, Welcome to my opinionated…

    2 条评论
  • Ugly code over perfect code | Amazing CTO

    Ugly code over perfect code | Amazing CTO

    ?? Should you use Claude Code? by Stephan Schmidt Happy ?? Monday, Welcome to my opinionated newsletter. This week’s…

    1 条评论
  • `No more hiring developers` says Salesforce CEO

    `No more hiring developers` says Salesforce CEO

    ?? Zero Bug Policy - no more bugs? by Stephan Schmidt Happy ?? Sunday, Welcome to my opinionated newsletter. This week…

    3 条评论
  • WASM in - React out

    WASM in - React out

    ??? Infrastructure decisions: What works and what doesn't by Stephan Schmidt Happy ?? Monday, Welcome to my opinionated…

    5 条评论
  • If the CEO asks: How does DeepSeek-R1 work - a technology perspective

    If the CEO asks: How does DeepSeek-R1 work - a technology perspective

    ?? Subscribe here for emails every Sunday FIRST ?? Job trends for different technologies by Stephan Schmidt Happy ??…

  • How to organize your teams + The tyranny of structurelessness

    How to organize your teams + The tyranny of structurelessness

    ?? To-Do ≠ To-Think | by Stephan Schmidt Happy ?? Sunday, Welcome to my opinionated newsletter. This week’s insights ??…

    5 条评论
  • OKRs == delivering the roadmap?

    OKRs == delivering the roadmap?

    ?? Killed by LLM by Stephan Schmidt Happy ?? Sunday, Welcome to my opinionated newsletter, today I have something…

    7 条评论
  • Weak vs. Strong Engineers | Beware of the scheming AI

    Weak vs. Strong Engineers | Beware of the scheming AI

    by Stephan Schmidt ?? Don't use fake deadlines ?? Subscribe here for emails every Sunday FIRST Happy New Year! And a ??…

    3 条评论
  • Why Europe has no trillion dollar company

    Why Europe has no trillion dollar company

    How to organize communication in startups by Stephan Schmidt ?? Subscribe here for emails every Sunday FIRST Happy ??…

    5 条评论
  • ?? 100th Issue! Why Your PR Process is Killing Morale & The Contagion Window

    ?? 100th Issue! Why Your PR Process is Killing Morale & The Contagion Window

    by Stephan Schmidt ?? Subscribe here for emails every Sunday FIRST Happy ?? Sunday, this is the 100th issue of my…

    4 条评论

社区洞察

其他会员也浏览了