Why are arrays zero-based in so many languages?

Saleem S.

Technologist

Published: May 14, 2019

This article is a summary of a wonderful lightning talk my friend and former colleague Kent Spillner gave a while ago.

Lua is an embeddable scripting language. One of its interesting features/quirks is that its arrays are 1-based -- flouting the convention of so many other programming languages that have 0-based arrays.

The reasoning is three-fold. One-third -- the hard, mathematical part -- is well-known in computing circles. Arrays in C are really just syntactic sugar atop memory pointers. A 0-based indexing scheme makes the pointer arithmetic simpler:

a[0] = *a       // first element

a[1] = *(a + 1) // second element

...

a[n] = *(a + n) // n+1'th element

Since many of the languages with 0-based arrays came after C, they simply kept the convention.

However, this reasoning is weak. After all, with only minimal added complexity, the compiler could translate 1-based arrays in the source-code into correct pointer arithmetic in the machine-code.
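To make that "minimal added complexity" concrete, here is a small sketch in C. The helper names `at0` and `at1` are hypothetical, introduced only for illustration; they are not taken from any real compiler. The only difference between the two is the single subtraction that a compiler for a 1-based language would have to emit, or fold into the base address ahead of time.

```c
#include <assert.h>
#include <stddef.h>

/* 0-based access: the index is already the offset from the base pointer. */
int at0(const int *a, size_t i) {
    return *(a + i);
}

/* 1-based access: one extra subtraction turns the index into an offset.
   Assumes i >= 1 (hypothetical helper, for illustration only). */
int at1(const int *a, size_t i) {
    assert(i >= 1);
    return *(a + (i - 1));
}
```

With `int a[] = {10, 20, 30};`, both `at0(a, 0)` and `at1(a, 1)` read the first element.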

The second part of the reasoning is that compiler authors were influenced by the (admittedly very influential) opinion of Edsger Dijkstra, who argued in a paper that, to denote a sequence of numbers such as 2, 3, ..., 12, the following convention for the bounds of the iterating variable i is preferable:

2 ≤ i < 13

This clearly favors a 0-based indexing scheme for arrays.
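Why the half-open convention pairs naturally with 0-based indexing can be sketched in C. The function names `range_length` and `sum` are hypothetical, chosen only for this illustration. With bounds written as lower ≤ i < upper, the number of elements is simply upper - lower, and an array of n elements indexed from 0 is covered by the range 0 ≤ i < n, with no "+ 1" anywhere.

```c
#include <assert.h>
#include <stddef.h>

/* Number of integers i with lower <= i < upper.
   Assumes lower <= upper (hypothetical helper, for illustration only). */
size_t range_length(int lower, int upper) {
    assert(lower <= upper);
    return (size_t)(upper - lower);
}

/* Sum an n-element array using the half-open range 0 <= i < n. */
long sum(const int *a, size_t n) {
    long total = 0;
    for (size_t i = 0; i < n; i++) {
        total += a[i];
    }
    return total;
}
```

For the sequence in the text, `range_length(2, 13)` is 11, which matches the eleven numbers 2 through 12.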

However, there is one problem with this neat reasoning: Dijkstra's paper was published in 1982, so it is unlikely that Ritchie, Thompson, and Kernighan were influenced by Dijkstra's views when they were designing and refining C between 1972 and 1978.

Here's where we must resist insisting that the decision was based on computer science or even notions of mathematical elegance. Most of the problems in computer science are people problems, and even when you think you have a computer problem, look harder and you'll find a people problem. This gets us to the third part of the reasoning, which is a story.

In the early 1960s, IBM gave MIT a new IBM 7094 computer (price tag: USD 3.5 million in 1960s dollars) at a discount. Part of the deal was that MIT got to use the computer for 8 hours per day, the other universities in the northeastern United States for 8 hours, and IBM for the remaining 8 hours. Part of that IBM time slot was dedicated to computing handicaps for yacht races, as the president of IBM was rather fond of yacht racing in Long Island Sound. "There was a special job deck kept at the MIT Computation Center, and if a request came in to run it, operators were to stop whatever was running on the machine and do the yacht handicapping job immediately." [1]

The 7094 ran a language called BCPL, which was unusual for its time. It was designed so that small and efficient compilers could be written for it; some compilers needed as little as 16 KB of memory. Because computing time was so tightly allocated, programmers optimized compilers to keep run time to a minimum. One of these optimizations was to make arrays 0-based. After all, if your program wasn't done and the president of IBM felt like yachting, you were out of luck!

Reference

[1] The IBM 7094 and CTSS

Jeff Shurts

Technologist, Business Leader, Problem Solver, Optimizer

5 years ago

Ah, so many conventions in computer science have roots that hearken back to early IBM creations. Interesting, then, that arrays (tables, actually) in COBOL used 1-based subscripts...

Shea Dahl

Compassionate Leadership driving inclusiveness, innovation and best practices

5 years ago

Randy Brown

Ante Grgi?

software consultant specializing in b2b e-commerce solutions | venture builder

5 years ago

Good story, Saleem Siddiqui, and a great reminder to look beyond the seemingly obvious for the truth.
