Ultralearning: Master Hard Skills (3)

Ultralearning: Master Hard Skills (3)

Tactic 3: The Flight Simulator Method

Immersion and projects are great, but for many skills there’s no way to actually practice the skill directly. For skills such as piloting a plane or performing surgery, it’s not even legal to practice them in a real situation until you’ve already invested considerable time into training. How can you overcome this?

It’s important to note that what matters for transfer is not every possible feature of the learning environment, such as what room you’re in or what clothes you’re wearing while you learn. Rather, it’s the cognitive features—situations where you need to make decisions about what to do and cue knowledge you’ve stored in your head. This suggests that when direct practice is impossible, a simulation of the environment will work to the degree to which it remains faithful to the cognitive elements of the task in question. For flying a plane, this means that practicing on a flight simulator may be as good for learning as flying an actual plane if it sufficiently calls on the discriminations and decisions a pilot needs to make. Better graphics and sounds aren’t important, unless they change the nature of the decisions being made or the cues pilots receive for when to use certain skills or knowledge.8

When evaluating different methods for learning, those that significantly simulate the direct approach will transfer a lot better. Therefore, if you’re trying to evaluate what’s the best way to learn French before your trip to France, you’ll get more (although not perfect) transfer from doing Skype tutoring than you will from flipping through flash cards.

Tactic 4: The Overkill Approach

The last method I’ve found for enhancing directness is to increase the challenge, so that the skill level required is wholly contained within the goal that is set. Tristan de Montebello, when preparing to compete in the World Championship of Public Speaking, pushed to speak at middle schools, giving early versions of his talk. His feeling was that the feedback he received at Toastmasters clubs might be too soft or congratulatory to cut deep at what worked and didn’t work in his speech. Middle school students, in contrast, would be merciless. If a joke he said wasn’t funny or his delivery was boring or cheesy, he would be able to tell immediately from their faces what needed to be reworked. The overkill approach is to put yourself into an environment where the demands are going to be extremely high, so you’re unlikely to miss any important lessons or feedback.

Going into this environment can feel intense. You may feel as though you’re “not ready” to start speaking a language you’ve barely learned. You may be afraid to stand onstage and deliver a speech you haven’t memorized perfectly. You might not want to dive right into programming your own application and prefer to stick to watching videos where someone else does the coding. But these fears are often only temporary. If you can get enough motivation to start this method, it’s often a lot easier to continue it long term. The first week in each new country in my language learning project was always a shock, but soon it became completely normal to live entirely within the new language.

One way you can overkill a project is to aim for a particular test, performance, or challenge that will be above the skill level you strictly require. Benny Lewis likes to attempt language exams, because they provide a concrete challenge. In his German project, he wanted to attempt the highest-level exam, because his awareness of that goal would push him to study more than he might if he were satisfied with comfortable conversations alone. Another friend of mine decided to exhibit her photography as a means of pushing her skills and talent. Deciding in advance that your work will be viewable publicly alters your approach to learning and will gear you toward performance in the desired domain, rather than just checking off boxes of facts learned.

Learn Straight from the Source

Learning directly is one of the hallmarks of many of the successful ultralearning projects I’ve encountered, particularly because of how different it can be from the style of education most of us are used to. Whenever you learn anything new, it’s a good habit to ask yourself where and how the knowledge will manifest itself. If you can answer that, you can then ask whether you’re doing anything to tie what you’re learning to that context. If you’re not, you need to tread carefully, as the problem of transfer may rear its ugly head.

The act of learning directly, however, is only half of the answer to the question of what you should do to learn well. Doing a lot of direct practice in the environment where you want to eventually use your skills is an important start. However, in order to master skills quickly, bulk practice isn’t enough. This brings us to our next principle of ultralearning: drill.

Chapter VII

Principle 4

Drill

Attack Your Weakest Point

Take care of the bars and the piece will take care of itself.

—Philip Johnston, composer

Of all the roles Benjamin Franklin played throughout his life—entrepreneur, inventor, scientist, diplomat, and founding father of the United States—he was first and foremost a writer. It was in writing that he first found success. After fleeing Boston to escape the final years of his indentured labor as an apprentice to his brother’s printing company, he went to Philadelphia. There, penniless and unknown, he first worked for another printing company before establishing himself as a competitor. His Poor Richard’s Almanack became an international bestseller and allowed him to retire at forty-two. However, it was in the latter half1 of his life that his writing would have world-changing consequences.2

As a scientist, Franklin was bad at math and more interested in practical consequences than in grand theories of the universe. However, his prose was “written equally well for the uninitiated as well as the philosopher,” noted the English chemist Sir Humphrey Davy, adding “he has rendered his details as amusing as well as perspicuous.”3 The strength of his writing and its practical consequences made him an international sensation.

In politics, it was again Franklin’s writing talent that helped him win allies and persuade potential antagonists. Prior to the American Revolution, he penned an essay, supposedly written by King Frederick II of Prussia, entitled “An Edict by the King of Prussia.” In it he satirized British-American relations by proposing that, due to early settlers of the British Isles being of German origin, “revenue [should] be raised from said colonies in Britain” by the Prussian king.

Later, his skill with a pen would make his writing into the Declaration of Independence, where he edited Thomas Jefferson’s words to become the now famous “We hold these truths to be self-evident.”

With such an amazing skill for writing and persuasion, it’s worth asking how Franklin acquired it. Fortunately, unlike so many other great writers, whose efforts at honing their skills remain mysterious, we have Franklin’s own words for how he did it. In his Autobiography, he details his sophisticated efforts to slice apart sections of his writing skill for practice as a young boy. Beginning with a childhood debate against a friend about the merits of educating women (Franklin was for, his friend against), his father noticed that aspects of his writing lacked persuasive ability. Franklin thus “determined to endeavor at improvement” and went about a series of exercises to practice his writing skill.

One such exercise he documents was taking a favorite magazine of his, The Spectator, and taking notes on articles that appeared there. He would then leave the notes for a few days and come back to them, trying to reconstruct the original argument from memory. After finishing, he “compared my Spectator with the original, discovered some of my faults, and corrected them.” Realizing that his vocabulary was limited, he developed another strategy. By turning the prose into verse, he could replace words with synonyms that matched in meter or rhyme. To improve his sense of the rhetorical flow of an essay, he tried his imitation approach again, but this time he jumbled up the hints so he would have to determine the correct order of the sequence of ideas as he wrote again.

Once he had established some of the mechanics of writing, he moved on to the more difficult task of writing in a style that would persuade. When reading an English grammar book, he was exposed to the idea of the Socratic method, of challenging another’s ideas through probing questions rather than direct contradiction. He then went to work, carefully avoiding “abrupt contradiction and positive argumentation,” instead focusing on being the “humble inquirer and doubter.”

Those early efforts produced results. At age sixteen, he wanted to try to get his work published. Fearing that his elder brother might reject it out of hand, however, he disguised his penmanship and submitted his essay under the pseudonym Silence Dogood, purporting to be a widowed woman living in the countryside. His brother, not knowing the true author, approved and published the essay, so Franklin returned and wrote more. Although initiated as a ruse to have his writing considered fairly, Franklin’s practice in adopting other characters would prove invaluable in his later career. Poor Richard’s Almanack, for example, was written from the perspective of a simple husband and wife, Richard and Bridget Saunders, and his political essays such as his “An Edict by the King of Prussia” similarly made use of his flexibility to adopt imagined perspectives.

It’s difficult to imagine Franklin having become the household name he is today without his having first established a mastery of writing. Whether it was business, science, or statecraft, the unchanging core of what made him persuasive and great was his ability to write well. What distinguished Franklin wasn’t merely the amount he wrote or his raw talent but how he practiced. The way in which he decided to break apart the skill of writing and practice its elements in isolation enabled him to master writing at a young age and apply it to the other pursuits for which he would later become famous. Such careful analysis and deliberate practice forms the basis for the fourth ultralearning principle: drill.

The Chemistry of Learning

In chemistry, there’s a useful concept known as the rate-determining step. This occurs when a reaction takes place over multiple steps, with the products of one reaction becoming the reagents for another. The rate-determining step is the slowest part of this chain of reactions, forming a bottleneck that ultimately defines the amount of time needed for the entire reaction to occur. Learning, I’d like to argue, often works similarly, with certain aspects of the learning problem forming a bottleneck that controls the speed at which you can become more proficient overall.

Consider learning mathematics. This is a complex skill that has many different parts: you need to be able to understand the fundamental concepts, you need to be able to remember the algorithm for solving a certain type of problem, and you need to know in what context it applies. Underlying this ability, however, is the ability to do arithmetic and algebra so as to be able to solve the problems in question. If your arithmetic is weak or your algebra sloppy, you’ll get the wrong answers even if you’ve mastered the other concepts.

Another rate-determining step could be vocabulary when learning a foreign language. The number of sentences you can successfully utter depends on how many words you know. If you know too few, you won’t be able to talk about very much. If you were able to suddenly inject hundreds of new words into your mental database, you might drastically expand your fluency even if your pronunciation, grammar, or other linguistic knowledge remains unchanged.

This is the strategy behind doing drills. By identifying a rate-determining step in your learning reaction, you can isolate it and work on it specifically. Since it governs the overall competence you have with that skill, by improving at it you will improve faster than if you try to practice every aspect of the skill at once. That was Franklin’s insight that allowed him to rapidly improve his writing: by identifying components of the overall skill of writing, figuring out which mattered in his situation, and then coming up with clever ways to emphasize them in his practice, he could get better more quickly than if he had just spent a lot of time writing.

Drills and Cognitive Load

Rate-determining steps in learning—where one component of a complex skill determines your overall level of performance—are a powerful reason to apply drills. However, they aren’t the only one. Even if there isn’t one isolatable aspect of the skill that is holding back your performance, it may still be a good idea to apply drills.

The reason is that when you are practicing a complex skill, your cognitive resources (attention, memory, effort, etc.) must be spread over many different aspects of the task. When Franklin was writing, he had to consider not only the logical content of the argument he was making but word choice and rhetorical style. This can create a learning trap. In order to improve your performance in one aspect, you may need to devote so much attention to that one aspect that the other parts of your performance start to go down. If you can judge yourself only on how much you improve at the overall task, it can lead to a situation in which your improvement slows down because you will be getting worse at the overall task while becoming better at a specific component of it.

Drills resolve this problem by simplifying a skill enough that you can focus your cognitive resources on a single aspect. When Franklin focused on reconstructing the order of an essay he had read previously, he could devote all his attention to asking what sequence of ideas leads to a good essay rather than also needing to worry about word usage, grammar, and the content of the arguments.

Astute readers will probably notice a tension between this principle and the last. If direct practice involves working on a whole skill nearest to the situation in which it will eventually be used, drills are a pull in the opposite direction. A drill takes the direct practice and cuts it apart, so that you are practicing only an isolated component. How can you resolve this contradiction?

The Direct-Then-Drill Approach

The tension between learning directly and doing drills can be resolved when we see them as being alternating stages in a larger cycle of learning. The mistake made in many academic strategies for learning is to ignore the direct context or abstract it away, in the hope that if enough component skills are developed, they will eventually transfer. Ultralearners, in contrast, frequently employ what I’ll call the Direct-Then-Drill Approach.

The first step is to try to practice the skill directly. This means figuring out where and how the skill will be used and then trying to match that situation as close as is feasible when practicing. Practice a language by actually speaking it. Learn programming by writing software. Improve your writing skills by penning essays. This initial connection and subsequent feedback loop ensure that the transfer problem won’t occur.

The next step is to analyze the direct skill and try to isolate components that are either rate-determining steps in your performance or subskills you find difficult to improve because there are too many other things going on for you to focus on them. From here you can develop drills and practice those components separately until you get better at them.

The final step is to go back to direct practice and integrate what you’ve learned. This has two purposes. The first is that even in well-designed drills, there are going to be transfer hiccups owing to the fact that what was previously an isolated skill must be moved to a new and more complex context. Think of this as being like building the connective tissue to join the muscles you strengthened separately. The second function of this step is as a check on whether your drill was well designed and appropriate. Many attempts to isolate a drill may end in failure because the drill doesn’t really cut at the heart of what was difficult in real practice. That’s okay; this feedback is important to help you minimize wasting time learning things that don’t matter much to your end goals.

The earlier you are in the learning process, the faster this cycle should be. Cycling between direct practice and drills, even within the same learning session, is a good idea when you’re just starting out. Later, as you get better at what you are trying to do and a lot more effort is required to noticeably improve your overall performance, it’s more acceptable to take longer detours into drills. As you approach mastery, your time may end up focused mostly on drills as your knowledge of how the complex skill breaks down into individual components becomes more refined and accurate and improving any individual component gets harder and harder.

Tactics for Designing Drills

There are three major problems when applying this principle. The first is figuring out when and what to drill. You should focus on what aspects of the skill might be the rate-determining steps in your performance. Which aspect of the skill, if you improved it, would cause the greatest improvement to your abilities overall for the least amount of effort? Your accounting skills might be limited by the fact that your Excel knowledge is superficial, which prevents you from applying all the things you know to practical situations. Your language abilities may be held back by having inaccurate pronunciation, even though you know the right words. Look as well to aspects of a skill that you need to juggle simultaneously. These may be harder to improve because you can’t devote enough cognitive resources to improving them. When writing a new article, you may have to juggle research, storytelling, vocabulary, and many other aspects simultaneously, making it hard to get a lot better at just one. Determining what to drill may seem tricky, but it doesn’t have to be. The key is to experiment. Make a hypothesis about what is holding you back, attack it with some drills, using the Direct-Then-Drill Approach, and you can quickly get feedback about whether you’re right.

The second difficulty with this principle is designing the drill to produce improvement. This is often hard because even if you recognize an aspect of your performance you’re weak on, it may be tricky to design a drill that trains that component without artificially removing what makes it difficult in actual application. Franklin’s drills were uncommon, I believe, because most people, even recognizing specific deficits in their writing ability, would not have had the ingenuity to find ways to drill subskills such as ordering arguments persuasively and emulating a successful writing style.

Finally, doing drills is hard and often uncomfortable. Teasing out the worst thing about your performance and practicing that in isolation takes guts. It’s much more pleasant to spend time focusing on things you’re already good at. Given this natural tendency, let’s look at some good ways to do drills so you can start applying them yourself.

Drill 1: Time Slicing

The easiest way to create a drill is to isolate a slice in time of a longer sequence of actions. Musicians often do this kind of training when they identify the hardest parts of a piece of music and practice each one until it’s perfect before integrating it back into the context of the entire song or symphony. Athletes similarly engage in this process when they drill skills that are normally a fraction of total playing time, such as layups or penalty shots. In the early phase of learning a new language, I often obsessively repeat a few key phrases, so they quickly get embedded into my long-term memory. Look for parts of the skill you’re learning that can be decomposed into specific moments of time that have heightened difficulty or importance.

Drill 2: Cognitive Components

Sometimes what you’ll want to practice isn’t a slice in time of a larger skill but a particular cognitive component. When speaking a language, grammar, pronunciation, and vocabulary occur at all moments, but they form different cognitive aspects that must be managed simultaneously. The tactic here is to find a way to drill only one component when, in practice, others would be applied at the same time. When learning Mandarin Chinese, I would do tone drills that involved pronouncing pairs of words with different tones and recording myself speaking. That allowed me to practice producing different tones quickly, without the distraction of needing to remember what the words meant or how to form grammatically correct sentences.

Drill 3: The Copycat

A difficulty with drills in many creative skills is that it is often impossible to practice one aspect without also doing the work of the others. When Franklin was trying to improve his ability to order arguments logically, for instance, it wasn’t possible to do so without writing an entire essay. To solve this problem in your own learning, you can take a page from Franklin: by copying the parts of the skill you don’t want to drill (either from someone else or your past work), you can focus exclusively on the component you want to practice. Not only does this save a lot of time, because you need to repeat only the part you’re drilling, it also reduces your cognitive burden, meaning you can apply more focus to getting better at that one aspect. When practicing drawing, I started by drawing not just from photos but from drawings other people had done. That helped me focus on the skill of accurately rendering the picture, simplifying the decision about how to frame the scene and which details to include. For flexible creative works, editing works you’ve created in the past may have the same effect, allowing you to selectively improve an aspect of your work without having to consider the other demands of an original composition.

Drill 4: The Magnifying Glass Method

Suppose you need to create something new and can’t edit or separate out the part you want to practice. How can you create a drill? The Magnifying Glass Method is to spend more time on one component of the skill than you would otherwise. This may reduce your overall performance or increase your input time, but it will allow you to spend a much higher proportion of your time and cognitive resources on the subskill you want to master. I applied this method when trying to improve my ability to do research when writing articles, by spending about ten times as long on research as I had previously. Although I still had to do all the other parts of writing the article, by spending much longer on research than I would normally, I could develop new habits and skills for doing so.

Drill 5: Prerequisite Chaining

One strategy I’ve seen repeatedly from ultralearners is to start with a skill that they don’t have all the prerequisites for. Then, when they inevitably do poorly, they go back a step, learn one of the foundational topics, and repeat the exercise. This practice of starting too hard and learning prerequisites as they are needed can be frustrating, but it saves a lot of time learning subskills that don’t actually drive performance much. Eric Barone, for instance, started his pixel art experiments simply by making them. When he struggled with certain aspects, such as colors, he went back, learned color theory, and repeated his work. Benny Lewis has a similar habit of starting with speaking from a phrase book and only later learning the grammar that explains how the phrases function.

Mindful Drilling

To many, the idea of drilling may seem to be a push in the wrong direction. We’ve all spent time doing homework designed to drill into us facts and procedures that turned out to be a total waste of time. That was often because we didn’t know the reasons behind what we were practicing or how it fit into a broader context. Drilling problems without context is mind-numbing. However, once you’ve identified that it’s the bottleneck preventing you from going further, they become instilled with new purpose. In ultralearning, which is directed by the student, not an external source, drills take on a new light. Instead of being forced to do them for unknown purposes, it is now up to you to find a way to enhance the learning process by accelerating learning on the specific things that you find most difficult. In this sense, drills take on a very different flavor in ultralearning as opposed to traditional learning. Far from being meaningless drudgery, carefully designed drills elicit creativity and imagination as you strive to solve a more complex learning challenge by breaking it into specific parts.

Drills are hard to do, which is why many of us would rather avoid them. When we do engage in drills, it’s often in subjects where we feel competent and comfortable. Drills require the learner not only to think deeply about what is being learned but also figure out what is most difficult and attack that weakness directly rather than focus on what is the most fun or what has already been mastered. This requires strong motivation and a comfort with learning aggressively. Franklin, in his Autobiography, remarked about the lengths he went to so he could dedicate himself to his writing drills: “My time for these exercises and for reading was at night, after work or before it began in the morning.” Despite the prominence writing would play in his life, Franklin still had to work long hours under his taskmaster brother in the print shop, diligently improving his craft in what little leisure time he had. Eric Barone similarly repeated his pixel art dozens of times, going back to master prerequisite concepts and theory until he got it perfect.

The difficulty and usefulness of drills repeat a pattern that will recur throughout the ultralearning principles: that something mentally strenuous provides a greater benefit to learning than something easy. Nowhere is this pattern more clear than in the next principle, retrieval, where difficulty itself may be the key to more effective learning.

Chapter VIII

Principle 5

Retrieval

Test to Learn

It pays better to wait and recollect by an effort from within, than to look at the book again.

—William James, psychologist

In the spring of 1913, the mathematician G. H. Hardy received a letter that would forever define the course of his life. Sent by an accounting clerk working for the Port Trust Office of Madras in India, the letter contained a humble note of introduction along with some startling assertions. The author claimed that he had found theorems for problems that the best mathematical minds of the time had yet to solve. What’s more, he claimed that he had “no University education” and had derived these results from his own solitary investigations.1

Receiving letters from amateur crackpots who claimed to have solutions to famous problems was a common occurrence for someone of Hardy’s stature in mathematics, so at first he simply dismissed the letter as being more of the same. Still, flipping through the several pages of notes attached to the letter, the equations wouldn’t leave his mind. When he found himself thinking about them hours later, he brought the letter to the attention of his colleague John Littlewood. As the two of them toyed at trying to prove the strange assertions, they found that some of them they were able to prove with great effort, while others remained, in Hardy’s words, “scarcely possible to believe.” Maybe, Hardy thought, this wasn’t a letter from a crackpot but something rather different.

The formulas written were so bizarre and alien that Hardy remarked, “They must be true because, if they were not true, no one would have had the imagination to invent them.” What he only vaguely understood that day was that he had just had his first introduction to one of the most brilliant and bizarre mathematicians of all time, Srinivasa Ramanujan.

Ramanujan’s Genius

Before writing his letter to Hardy, which changed the course of mathematical history, Ramanujan was a poor, pudgy south Indian boy with a special love of equations. More than anything else, he loved math. In fact, his love of math often got him into difficulties. His unwillingness to study other subjects flunked him out of university. Equations were all he cared about. In his spare time and during stretches of unemployment, he would sit for hours on the bench in front of his family home, slate in hand, playing with formulas. Sometimes he would stay up so late that his mother would need to put food into his hand so he would eat.

As he was thousands of miles away from the center of mathematics of his day, access to high-quality textbooks was quite a challenge for Ramanujan. One resource he did encounter and mined extensively was a volume by George Shoobridge Carr called A Synopsis of Elementary Results in Pure and Applied Mathematics. Carr himself was hardly a towering figure of mathematical genius. The book, intended as a guide for students, included large lists of various theorems from different fields of mathematics, usually without explanation or proof. However, even without having proofs or explanations available, Carr’s book became a powerful resource in the hands of someone smart and obsessed like Ramanujan. For instead of simply copying and memorizing how certain theorems were derived, he had to figure them out for himself.

Though many commentators of the time, including Hardy, argued that Ramanujan’s impoverished upbringing and late access to the cutting edge of mathematics likely did irreparable harm to his genius, modern psychological experiments may offer an alternative perspective, for when Ramanujan dealt only with Carr’s extensive list of theorems using his own quirky obsession with mathematical formulas, he was unwittingly practicing one of the most powerful methods known to build a deep understanding.

The Testing Effect

Imagine you’re a student preparing for an exam. You have three choices about how you can allocate your limited studying time. First, you can review the material. You can look over your notes and book and study everything until you’re sure you’ll remember it. Second, you can test yourself. You can keep the book shut and try to remember what was in it. Finally, you can create a concept map. You can write out the main concepts in a diagram, showing how they’re organized and related to other items you need to study. If you can pick only one, which one should you choose to do best on the final exam?

This is essentially the question posed by the psychologists Jeffrey Karpicke and Janell Blunt in one study examining students’ choice of learning strategy.2 In the study, students were divided into four groups, each given the same amount of time but told to use different study strategies: reviewing the text a single time, reviewing it repeatedly, free recall, and concept mapping. In each group, students were asked to predict their score on the upcoming test. Those who did repeated reviewing predicted that they’d score the best, followed by the single-study and concept-mapping groups. Those who practiced free recall (trying to remember as much as they could without looking in the book) predicted the worst for their final performance.

The actual results, however, weren’t even close. Testing yourself—trying to retrieve information without looking at the text—clearly outperformed all other conditions. On questions based directly on the content of the text, those who practiced free recall remembered almost 50 percent more than the other groups. How could students, who have spent years getting firsthand experience about what matters to learning, be so misguided about what actually produces results?

One might be tempted to argue that this benefit of self-testing is an artifact of the way success is measured. The principle of directness asserts that transfer is difficult. Since self-testing and actual testing are most similar, perhaps it is this similarity that allows this method to work better. Had the method of evaluation differed, it might be reasonable to suspect that review or concept mapping might come out on top. Interestingly, in another experiment, Karpicke and Blunt showed that this wasn’t the explanation, either. In this experiment the final test was to produce a concept map. Despite the overwhelming similarity to the evaluation task, free recall still did better than using concept mapping to study.

Another possible explanation for why self-testing works is feedback. When you review something passively, you don’t get any feedback about what you know and don’t know. Since tests usually come with feedback, that might explain why students who practiced self-testing beat the concept mappers or passive reviewers. Though it is true that feedback is valuable, once again, retrieval doesn’t simply reduce down to getting more feedback. In the experiments mentioned, students were asked to do free recall but weren’t provided any feedback about items they missed or got wrong. The act of trying to summon up knowledge from memory is a powerful learning tool on its own, beyond its connection to direct practice or feedback.

This new perspective on learning shows how Carr’s book, with its lists of proofs without solutions, could have become, in the hands of someone sufficiently motivated to master them, an incredible tool for becoming brilliant at math. Without the answers at hand, Ramanujan was forced to invent his own solutions to the problems, retrieving information from his mind rather than reviewing it in a book.

The Paradox of Studying

If retrieval practice—trying to recall facts and concepts from memory—is so much better for learning, why don’t students realize it? Why do many prefer to stick to concept mapping or the even less effective passive review, when simply closing the book and trying to recall as much as possible would help them so much more?

Karpicke’s research points to a possible explanation: Human beings don’t have the ability to know with certainty how well they’ve learned something. Instead, we need to rely on clues from our experience of studying to give us a feeling about how well we’re doing. These so-called judgments of learning (JOLs) are based, in part, on how fluently we can process something. If the learning task feels easy and smooth, we are more likely to believe we’ve learned it. If the task feels like a struggle, we’ll feel we haven’t learned it yet. Immediately after spending some time studying, these JOLs may even be accurate. Minutes after studying something using a strategy of passive review, students perform better than they would if they had practiced retrieval.3 The feeling that you’re learning more when you’re reading rather than trying to recall with a closed book isn’t inaccurate. The problem comes after. Test again days later, and retrieval practice beats passive review by a mile. What helped in the immediate time after studying turns out not to create the long-term memory needed for actual learning to take place.

Another explanation for why students opt for low-efficiency review instead of retrieval is that they don’t feel they know the material well enough to test themselves on it. In another experiment, Karpicke had students choose a strategy for learning. Inevitably, students who were performing more weakly elected to review the material first, waiting until they were “ready” to start practice testing.4 If through experimental intervention, however, they were forced to practice retrieval earlier, they learned more. Whether you are ready or not, retrieval practice works better. Especially if you combine retrieval with the ability to look up the answers, retrieval practice is a much better form of studying than the ones most students apply.

Is Difficulty Desirable?

What makes practicing retrieval so much better than review? One answer comes from the psychologist R. A. Bjork’s concept of desirable difficulty.5 More difficult retrieval leads to better learning, provided the act of retrieval is itself successful. Free recall tests, in which students need to recall as much as they can remember without prompting, tend to result in better retention than cued recall tests, in which students are given hints about what they need to remember. Cued recall tests, in turn, are better than recognition tests, such as multiple-choice answers, where the correct answer needs to be recognized but not generated. Giving someone a test immediately after they learn something improves retention less than giving them a slight delay, long enough so that answers aren’t in mind when they need them. Difficulty, far from being an obstacle to making retrieval work, may be part of the reason it does so.

The idea of desirable difficulties in retrieval makes a potent case for the ultralearning strategy. Low-intensity learning strategies typically involve either less or easier retrieval. Pushing difficulty higher and opting for testing oneself well before you are “ready” is more efficient. One can think back to Benny Lewis’s strategy of speaking a new language from the first day. Though this approach is high in difficulty, research suggests why it might be more useful than easier forms of classroom study. Placing himself in a more difficult context means that every time Lewis needs to recall a word or phrase, it will be remembered more strongly than when doing the same act of retrieval in a classroom setting and much better than when simply looking over a list of words and phrases.

Difficulty can become undesirable if it gets so hard that retrieval becomes impossible. Delaying the first test of a newly learned fact has some benefits over testing immediately.6 However, if you delay the test too long, the information may be forgotten entirely.7 The idea, therefore, is to find the right midpoint: far enough away to make whatever is retrieved remembered deeply, not so far away that you’ve forgotten everything. Although waiting too long before you test yourself may have disadvantages, increasing difficulty by giving yourself fewer clues and prompts are likely helpful, provided that you can get some feedback on them later.

Should You Take the Final Exam Before the Class Even Begins?

The standard way of viewing tests is that they work to evaluate the knowledge you have learned elsewhere—through reading or listening to lectures. The concept of retrieval flips this view on its head, suggesting that the act of taking a test not only is a source of learning but results in more learning than a similar amount of time spent in review. However, this still fits within the conventional idea of knowledge being first acquired, and then strengthened or tested later.

An interesting observation from retrieval research, known as the forward-testing effect, shows that retrieval not only helps enhance what you’ve learned previously but can even help prepare you to learn better.8 Regular testing of previously studied information can make it easier to learn new information. This means that retrieval works to enhance future learning, even when there is nothing to retrieve yet!

A variety of mechanisms has been proposed for explaining why this forward testing effect exists. Some researchers argue that it may be that trying to find knowledge that hasn’t been learned yet—say, by trying to solve a problem you haven’t learned the answer to yet—nonetheless helps reinforce search strategies that are put to use once the knowledge is encountered later. An analogy here is that trying to retrieve an answer that doesn’t yet exist in your mind is like laying down a road leading to a building that hasn’t been constructed yet. The destination doesn’t exist, but the path to get to where it will be, once constructed, is developed regardless. Other researchers argue that the mechanism might be one of attention. By confronting a problem you don’t yet know how to answer, your mind automatically adjusts its attentional resources to spot information that looks like a solution when you learn it later. Whatever the exact mechanism is, the reality of the forward-testing effect implies that practicing retrieval might not only benefit from starting earlier than one is “ready” but even before you have the possibility of answering correctly.

What Should Be Retrieved?

The research is clear: if you need to recall something later, you’re best off practicing retrieving it. However, this neglects an important question: What kinds of things should you invest the time in to remember in the first place? Retrieval may take less time than review to get the same learning impact, but not learning something at all is faster still. This is an important practical question. Nobody has time to master everything. During the MIT Challenge, I covered a lot of different ideas. Some were directly relevant to the kind of programming I wanted to do when I was done, so making sure I retained those ideas was a priority. Others were interesting, but since I had no plans to use them immediately, I put more effort into practicing retrieving the underlying concepts than doing technical calculations. One class I did, for instance, was Modal Logic. As I have no plans to be a logician, I can honestly say, eight years later, that I couldn’t prove theorems in modal logic today. However, I can tell you what modal logic is for and when it is used, so if a situation arises in which the techniques I learned in that class might be useful, I’d have a much better time spotting it.* There will always be some things you choose to master and others you satisfy yourself with knowing you can look up if you need to.

One way to answer this question is simply to do direct practice. Directness sidesteps this question by forcing you to retrieve the things that come up often in the course of using the skill. If you’re learning a language and need to recall a word, you’ll practice it. If you never need a word, you won’t memorize it. The advantage of this strategy is that it automatically leads you to learn the things with the highest frequency. Things that are rarely used or that are easier to look up than to memorize won’t be retrieved. These tend to be the things that don’t matter so much.

The problem with relying on direct practice exclusively is that knowledge that isn’t in your head can’t be used to help you solve problems. For instance, a programmer may realize a need to use a certain function to solve a problem but forgets how to write it out. Needing to look up the syntax might slow her down, but she will still be able to solve the problem. However, if you don’t have enough knowledge stored to recognize when you can use a function to solve your problem, no looking up can help you. Consider that over the last twenty years, the amount of knowledge easily accessible from a quick online search has exploded. Nearly any fact or concept is now available on demand to anyone with a smartphone. Yet despite this incredible advance, it is not as if the average person is thousands as times as smart as people were was a generation ago. Being able to look things up is certainly an advantage, but without a certain amount of knowledge inside your head, it doesn’t help you solve hard problems.

Direct practice alone can fail to encourage enough retrieval by omitting knowledge that can help you solve a problem but isn’t strictly necessary to do so. Consider our programmer who has two different ways to solve her problem, A and B. Option A is much more effective, but B will also get the job done. Now suppose that she knows only about option B. She’ll continue to use the way she knows to solve the problem, even though it is less effective. Here, our fledgling programmer might read about option A on a blog somewhere. But since simply reading is much less effective than repeated retrieval practice, chances are that she’ll forget about it when it comes time to apply the technique. This may sound abstract, but I’d argue that this is quite common with programmers, and often the thing separating mediocre programmers from great ones isn’t the range of problems they can solve but that the latter often know dozens of ways to solve problems and can select the best one for each situation. This kind of breadth requires a certain amount of passive exposure, which in turn benefits from retrieval practice.

How to Practice Retrieval

Retrieval works, but it isn’t always easy. Not only is the effort itself an obstacle, but sometimes it’s not clear exactly how to do it. Passive review may not be very efficient, but at least it’s straightforward: you open your book and reread material until you retain it. Most books and resources don’t have a handy list of questions at the end to test you to see if you remember what they contain. To help with that, below are some useful methods that can be used to apply retrieval to almost any subject.

Tactic 1: Flash Cards

Flash cards are an amazingly simple, yet effective, way to learn paired associations between questions and answers. The old way of creating paper flash cards to drill yourself is powerful, but it has largely been superseded by spaced-repetition systems, as I’ll discuss in Principle 7. These software algorithms can handle tens of thousands of “cards” and also organize a review schedule so you can manage them.

The major drawback of flash cards is that they work really well for a specific type of retrieval—when there’s a pairing between a specific cue and a particular response. For some forms of knowledge, for example memorizing foreign-language vocabulary, this works perfectly. Similarly, maps, anatomical diagrams, definitions, and equations can often be memorized via flash cards. However, when the situation in which you need to remember the information is highly variable, this kind of practice can have drawbacks. Programmers can memorize syntax via flash cards, but concepts that need to be applied in real programs often don’t fit the cue-response framework that flash cards demand.

Tactic 2: Free Recall

A simple tactic for applying retrieval is, after reading a section from a book or sitting through a lecture, to try to write down everything you can remember on a blank piece of paper. Free recall like this is often very difficult, and there will be many things missed, even if you just finished reading the text in question. However, this difficulty is also a good reason why this practice is helpful. By forcing yourself to recall the main points and arguments, you’ll be able to remember them better later. While doing research for this book, for instance, I would often print out journal articles and put them in a binder with a few blank sheets of paper after each of them. After I had finished reading, I’d do a quick free recall exercise to make sure I would retain the important details when it came time for writing.

Tactic 3: The Question-Book Method

Most students take notes by copying the main points as they encounter them. However, another strategy for taking notes is to rephrase what you’ve recorded as questions to be answered later. Instead of writing that the Magna Carta was signed in 1215, you could instead write the question “When was the Magna Carta signed?” with a reference to where to find the answer in case you forget. By taking notes as questions instead of answers, you generate the material to practice retrieval on later.

One mistake I’ve made in applying this technique is to focus on the wrong kinds of things to ask questions about. I tried applying this method to a book on computational neuroscience, and I ended up asking myself all sorts of detailed questions such as what was the firing rate of certain neuronal circuits or who proposed a specific theory. That wasn’t intentional but rather a by-product of lazily restating the factual content in the book as questions. What’s harder and more useful is to restate the big idea of a chapter or section as a question. Since this is often implicit, it requires some deeper thinking and not just adding a question mark to some notes you copied verbatim. One rule I’ve found helpful for this is to restrict myself to one question per section of a text, thus forcing myself to acknowledge and rephrase the main point rather than zoom in on a detail that will be largely irrelevant later.

Tactic 4: Self-Generated Challenges

The above tactics work best with retrieval of simple information, such as facts or summaries of broad ideas you might encounter in a book or lecture. However, if you’re trying to practice a skill, not merely remember information, they might not be enough. For a programmer, it’s not enough to know what an algorithm means, but be able to write it in code. In this case, as you go through your passive material, you can create challenges for yourself to solve later. You may encounter a new technique and then write a note to demonstrate that technique in an actual example. Creating a list of such challenges can serve as a prompt for mastering that information later in practice and can expand your library of tools that you are able to actually apply.

Tactic 5: Closed-Book Learning

Nearly any learning activity can become an opportunity for retrieval if you cut off the ability to search for hints. Concept mapping, the strategy that didn’t work particularly well for students in Karpicke and Blunt’s experiments, could be beefed up considerably by preventing yourself from looking at the book when generating your concept map. I suspect that had this been done in the original experiment, students using this form of closed-book concept mapping would likely have done better on the eventual test that relied on creating a concept map. Any practice, whether direct or a drill, can be cut off from the ability to look things up. By preventing yourself from consulting the source, the information becomes knowledge stored inside your head instead of inside a reference manual.

Revisiting Ramanujan

Ramanujan was smart, there’s no denying it. However, his genius was aided immeasurably by two hallmarks of the ultralearner’s tool kit: obsessive intensity and retrieval practice. As he worked on his slate from morning to night, trying to figure out Carr’s sparsely written list of theorems was incredibly hard work. But it also created the desirable difficulties that allowed him to build a huge mental library of tools and tricks that would assist him in his later mathematical efforts.

Retrieval played an important role in Ramanujan’s mathematical upbringing, but he is hardly the only one to take advantage of the tactic. In nearly every biography of great geniuses and contemporary ultralearners I have encountered, some form of retrieval practice is mentioned. Benjamin Franklin practiced his writing by reconstructing essays from memory. Mary Somerville worked through problems mentally when no candle was available for night reading. Roger Craig practiced trivia questions without looking at the answers. Retrieval is not a sufficient tool to create genius, but it may be a necessary one.

Trying to produce the answer rather than merely reviewing it is only half of a bigger cycle, however. To make retrieval really effective, it helps to know whether the answer you dredged up from your mind was correct. Just as we often avoid testing ourselves until we’re ready because struggling with a test is uncomfortable, we often avoid seeking information about our skill level until we think it will be favorable. Being able to process that information effectively, hearing the message it contains loud and clear, isn’t always easy. Yet this is also why it is so important. This brings us to the next principle of ultralearning: feedback.

Chapter IX

Principle 6

Feedback

Don’t Dodge the Punches

Everybody has a plan until they get punched in the mouth.

—Mike Tyson

From a narrow staircase in the back, Chris Rock enters the stage just as his name is being announced. With sold-out shows and HBO specials, Rock is no neophyte to stand-up comedy. His performances feel like a rock concert. With an energetic and punctuated delivery, he’s known for repeating the key phrase of a joke like the chorus of a song, the rhythm of it so precise that you get the feeling he would be able to make anything funny. And that’s exactly the problem. When everything you do is funny, how do you know what really makes a joke good?

Far from the packed concert halls and jubilant crowds, Rock walks to the mic on the modest brick-backed stage at the Comedy Cellar in Greenwich Village, New York City. In his hand are scraps of cards on which he has scribbled bits of phrases, a trick for working out new material he learned from his grandfather, a cab driver who preached on weekends. Instead of his signature aggressive style, he slumps against the back wall. This is his laboratory, and he’s going to perform comedy with the precision of an experiment.

“It’s not going to be that good,” Rock warns the crowd, who are stunned at his unannounced arrival on the small comedy stage. “Not at these prices,” he adds, joking “At these prices, I could leave right now!” He envisions the reviews: “Chris came out and he left. It was good! He didn’t tell any jokes—but it was good!” Notes in hand, Rock warns the audience playfully that this isn’t going to be a typical Chris Rock performance. Instead, he wants to work out new material under controlled conditions. “They’ll give you about six minutes because you’re famous,” he explains. “. . . then you’re back to square one.” He wants to know what’s funny, when he’s not trying to be funny.1

Rock’s method is not unique. The Comedy Cellar is famous for big-name drop-ins: Dave Chappelle, Jon Stewart, and Amy Schumer are just a few comedians who have tested out their material in front of small crowds here before performing it on prime-time specials and in concert-scale gigs. Why perform at a small club when you can easily draw large crowds and thousands of dollars from a huge performance? Why show up unannounced and then deliberately undersell your own comedic abilities? What Rock and these other famous comedians recognize is the importance of the sixth principle of ultralearning: feedback.

The Power of Information

Feedback is one of the most consistent aspects of the strategy ultralearners use. From the simple feedback of Roger Craig testing himself on Jeopardy! clues without knowing the answer to the uncomfortable feedback of Benny Lewis’s approach of walking up to strangers to speak a language he only started learning the day prior, getting feedback was one of the most common tactics of the ultralearners I encountered. What often separated the ultralearning strategy from more conventional approaches was the immediacy, accuracy, and intensity of the feedback being provided. Tristan de Montebello could have taken the normal route of carefully preparing his script and then delivering a speech once every month or two, as is the case for most Toastmasters. Instead he dove straight in, speaking several times per week, jumping among different clubs to gather different perspectives on his performance. This deep dive into feedback was uncomfortable, but the rapid immersion also desensitized him to a lot of the anxiety that being onstage can create.

Feedback features prominently in the research on deliberate practice, a scientific theory of the acquisition of expertise initiated by K. Anders Ericsson and other psychologists. In his studies, Ericsson has found that the ability to gain immediate feedback on one’s performance is an essential ingredient in reaching expert levels of performance. No feedback, and the result is often stagnation—long periods of time when you continue to use a skill but don’t get any better at it. Sometimes the lack of feedback can even result in declining abilities. Many medical practitioners get worse with more experience as their accumulated knowledge from medical school begins to fade and the accuracy of their diagnoses is not given the rapid feedback that would normally promote further learning.2

Can Feedback Backfire?

The importance of feedback probably isn’t too surprising; we all intuitively sense how getting information about what we’re doing right and wrong can accelerate learning. More interestingly, the research on feedback shows that more isn’t always better. Crucially, what matters is the type of feedback being given.

In a large meta-analysis, Avraham Kluger and Angelo DeNisi looked at hundreds of studies on the impact of providing feedback for learning.3 Though the overall effect of feedback was positive, it’s important to note that in over 38 percent of cases, feedback actually had a negative impact. This leads to a confusing situation. On the one hand, feedback is essential for expert attainment, as demonstrated by the scientific studies of deliberate practice. Feedback also figures prominently in ultralearning projects, and it’s difficult to imagine their being successful if their sources of feedback had been turned off. At the same time, a review of the evidence doesn’t paint the picture of feedback being universally positive. What’s the explanation?

Kluger and DeNisi argue that the discrepancy is in the type of feedback that is given. Feedback works well when it provides useful information that can guide future learning. If feedback tells you what you’re doing wrong or how to fix it, it can be a potent tool. But feedback often backfires when it is aimed at a person’s ego. Praise, a common type of feedback that teachers often use (and students enjoy), is usually harmful to further learning. When feedback steers into evaluations of you as an individual (e.g., “You’re so smart!” or “You’re lazy”), it usually has a negative impact on learning. Further, even feedback that includes useful information needs to be correctly processed as a motivator and tool for learning. Kluger and DeNisi noted that some of the studies that showed a negative impact of feedback occurred because the subjects themselves chose not to use the feedback constructively. They may have rejected the feedback, lowered the standards they expect from themselves, or given up on the learning task altogether. The researchers note that who is giving the feedback can matter, as feedback coming from a peer or teacher has important social dynamics beyond mere information on how to improve one’s abilities.

I find two things interesting about this research. First, it is clear that although informative feedback is beneficial, it can backfire if it is processed inappropriately or if it fails to provide useful information. This means that when seeking feedback, the ultralearner needs to be on guard for two possibilities. The first is overreacting to feedback (both positive and negative) that doesn’t offer specific information that leads to improvement. Ultralearners need to be sensitive to what feedback is actually useful and tune out the rest. This is why, although all the ultralearners I met employed feedback, they didn’t act on every piece of possible feedback. Eric Barone, for instance, did not attend to every comment and critique on early drafts of his game. In many cases he ignored them, when the feedback conflicted with his vision. Second, when it is incorrectly applied, feedback can have a negative impact on motivation. Not only can overly negative feedback lower your motivation, but so can overly positive feedback. Ultralearners must balance both concerns, pushing for the right level of feedback for their current stage of learning. Though we all know (and instinctively avoid) harsh and unhelpful criticism, the research also supports Rock’s strategy of disregarding the positive feedback that his celebrity automatically generates.

The second interesting point about this research is that it explains why feedback-seeking efforts are often underused and thus remain a potent source of comparative advantage for ultralearners. Feedback is uncomfortable. It can be harsh and discouraging, and it doesn’t always feel nice. Standing up on a stage in a comedy club to deliver jokes is probably one of the best ways to get better at stand-up comedy. But the act itself can be terrifying, as an awkward silence cuts deep. Similarly, speaking immediately in a new language can be painful, as the sense of your ability to communicate goes down precipitously from when you use your native tongue.

Fear of feedback often feels more uncomfortable than experiencing the feedback itself. As a result, it is not so much negative feedback on its own that can impede progress but the fear of hearing criticism that causes us to shut down. Sometimes the best action is just to dive straight into the hardest environment, since even if the feedback is very negative initially, it can reduce your fears of getting started on a project and allow you to adjust later if it proves too harsh to be helpful.

All of these acts require self-confidence, resolve, and persistence, which is why many self-directed learning efforts ignore seeking the aggressive feedback that could generate faster results. Instead of going to the source, taking feedback directly, and using that information to learn quickly, people often choose to dodge the punches and avoid a potentially huge source of learning. Ultralearners acquire skills quickly because they seek aggressive feedback when others opt for practice that includes weaker forms of feedback or no feedback at all.

What Kind of Feedback Do You Need?

Feedback shows up in many different forms for different types of learning projects. Getting good at stand-up comedy and learning to write computer programs involve very different kinds of feedback. Learning higher math and learning languages are going to use feedback in different ways. The opportunities for seeking better feedback will vary depending on what you’re trying to learn. Rather than try to spell out exactly what feedback you need for your learning project, I think it’s important to consider different types of feedback, along with how each one can be used and cultivated. By knowing what kind of feedback you’re getting, you can make sure to use it best, while also recognizing its limitations. In particular, I want to consider three types of feedback: outcome feedback, informational feedback, and corrective feedback. Outcome feedback is the most common and in many situations the only type of feedback available. Informational feedback is also fairly common, and it’s important to recognize when you can split apart outcomes to get feedback on parts of what you’re learning and when feedback only on holistic outcomes is possible. Corrective feedback is the toughest to find but when employed well can accelerate learning the most.

Outcome Feedback: Are You Doing It Wrong?

The first type of feedback, and the least granular, is outcome feedback. This tells you something about how well you’re doing overall but offers no ideas as to what you’re doing better or worse. This kind of feedback can come in the form of a grade—pass/fail, A, B, or C—or it can come in the form of an aggregate feedback to many decisions you’re making simultaneously. The applause Tristan de Montebello received (or the crickets he heard) after a speech is an example of outcome feedback. It could tell him if he was getting better or worse, but it couldn’t really say why or how to fix it. Every entrepreneur experiences this kind of feedback when a new product hits the market. It may sell wildly well or abysmally, but that feedback comes in bulk, not directly decomposable into the various aspects of the product. Did the product cost too much? Was the marketing message not clear enough? Was the packaging unappealing? Customer reviews and comments can provide clues, but ultimately the success or failure of any new product is a complex bundle of factors.

This type of feedback is often the easiest to get, and research shows that even getting this feedback, which lacks a specific message about what you need to improve, can be helpful. In one study, feedback for a task involving visual acuity facilitated learning, even when it was delivered in blocks that were too large to get any meaningful information about which responses were correct and which were incorrect.4 Many projects that wholly lack feedback can easily be changed to get this broad-scale feedback. Eric Barone, for instance, provided a development blog to publish work on his game and solicit feedback from early drafts. It couldn’t provide him with detailed information about what exactly to improve and change, but his simply being immersed in an environment that provided feedback at all was helpful.

Outcome feedback can improve how you learn through a few different mechanisms. One is by providing you with a motivational benchmark against your goal. If your goal is to reach a certain quality of feedback, this feedback can give you updates on your progress. Another is that it can show you the relative merits of different methods you’re trying. When you are progressing rapidly, you can stick to those learning methods and approaches. When progress stalls, you can see what you might be able to change in your current approach. Although outcome feedback isn’t complete, it is often the only kind available and can still have a potent impact on your learning rate.

Informational Feedback: What Are You Doing Wrong?

The next type of feedback is informational feedback. This feedback tells you what you’re doing wrong, but it doesn’t necessarily tell you how to fix it. Speaking a foreign language with a native speaker who doesn’t share a language with you is an exercise in informational feedback. That person’s confused stare when you misuse a word won’t tell you what the correct word is, but it will tell you that you’re getting it wrong. Tristan de Montebello, in addition to the overall assessment of his performance by audience members at the end of a speech, can also get live informational feedback about how it’s going moment to moment. Did that joke work? Is my story boring them? This is something you can spot in the distracted glances or background chatter throughout your speech. Rock’s stand-up experiment is also a type of informational feedback. He can tell when a certain joke lands or doesn’t, based on the reaction of the audience. However, they can’t tell him what to do to make it funnier—he’s the comedian, not them.

This kind of feedback is easy to obtain when you can get real-time access to a feedback source. A computer programmer who gets error messages when her programs don’t compile properly may not have enough knowledge to understand what she’s doing wrong. But as errors increase or diminish, depending on what she does, she can use that signal to fix her problems. Self-provided feedback is also ubiquitous, and in some pursuits it can be almost as good as feedback from others. When painting a picture, you can simply look at it and get a sense of whether your brushstrokes are adding to or detracting from the image you want to convey. Because this kind of feedback often comes from direct interaction with the environment, it often pairs well with the third principle, directness.

Corrective Feedback: How Can You Fix What You’re Doing Wrong?

The best kind of feedback to get is corrective feedback. This is the feedback that shows you not only what you’re doing wrong but how to fix it. This kind of feedback is often available only through a coach, mentor, or teacher. However, sometimes it can be provided automatically if you are using the right study materials. During the MIT Challenge, I did most of my practice by going back and forth between assignments and their solutions, so that when I finished a problem, I was shown not only whether I had gotten it right or wrong but exactly how my answer differed from the correct one. Similarly, flash cards and other forms of active recall provide corrective feedback by showing you the answer to a question after you make your guess.

The educators Maria Araceli Ruiz-Primo and Susan M. Brookhart argue, “The best feedback is informative and usable by the student(s) who receive it. Optimal feedback indicates the difference between the current state and the desired learning state AND helps students to take a step to improve their learning.”5

The main challenge of this kind of feedback is that it typically requires access to a teacher, expert, or mentor who can pinpoint your mistakes and correct them for you. However, sometimes the added edge of having corrective over merely informational feedback can be worth the effort needed to find such people. Tristan de Montebello worked with Michael Gendler to help him with his public speaking performance, and that helped him spot subtle weaknesses in his presentations that would have gone unnoticed by himself or by a less experienced audience member giving broader feedback.

This type of feedback trumps outcome feedback, which can’t indicate what needs improving, and informational feedback, which can indicate what to improve but not how. However, it can also be unreliable. Tristan de Montebello would often get conflicting advice after delivering a speech; some audience members would tell him to slow down, while others said to speed up. This can also be a situation in which paying for a tutor can be useful, because that person can spot the exact nature of your mistake and correct it with less struggle on your part. The self-directed nature of ultralearning shouldn’t convince you that learning is best done as an entirely solitary pursuit.

Further Notes on Types of Feedback

A few things are worth noting here. First, you need to be careful when trying to “upgrade” feedback from a weaker form to a stronger form if it’s not actually possible. To switch from outcome feedback to informational feedback, you need to be able to elicit feedback on a per element basis of what you’re doing. If instead the feedback is being provided as a holistic assessment of everything you’re doing, trying to turn it into informational feedback can backfire. Game designers know to watch out for this, because asking play testers what they don’t like about a game can often return spurious results: for example, they don’t like the color of the character or the background music. The truth is, the players are evaluating the game holistically, so they often can’t offer this kind of feedback. If their responses come from using it as a whole, not from each aspect individually, asking for greater specificity may lead to guesses from those giving feedback.

Similarly, corrective feedback requires a “correct” answer or the response of a recognized expert. If there is no expert or a single correct approach, trying to turn informational feedback into corrective feedback can work against you when the wrong change is suggested as an improvement. De Montebello noted to me that the advice most people gave him wasn’t terribly useful, but the consistency of it was. If his speech elicited wildly different reactions each time, he knew there was still a lot of work to do. When the speech started to get much more consistent comments, he knew he was onto something. This illustrates that ultralearning isn’t simply about maximizing feedback but also knowing when to selectively ignore elements of it to extract the useful information. Understanding the merits of these different types of feedback, as well as the preconditions that make them possible, is a big part of choosing the right strategy for an ultralearning project.

How Quick Should Feedback Be?

An interesting question in the research on feedback is how quick it should be. Should you get immediate information about your mistakes or wait some period of time? In general, research has pointed to immediate feedback being superior in settings outside of the laboratory. James A. Kulik and Chen-Lin C. Kulik review the literature on feedback timing and suggest that “Applied studies using actual classroom quizzes and real learning materials have usually found immediate feedback to be more effective than delay.”6 Expertise researcher K. Anders Ericsson agrees, arguing in favor of immediate feedback when it assists in identifying and correcting mistakes and when it allows one to execute a corrected version of their performance revised in response to the feedback.7

Interestingly, laboratory studies tend to show that delaying the presentation of the correct response along with the original task (delayed feedback) is more effective. The simplest explanation of this result is that presenting the question and answer again offers a second, spaced exposure to the information. If this explanation were correct, all it would mean is that that immediate feedback is best paired with delayed review (or further testing) to strengthen your memory compared with a single exposure. I’ll cover more on spacing and how it impacts your memory in the next chapter on retention.

Despite the superficially mixed results on the timing of feedback from the scientific literature, I generally recommend faster feedback. This enables a quicker recognition of mistakes. However, there’s a possible risk that this recommendation might backslide into getting feedback before you’ve tried your best to answer the question or solve the problem at hand. Early studies on feedback timing tended to show a neutral or negative impact of immediate feedback on learning. In those studies, however, experimenters often gave subjects the ability to see the correct answer before subjects had finished filling out the prompt.8 That meant subjects could often copy the correct answer rather than try to retrieve it. Feedback too soon may turn your retrieval practice effectively into passive review, which we already know is less effective for learning. For hard problems, I suggest setting yourself a timer to encourage you to think hard on difficult problems before giving up to look at the correct answer.

How to Improve Your Feedback

By now you see the importance of feedback to your learning efforts. I’ve explained why feedback, especially when delivered to others, can sometimes backfire. I’ve also showed how the three types—outcome, informational, and corrective—have different strengths and the preconditions that need to be in place in order to make them effective. Now I want to focus on some concrete tactics you can apply to get better feedback.

Tactic 1: Noise Cancellation

Anytime you receive feedback, there are going to be both a signal—the useful information you want to process—and noise. Noise is caused by random factors, which you shouldn’t overreact to when trying to improve. Say you’re writing articles that you post online, trying to improve your writing ability. Most of them won’t attract much attention, and when they do, it’s often because of factors outside of your control; for example, just the right person happens to share it, causing it to spill across social networks. The quality of your writing does drive these factors, but there’s enough randomness that you need to be careful not to change your entire approach based on one data point. Noise is a real problem when trying to improve your craft because you need to do far more work to get the same information about how to write well. By modifying and selecting the streams of feedback you pay attention to, you can reduce the noise and get more of the signal.

A noise-cancelling technique used in audio processing is filtering. Sound engineers know that human speech tends to fall within a particular range of frequencies, whereas white noise is all over the spectrum. They can boost the signal, therefore, by amplifying the frequencies that occur in human speech and quieting everything else. One way to do this is to look for proxy signals. These don’t exactly equal success, but they tend to eliminate some of the noisy data. For blog writing, one way to do so would be to use tracking code to figure out what percentage of people read your articles all the way to the end. This doesn’t prove your writing is good, but it’s a lot less noisy than raw traffic data.

Tactic 2: Hitting the Difficulty Sweet Spot

Feedback is information. More information equals more opportunities to learn. A scientific measure of information is based on how easily you can predict what message it will contain. If you know that success is guaranteed, the feedback itself provides no information; you knew it would go well all along. Good feedback does the opposite. It is very hard to predict and thus gives more information each time you receive it.

The main way this impacts your learning is through the difficulty you’re facing. Many people intuitively avoid constant failure, because the feedback it offers isn’t always helpful. However, the opposite problem, of being too successful, is more pervasive. Ultralearners carefully adjust their environment so that they’re not able to predict whether they’ll succeed or fail. If they fail too often, they simplify the problem so they can start noticing when they’re doing things right. If they fail too little, they’ll make the task harder or their standards stricter so that they can distinguish the success of different approaches. Basically, you should try to avoid situations that always make you feel good (or bad) about your performance.

Tactic 3: Metafeedback

Typical feedback is performance assessment: your grade on a quiz tells you something about how well you know the material. However, there’s another type of feedback that’s perhaps even more useful: metafeedback. This kind of feedback isn’t about your performance but about evaluating the overall success of the strategy you’re using to learn.

One important type of metafeedback is your learning rate. This gives you information about how fast you’re learning, or at least how fast you’re improving in one aspect of your skill. Chess players might track their Elo ratings growth. LSAT studiers might track their improvements on mock exams. Language learners might track vocabulary learned or errors made when writing or speaking. There are two ways you can use this tool. One is to decide when you should focus on the strategy you’re already using and when you should experiment with other methods. If your learning rate is slowing to a trickle, that might mean you’re hitting diminishing returns with your current approach and could benefit from different kinds of drills, difficulties, or environments. A second way you can apply metafeedback is by comparing two different study methods to see which works better. During the MIT Challenge, I’d often split up questions from different subtopics before testing myself on an exam and try different approaches side by side. Does it work better to dive straight into trying to answer questions or to spend a little time to try to see that you understand the main concepts first? The only way you can know is to test your own learning rates.

Tactic 4: High-Intensity, Rapid Feedback

Sometimes the easiest way to improve feedback is simply to get a lot more of it a lot more often. This is particularly true when the default mode of learning involves little or infrequent feedback. De Montebello’s strategy of improving public speaking relied largely on getting far more frequent exposure to the stage than most speakers do. Lewis’s language immersion exposes him to information about his pronunciation at a point when most students still haven’t uttered a word. High-intensity, rapid feedback offers informational advantages, but more often the advantage is emotional, too. Fear of receiving feedback can often hold you back more than anything. By throwing yourself into a high-intensity, rapid feedback situation, you may initially feel uncomfortable, but you’ll get over that initial aversion much faster than if you wait months or years before getting feedback.

Being in such a situation also provokes you to engage in learning more aggressively than you might otherwise. Knowing that your work will be evaluated is an incredible motivator to do your best. This motivational angle for committing to high-intensity feedback may end up outweighing the informational advantage it provides.

Beyond Feedback

Receiving feedback isn’t always easy. If you process it as a message about your ego rather than your skills, it’s easy to let a punch become a knockout. Though carefully controlling the feedback environment so it is maximally encouraging may be a tantalizing option, real life rarely affords such an opportunity. Instead, it’s better to get in and take the punches early so that they don’t put you down for the count. Though short-term feedback can be stressful, once you get into the habit of receiving it, it becomes easier to process without overreacting emotionally. Ultralearners use this to their advantage, exposing themselves to massive amounts of feedback so that the noise can be stripped away from the signal.

Feedback and the information it provides, however, is useful only if you remember the lessons it teaches. Forgetting is human nature, so it is not enough to learn; you also need to make the information stick. This brings us to the next principle of ultralearning, retention, in which we’ll discuss strategies that will ensure the lessons you learn aren’t forgotten.

Chapter X

Principle 7

Retention

Don’t Fill a Leaky Bucket

Memory is the residue of thought.

—Daniel Willingham, cognitive psychologist

In the small Belgian city of Louvain-la-Neuve, Nigel Richards has just won the World Scrabble Championships. On its own, this isn’t too surprising. Richards has won a championship three times before, and both his prowess with the game and his mysterious personality have made him something of a legend in competitive Scrabble circles. This time, however, is different: instead of the original English-language version of the famous crossword game, Richards has won the French World Championship. This is a much harder feat: most English dictionary versions have roughly 200,000 valid word entries; French, with its gendered nouns and adjectives and copious conjugations, has nearly double that with around 386,000 valid word forms.1 To pull off such a feat is quite remarkable, even more so due to one simple fact: Richards doesn’t speak French.

Richards, an engineer born and raised in Christchurch, New Zealand, is an unusual character. With his long beard and retro aviator sunglasses, he looks like a cross between Gandalf and Napoleon Dynamite. His skills at Scrabble, however, are no joke. A late starter to the game, his mother encouraged him to start in his late twenties, saying “Nigel, since you’re no good at words, you won’t be good at this game, but it will keep you occupied.”2 From those inauspicious beginnings Richards has gone on to dominate the competitive Scrabble scene. Some people even argue that he may be the greatest player of all time.

In case you’ve been living under a rock, Scrabble is based on forming crosswords. Each player has seven letter tiles, drawn from a bag, with which to form words. The catch is that the words must link up with the words already on the board. To be a good player requires a voluminous memory, not only of the words we use every day but of obscure words that are useful because of their length or the letters they contain. A decent casual player quickly learns all the valid two-letter words, including unusual ones such as “AA” (a type of lava) and “OE” (a windstorm in the Faroe Islands). To perform at tournament level, however, requires memorizing nearly all of the short words, as well as longer seven- and eight-letter words, since if a player uses up all seven tiles in one turn, there is an extra fifty-point bonus (or “bingo,” in Scrabble jargon). Memory, however, isn’t the only skill needed. Like other competitive games, tournament Scrabble uses a timing system, so skilled players must not only be able to construct valid words from a scrambled set of tiles but quickly find spaces and calculate which words will score the most points. In this regard, Richards is a master: given the tiles CDHLRN and one blank (which can be used for any letter), Richards ignored the obvious CHILDREN and instead linked up multiple crosswords to make the even higher scoring CHLORODYNE.

Richards’s virtuosity is only intensified by the mystery that surrounds it. He is quiet and mostly keeps to himself. He refuses all interviews with reporters and seems completely uninterested in fame, fortune, or even providing explanations for how he does it. A fellow competitor, Bob Felt, bumping into Richards at a tournament noted his monklike serenity, telling him “When I see you, I can never tell whether you’ve won or lost.” “That’s because I don’t care” was Richard’s monotone response.3 Even his competing in Belgium, which briefly pulled him into the international media spotlight, was done as an excuse to do a cycling trip through Europe. In fact, prior to his victory, he had spent only nine weeks preparing. After he beat a Francophone player, Schelick Ilagou Rekawe from Gabon, in the final match, he was given a standing ovation but needed a translator to thank the audience.

What Is Nigel Richards’s Secret?

The more I read about Nigel Richards, the more intrigued I became. Richards was as mysterious as he was incredible in his mnemonic abilities. He steadfastly ignores opportunities for interviews and is famously laconic in descriptions of his methods. After his victory in Louvain-la-Neuve, one reporter asked him if he had any special methods for memorizing all those words. “No” was Richards’s monosyllabic response. Still, even if he wouldn’t divulge his strategies publicly, perhaps some digging could reveal clues.

The first thing I discovered was that although Richards’s victory in Belgium was astounding, it wasn’t entirely without precedent. Other players of the game have won World Championships without being fluent in the language of competition. Scrabble is particularly popular in Thailand, for instance, and two former world champions, Panupol Sujjayakorn and Pakorn Nemitrmansuk, are not fluent in English. The reason is simple: remembering words in one’s native language and remembering words in Scrabble are different mnemonic feats. In spoken language, the meaning of a word, its pronunciation, and its feel are important. In Scrabble, those things don’t matter; words are just combinations of letters. Richards could win at French Scrabble without speaking French because the game wasn’t much different from English; he just had to memorize different patterns of letters. A native speaker has an advantage, of course, since many spellings will already be familiar. But there will still be a large number of arcane and unfamiliar words to memorize, and the skill of rearranging the letters into valid board positions and calculating to achieve maximal points remains the same in every language in which Scrabble can be played.

The next piece of the puzzle I discovered was that Scrabble, it turns out, isn’t the only activity in which Richards possesses a strange intensity. His other love is cycling. Indeed, in an early tournament in Dunedin, New Zealand, he got onto his bicycle after work finished, pedaled through the night from Christchurch to Dunedin, a distance of over two hundred miles, without sleeping, and started the tournament first thing in the morning. After he won, competitors he met at the tournament offered to give him a ride home. He politely declined, preferring to bicycle back the entire way home to Christchurch for another sleepless night before starting work again Monday morning.4 At first that felt like just another odd quirk in his profile, like his home-done haircuts and reluctance to be interviewed. Now, though, I believe it may hold some keys to unlocking some of his mystery.

Cycling, of course, isn’t a great mnemonic technique. If it were, Lance Armstrong would have been a fierce contender. However, it does illustrate a common theme in Richards’s personality that overlaps with that of other ultralearners I have encountered: an obsessive intensity that exceeds what is considered a normal investment of effort. Richards’s cycling, it turns out, also lines up well with the only other clues I’ve been able to uncover about his methods: he reads lists; long lists of words, starting with two-letter words and then moving up. “The cycling helps,” he explains, “I can go through lists in my mind.”5 He reads the dictionary, focusing exclusively on combinations of letters, ignoring definitions, tenses, and plurals. Then, drawing from memory, he repeats them over and over again as he cycles for hours. This aspect also corresponds with a method that is common to other ultralearners and that has shown up in other principles of learning so far: active recall and rehearsal. By retrieving words, Richards likely takes his already impressive memory and makes it unassailable through active practice.

There are other clues about Richards’s performance: he focuses on memory, not anagramming (rearranging the tiles to create words); he works forward and backward, starting from small words, going on to big ones and back again; he claims to recall the words visually, as he cannot remember words when they’re spoken. All of these clues provide glimpses into Richards’s mind, but they leave out even more than they reveal. How many times does he have to read the words from his list before he can rehearse it mentally? Are the words organized in some way or just listed alphabetically? Is he a savant with exceptional abilities and lower-than-normal general intelligence or an all-round genius for whom memorizing Scrabble words is just one of many impressive abilities? Maybe his intelligence is quite average and his dominance in Scrabble represents his extreme dedication to the game. We might never know the answers to those questions.

I certainly can’t rule out the theory that Richards’s mind is simply hardwired differently or better for memory than my own. After all, nothing I’ve encountered so far about his method is so boldly original that serious Scrabble players would be unaware of it. Yet Richards has completely dominated his competition. Part of me suspects that his intense, obsessive personality, which enables him to cycle for hours reviewing lists mentally, might also form at least a partial explanation. Whatever gifts he might possess, he also seems to possess the ultralearner ethos I’ve described thus far in the book. For whatever it is worth, Richards himself argues for more of the latter than the former: “It’s hard work, you have to have dedication to learn,”6 elsewhere adding “I’m not sure there is a secret, it’s just a matter of learning the words.”7

Scrabble words may not be important to your life. However, memory is essential to learning things well. Programmers must remember the syntax for the commands in their code. Accountants need to memorize ratios, rules, and regulations. Lawyers must remember precedents and statutes. Doctors need to know tens of thousands of factoids, from anatomical descriptions to drug interactions. Memory is essential, even when it is wrapped up in bigger ideas such as understanding, intuition, or practical skill. Being able to understand how something works or how to perform a particular technique is useless if you cannot recall it. Retention depends on employing strategies so the things you learn don’t leak out of your mind. Before discussing strategies of retention, however, let’s take a look at why remembering things is so difficult.

Why Is It So Hard to Remember Things?

Richards is an extreme case, but his story nonetheless illustrates many themes that are important for anyone who wants to learn something: How can you retain all of the things you learn? How can you defend against forgetting hard-won facts and skills? How can you store the knowledge you’ve acquired so that it can be easily retrieved exactly when you need it? In order to understand learning, you need to understand how and why you forget.

Losing access to previously learned knowledge has been a perennial problem for educators, students, and psychologists. Fading knowledge impacts the work you do as well. One study reported that doctors give worse medical care the longer they have worked, as their stored knowledge from medical school is gradually forgotten, despite their working in the profession full-time. Quoting from the original abstract:

Physicians with more experience are generally believed to have accumulated knowledge and skills during years in practice and therefore to deliver high-quality care. However, evidence suggests that there is an inverse relationship between the number of years that a physician has been in practice and the quality of care that the physician provides.8

Hermann Ebbinghaus, in one of the first psychological experiments in history, spent years memorizing nonsense syllables, much in the same way Richards memorizes Scrabble words, and carefully tracking his ability to recall them later. From this original research, later verified by more experimentally robust studies, Ebbinghaus discovered the forgetting curve. This curve shows that we tend to forget things incredibly quickly after learning them, there being an exponential decay in knowledge, which is steepest right after learning. However, Ebbinghaus noted, this forgetting tapers off, and the amount of knowledge forgotten lessens over time. Our minds are a leaky bucket; however, most of the holes are near the top, so the water that remains at the bottom leaks out more slowly.

Over the intervening years, psychologists have identified at least three dominant theories to help explain why our brains forget much of what we initially learn: decay, interference, and forgotten cues. Though the jury is still out on the exact mechanism underlying human long-term memory, these three ideas likely form some part of explaining why we tend to forget things and, conversely, provide insight into how we might better retain what we’ve learned.

Decay: Forgetting with Time

The first theory of forgetting is that memories simply decay with time. This idea does seem to match common sense. We remember events, news, and things learned in the past week much more clearly than things from last month. Things learned this year are recalled with much greater accuracy than events from a decade ago. By this understanding, forgetting is simply an inevitable erosion by time. Like sands in an hourglass, our memories inexorably slip away from us as we become more distant from them.

There are flaws with this theory being the complete explanation, however. Many of us can vividly recall events from early childhood, even if we can’t remember what we ate for breakfast last Tuesday. There also seem to be patterns in which things are remembered and which are forgotten that go beyond the time since they were originally learned: vivid, meaningful things are more easily recalled than banal or arbitrary information. Even if there is a component to our forgetting that is simply decay, it seems exceedingly unlikely that this is the only factor.

Interference: Overwriting Old Memories with New Ones

Interference suggests a different idea: that our memories, unlike the files of a computer, overlap one another in how they are stored in the brain. In this way, memories that are similar but distinct can compete with one another. If you’re learning programming, for instance, you may learn what a for loop is and remember it in terms of doing something repeatedly. Later, you may learn about while loops, recursion, repeat-until loops, and go-to statements. Now, each of these has to do with doing something repeatedly, but in different ways, so they may interfere with your ability to remember correctly what a for loop does. There are at least two flavors of this: proactive interference and retroactive interference. Proactive interference occurs when previously learned information makes acquiring new knowledge harder. Think of this as if the “space” where that information wants to be stored is already occupied, so forming the new memory becomes harder. This can happen when you want to learn the definition of a word but have difficulty because that word already has a different association in your mind. Consider trying to learn the concept of negative reinforcement in psychology—here the word “negative” has the meaning “absent,” as opposed to “bad,” so negative reinforcement is when you encourage a behavior by removing something, say a painful stimulus. However, since the earlier meaning of negative as “bad” already exists, you may have difficulty remembering this and it becomes easy to incorrectly equate negative reinforcement with punishment. Retroactive interference is the opposite—where learning something new “erases” or suppresses an old memory. Anyone who has learned Spanish and later tried to learn French knows how tricky retroactive interference can be, as French words pop out when you want to speak Spanish again.

Forgotten Cues: A Locked Box with No Key

The third theory of forgetting says that many memories we have aren’t actually forgotten but simply inaccessible. The idea here is that in order to say that one has remembered something, it needs to be retrieved from memory. Since we aren’t constantly experiencing the entirety of our long-term memories simultaneously, this means there must be some process for dredging up the information, given an appropriate cue. What may happen in this case is that one of the links in the chain of retrieving the information has been severed (perhaps by decay or interference) and therefore the entire memory has become inaccessible. However, if that cue were restored, or if an alternative path to the information could be found, we would remember much more than is currently accessible to us.

This explanation also has some advantages. Intuitively it seems to be somewhat true, as we all know the tip-of-the-tongue experience, when we feel as though we should be able to remember a fact or word but we’re not able to summon it up immediately. It might also suggest that relearning things is much faster than learning them initially, because relearning is closer to repair work, while original learning is a completely new construction. Forgetting cues seems highly likely as a partial, if not complete, explanation of forgetting many things.

Cue forgetting as a complete explanation for our memory woes isn’t without its problems, however. Many memory researchers now believe that the act of remembering is not a passive process. In recalling facts, events, or knowledge, we’re engaging in a creative process of reconstruction. The memories themselves are often modified, enhanced, or manipulated in the process of remembering. It may be, then, that “lost” memories that are retrieved through new cues are actually fabrications. This seems especially likely in the case of “recovered” witness testimony from traumatic events, as experiments have shown that even highly vivid memories that feel completely authentic to the subject can be untrue.9

How Can You Prevent Forgetting?

Forgetting is the default, not the exception, so the ultralearners I encountered had devised various strategies for coping with this fact of life. These methods roughly divide into tackling two similar but different problems. The first set of methods deals with the problem of retention while undertaking the ultralearning project: How can you retain the things you learned the first week, so that you don’t need to relearn them by the last week? This is particularly important for memory-intensive ultralearning efforts such as Benny Lewis’s language learning and Roger Craig’s Jeopardy! trivia mastery. In these domains and many others, the volume of information to be learned is often so large that the forgetting becomes a practical obstacle almost immediately. The second set of methods, in contrast, has to do with the longevity of the skills and knowledge acquired after the project has been completed: Once a language has been learned to a level you’re satisfied with, how can you keep yourself from forgetting it completely a couple years later?

The ultralearners I encountered had devised differing methods for dealing with these two problems, which varied in effort and intensity. Some, like Craig, preferred elaborate electronic systems that can optimize memory with fancy algorithms, leaving little waste and inefficiency, if at the cost of introducing greater complexity. Others, like Richards, seem to prefer basic systems that succeed on their simplicity.

You need to pick a mnemonic system, which will both accomplish your goals and be simple enough to stick to. During intense periods of language learning, the sheer volume of vocabulary often meant that spaced-repetition systems were helpful for me. Other times, I preferred having conversations to maintain my speaking ability, even though this method is not quite as precise. With other subjects, I’m happier to allow for some degree of forgetting as long as I practice the skills I need to use continuously and have the ability to relearn.

My approaches may not reach a theoretical ideal, but they may end up working better because they have fewer possibilities for error and can be sustained more easily. Regardless of the exact system used, however, all systems seemed to work according to one of four mechanisms: spacing, proceduralization, overlearning, or mnemonics. Let’s look at each of these mechanisms of retention first, in order to make sense of the quite different and idiosyncratic manifestations used in different ultralearning projects.

Memory Mechanism 1—Spacing: Repeat to Remember

One of the pieces of studying advice that is best supported by research is that if you care about long-term retention, don’t cram. Spreading learning sessions over more intervals over longer periods of time tends to cause somewhat lower performance in the short run (because there is a chance for forgetting between intervals) but much better performance in the long run. This was something I needed to be careful about during the MIT Challenge. After my first few classes, I switched from doing one class at a time to doing a few in parallel, to minimize the impact that the crammed study time would have on my memory.

If you have ten hours to learn something, therefore, it makes more sense to spend ten days studying one hour each than to spend ten hours studying in one burst. Obviously, however, if the amount of time between study intervals gets longer and longer, the short-term effects start to outweigh the long-term ones. If you learn something with a decade separating study intervals, it’s quite possible that you’ll completely forget whatever you had learned before you reach the second session.

Finding the exact trade-off point between too long and too short has been a minor obsession for some ultralearners. Space your study sessions too closely, and you lose efficiency; space them too far apart, and you forget what you’ve already learned. This has led many ultralearners to apply what are known as spaced-repetition systems (SRS) as a tool for trying to retain the most knowledge with the least effort. SRS was a major force behind Roger Craig’s Jeopardy! trivia memorization, and I used the systems extensively when learning Chinese and Korean. Although you may not have heard of this term, the general principle is the backbone of many language-learning products, including Pimsleur, Memrise, and Duolingo. These programs tend to hide the spacing algorithm in the background, so you don’t need to bother yourself with it. However, other programs, such as the open-source Anki, are the preferred tool of more extreme ultralearners who want to squeeze out a little more performance.

SRS is an amazing tool, but it tends to have quite focused applications. Learning facts, trivia, vocabulary words, or definitions is ideally suited for flash card software, which presents knowledge in terms of a question with a single answer. It’s more difficult to apply to more complicated domains of knowledge, which rely on complex information associations that are built up only through real-world practice. Still, for some tasks, the bottleneck of memory is so tight that SRS is a powerful tool for widening it, even if there are some drawbacks. The authors of a popular study guide for medical students center their approach around SRS, because a medical student must remember so many things and the default strategy of forgetting and relearning is quite costly in terms of time.10

Spacing does not require complex software, however. As Richards’s story clearly demonstrates, simply printing lists of words, reading them over, and then rehearsing them mentally without having them in front of you is an incredibly powerful technique. Similarly, semiregular practice of a skill is often quite helpful. After my year of learning languages, I wanted to ensure that I didn’t forget them. My approach was fairly simple: schedule thirty minutes of conversation practice once a week, to be done over Skype using italki, an online service for tutoring and language exchange partners all over the world. I maintained this for one year, after which I dropped to once-per-month practice for another two years. I don’t know whether this practice schedule was ideal, and I had other opportunities to practice that came up spontaneously in that time period that also helped, but I believe it was much better than doing nothing and letting the skills atrophy. When it comes to retention, don’t let perfect become the enemy of good enough.

Another strategy for applying spacing, which can work better for more elaborate skills that are harder to integrate into your daily habits, is to semiregularly do refresher projects. I leaned toward this approach for the things I learned during the MIT Challenge, since the skill I wanted most to retain was writing code, which is tricky to do on only half an hour per week. This approach has the disadvantage of sometimes deviating quite a lot from optimal spacing; however, if you’re prepared to do a little bit of relearning to compensate, it can still be a better approach than completely giving up practice. Scheduling this kind of maintenance in advance can also be helpful, as it will remind you that learning isn’t something done once and then ignored but a process that continues for your entire life.

Memory Mechanism 2—Proceduralization: Automatic Will Endure

Why do people say it’s “like riding a bicycle” and not “like remembering trigonometry?” This common expression may be rooted in deeper neurological realities than it first appears. There’s evidence that procedural skills, such as riding a bicycle, are stored in a different way from declarative knowledge, such as knowing the Pythagorean Theorem or the Sine Rule for triangles. This difference between knowing how and knowing that may also have different implications for long-term memory. Procedural skills, such as the ever-remembered bicycling, are much less susceptible to being forgotten than knowledge that requires explicit recall to retrieve.11

This finding can actually be used to our advantage. One dominant theory of learning suggests that most skills proceed through stages—starting declarative but ending up procedural as you practice more. A perfect example of this declarative-to-procedural transition is typewriting. When you start typing on a keyboard, you must memorize the positions of the letters. Each time you want to type a word, you have to think in terms of its letters, recall each one’s position on the keyboard, and then move your finger to that spot to press it. This process may fail; you may forget where a key is and need to look down to type it. However, if you practice more and more, you stop having to look down. Eventually you stop having to think about the letters’ positions or how to move your fingers to meet them. You may even reach a point where you don’t think of letters at all and whole words come out at a time. Such procedural knowledge is quite robust and tends to be retained much longer than declarative knowledge. A quick observation is enough to verify this: when you’ve gotten really good at typing and someone asks you to quickly say where on a keyboard the letter w is, you might need to actually put your hands in the keyboard position (or imagine you’re doing so) and pretend to type the w to say definitively. This is exactly what happened to me as I was typing out this paragraph. What has happened is that what was originally the primary access point to knowledge, your explicit memory of the key location, has faded away and now needs to be recalled with the more durable procedural knowledge encoded in your motor movements. If you’ve ever had to enter a password or pin code you use often, you may be in a similar position, where you remember it by feel and not by its explicit combination of numbers and letters.

Because of the fact that procedural knowledge is stored for longer, this may suggest a useful heuristic. Instead of learning a large volume of knowledge or skills evenly, you may emphasize a core set of information much more frequently, so that it becomes procedural and is stored far longer. This was an unintentional side effect of my friend’s and my language-learning project. Being forced to speak a language constantly meant that a core set of phrases and patterns was repeated so often that neither of us will ever forget them. This may not hold true for a bunch of less frequently used words or phrases, but the starting points of conversations are nearly impossible to forget. The classic approach to language studies, in which students “move on” from beginner words and grammatical patterns to more complicated ones may sidestep this, so that those core patterns aren’t sticky enough to last for years without repeated practice.

Failing to fully proceduralize core skills was a major flaw of my first major self-education effort, the MIT Challenge, which I was able to improve upon in my subsequent language-learning and portrait-drawing projects. Whereas the MIT Challenge did have core mathematical and programming skills that were often repeated, what ended up being proceduralized was more haphazard rather than reflecting a conscious decision to automate the most essential skills of applying computer science.

Most skills we learn are incompletely proceduralized. We may be able to do some of them automatically, but other parts require us to actively search our minds. You might, for instance, be able to easily move variables from one side of an equation to the other in algebra without thinking. But you may have to think a bit more when exponents or trigonometry is involved. Perhaps, owing to their nature, some skills cannot be completely automated and will always require some conscious thought. This creates an interesting mix of knowledge, with some things retained quite stably over longer periods of time and others susceptible to being forgotten. One strategy for applying this concept might be to ensure that a certain amount of knowledge is completely proceduralized before practice concludes. Another approach might be to spend extra effort to proceduralize some skills, which will serve as cues or access points for other knowledge. You may aim to completely proceduralize the process you use to start working on a new programming project, for example, so that you can get over that hump in the process of writing a new program. These strategies are somewhat speculative, but I think there are lots of potential ways the declarative-to-procedural transition of knowledge might be applied by clever ultralearners in the future.

Memory Mechanism 3—Overlearning: Practice Beyond Perfect

Overlearning is a well-studied psychological phenomenon that’s fairly easy to understand: additional practice, beyond what is required to perform adequately, can increase the length of time that memories are stored.12 The typical experimental setup is to give subjects a task, such as assembling a rifle or going through an emergency checklist, allowing them enough time to practice that they can do it correctly once. The time from zero to this point is considered the “learning” phase. Next, allow the subjects different amounts of “overlearning,” or practice that continues after the first correct application. Since subjects are already doing the skill correctly, performance doesn’t improve past this point. However, the overlearning can extend the durability of the skill.

In the typical setting in which overlearning has been studied, the duration of the overlearning effects tends to be quite short; practicing a little longer in one session produces an additional week or two of recall. This may imply that overlearning is primarily a short-term phenomenon: something useful for skills like first aid or emergency response protocols, which are rarely practiced but need to be kept fresh in between regular training sessions. I suspect, however, that overlearning might have longer-term implications if it is combined with spacing and proceduralization over much longer projects. In my own personal experience drawing portraits, for instance, the thought process used for mapping out the facial features I learned from Vitruvian Studio was repeated so many times that it’s hard to forget, even though my major practice time was only during one month. Similarly, certain reflexes of programming or mathematics I can still easily recall from my MIT Challenge days, even without practice in the interim, because they happened to be patterns that were repeated far more than was necessary to perform them adequately at the time (because they were components of more elaborate problems).

Overlearning dovetails nicely with the principle of directness. Because direct use of a skill frequently involves overpracticing certain core abilities, that kernel is usually quite resistant to forgetting, even years later. In contrast, academically learned subjects tend to distribute practice more evenly to cover the entire curriculum to a minimum level of competency in each area, regardless of the centrality of subtopics to practical applications. Many people I’ve known who have learned a language that I also speak but who learned it through years of formal schooling have much more impressive vocabularies or knowledge of grammatical nuances than I do. However, those same people may trip over fairly basic phrases, because they learned every fact and skill evenly, rather than overlearning the smaller subset of very common patterns.

There seem to be two main methods I’ve encountered for applying overlearning. The first is core practice, continually practicing and refining the core elements of a skill. This approach often works well paired with some kind of immersion or working on extensive (as opposed to intensive) projects, after the initial ultralearning phase has been completed. The shift from learning to doing here may actually involve a deeper, subtler form of learning, which shouldn’t be discounted as simply applying previously learned knowledge.

The second strategy is advanced practice, going one level above a certain set of skills so that core parts of the lower-level skills are overlearned as one applies them in a more difficult domain. One study of algebra students demonstrated this second strategy.13 Most students who had taken an algebra class and were retested years later had forgotten huge amounts of what they had learned. This could have been either because the information was truly lost or simply because forgotten cues rendered the majority of it inaccessible. Interestingly, this rate of forgetting was the same for better- and poorer-performing students; better students retained more than weaker ones, but the rate at which they had forgotten was the same. One group, however, did not show such a steep decline in forgetting: those who had taken calculus. This suggests that moving up a level to a more advanced skill enabled the earlier skill to be overlearned, thus preventing some forgetting.

Memory Mechanism 4—Mnemonics: A Picture Retains a Thousand Words

The final tool common to many ultralearners I encountered was mnemonics. There are many mnemonic strategies, and covering them all is outside the scope of this book. What they have in common is that they tend to be hyperspecific—that is, they are designed to remember very specific patterns of information. Second, they usually involve translating abstract or arbitrary information into vivid pictures or spatial maps. When mnemonics work, the results can be almost difficult to believe. Rajveer Meena, the Guinness World Record holder for memorizing digits of the mathematical constant pi, knows the number to 70,000 decimal places.14 Master mnemonicists, who compete in championships of memory, can memorize the order of a deck of cards in under sixty seconds and can repeat a poem verbatim after only a minute or two of studying. These feats are quite impressive, and even better, they can be learned by anyone patient enough to apply them. How do they work?

One common, and useful, mnemonic is known as the keyword method. The method works by first taking a foreign-language word and converting it into something it sounds like in your native language. If I were doing this with French, for example, I might take the word chavirer (to capsize) and convert it into “shave an ear,” to which it is close enough in sound for the latter to serve as an effective cue for recalling the original word. Next I create a mental image that combines the sounds-like version of the foreign word and an image of its translation in a fantastical and vivid setting that is bizarre and hard to forget. In this case, I could imagine a giant ear shaving a long beard while sitting in a boat that capsizes. Then, whenever I need to remember what “capsize” is in French, I think of capsizing, recall my elaborate picture, which links to “shaving an ear” and thus . . . chavirer. This process sounds needlessly complicated and elaborate at first, but it benefits from converting a difficult association (between arbitrary sounds and a new meaning) into a few links that are much easier to associate and remember. With practice, each conversion of this type may take only fifteen to twenty seconds, and it really does help with remembering foreign-language words. This particular kind of mnemonic works for this purpose, but there are others that work for remembering lists, numbers, maps, or sequences of steps in a procedure. For a good introduction to this topic, I highly recommend Joshua Foer’s book Moonwalking with Einstein: The Art and Science of Remembering Everything.

Mnemonics work well, and with practice, anyone can do them. Why, then, are they not front and center in this chapter, instead of at the end? I believe that mnemonics, like SRS, are incredibly powerful tools. And as tools, they can open new possibilities for people who are not familiar with them. However, as someone who has spent much time exploring them and applying them to real-world learning, their applications are quite a bit narrower than they first appear, and in many real-world settings they simply aren’t worth the hassle.

I believe there are two disadvantages to mnemonics. The first is that the most impressive mnemonics systems (like the one for memorizing thousands of digits of the mathematical constant pi), also require a considerable up-front investment. After you’re done, you can memorize digits easily, but this isn’t actually a very useful task. Most of our society adapts around the fact that people generally cannot memorize digits, so we have paper and computers do it for us. The second disadvantage is that recalling from mnemonics is often not as automatic as directly remembering something. Knowing a mnemonic for a foreign-language word is better than failing to remember it entirely, but it’s still too slow to allow you to fluently form sentences out of mnemonically remembered words. Thus mnemonics can act as a bridge for difficult-to-remember information, but it’s usually not the final step in creating memories that will endure forever.

Mnemonics, therefore, are an incredibly powerful if somewhat brittle tool. If you are doing a task that requires memorizing highly dense information in a very specific format, especially if the information is going to be used over a few weeks or months, they can enable you to do things with your mind that you might not have thought possible. Alternatively, they can be used as an intermediate strategy to smooth initial information acquisition when the information is quite dense. I’ve found them useful for language learning and terminology, and, paired with SRS, they can form an effective bridge from feeling as though there’s no way you can possibly remember everything to remembering it so deeply that you can’t possibly forget. Indeed, in a world before paper, computers, and other externalized memories, mnemonics were the main game in town. However, in the modern world, which has developed excellent coping mechanisms for the fact that most people cannot remember things as a computer can, I feel that mnemonics tend to serve more as cool tricks than as a foundation you should base your learning efforts on. Still, there is a devoted subset of ultralearners who are fiercely committed to applying these techniques, so my word shouldn’t be the final verdict.

Winning the War Against Forgetting

To retain knowledge is ultimately to combat the inevitable human tendency to forget. This process occurs in all of us, and there’s no way to avoid it completely. However, certain strategies—spacing, proceduralization, overlearning, and mnemonics—can counteract your short- and long-term rates of forgetting and end up making a huge difference in your memorization.

I opened this chapter by discussing Nigel Richards’s mysterious Scrabble mastery. How he is able to recall so many words so quickly and see them in a set of scrambled tiles will likely remain an enigma. What we do know about him fits the picture of other ultralearners who have dominated memory-intensive subjects: active recall, spaced rehearsal, and an obsessive commitment to intense practice. Whether you or I have the will to go as far as Richards does is an open question, but with hard work and a good strategy, it seems likely to me that the battle against forgetting need not be a losing one.

Though Richards’s Scrabble practice may give him the benefit of memorizing words he doesn’t know the meaning of, real life tends to reward a different kind of memory: one that integrates knowledge into a deep understanding of things. In the next principle, we’ll look at going from memory to intuition.

Chapter XI

Principle 8

Intuition

Dig Deep Before Building Up

Do not ask whether a statement is true until you know what it means.

—Errett Bishop, mathematician

To the world, he was an eccentric professor and Nobel Prize–winning physicist; to his biographer, he was a genius; but to those who knew him, Richard Feynman was a magician. His colleague the mathematician Mark Kac once posited that the world holds two types of geniuses. The first are ordinary geniuses: “Once we understand what they have done we feel certain that we, too, could have done it.” The other type are magicians, whose minds work in such inscrutable ways that “Even after we understand what they have done, the process by which they have done it is completely dark.” Feynman, by his reckoning, was “a magician of the highest caliber.”1

Feynman could take problems others had worked on for months and immediately see the solution. In high school, he competed in mathematics tournaments, where he would often get the correct answer while the problem was still being stated. While his competitors had just begun to compute, Feynman already had the answer circled on the page. In his college days, he competed in the Putnam Mathematics Competition, with the winner receiving a paid scholarship to Harvard. This competition is notoriously difficult, requiring clever tricks rather than straightforward application of previously learned principles. Time is also a factor, and some examination sessions have a median score of zero, meaning the typical competitor didn’t get even one right. Feynman walked out of the exam early. He scored first place, with his fraternity brothers later being amazed at the drastic gap between Feynman’s score and the next four on the list. During his work on the Manhattan Project, Niels Bohr, then one of the most famous and important living physicists, asked to speak with Feynman directly, to run his ideas by the young grad student before talking with the other physicists. “He’s the only guy who’s not afraid of me” was Bohr’s explanation. “[He] will say when I’ve got a crazy idea.”2

Nor was Feynman’s magic restricted to physics. As a child he went around fixing people’s radios, in part because paying an adult for repairs in the Depression was too costly but also because the radio owners marveled at his process. Once, while he was lost in thought trying to figure out why a radio was producing an awful noise as it started up, the owner of the radio got impatient. “What are you doing? You come to fix the radio, but you’re only walking back and forth!” “I’m thinking!” came the reply, at which the owner, startled at the boldness for which Feynman would later become famous, laughed. “He fixes radios by thinking!”

As a young man during the construction of the atomic bomb in the Manhattan Project, he occupied his free time picking the locks of his supervisors’ desks and cabinets. He once broke into a senior colleague’s filing cabinet, where the secrets for building a nuclear bomb were kept, as a practical joke. Another time, he demonstrated his technique to a military official, who, instead of fixing the security flaw, decided the proper course was to warn everyone to keep Feynman away from their safes! Later, upon meeting a locksmith, he found that his reputation had grown to the point where the professional said, “God! You’re Feynman—the great safecracker!”

He also created the impression of being a human calculator. On a trip to Brazil, he went toe to toe against an abacus salesman, computing difficult figures such as the cube root of 1,729.03. Not only did Feynman get the right answer, 12.002, but he got it to more decimal places than the abacus salesman, who was still furiously calculating to get to 12 when Feynman displayed his five-digit result. This ability impressed even other professional mathematicians, to whom he argued that he could, within one minute, get the answer to any problem that could be stated in ten seconds to within 10 percent of the correct number. The mathematicians threw questions at him such as “e to the power of 3.3” or “e to the power of 1.4,” and Feynman managed to spit back the correct answer almost immediately.

Demystifying Feynman’s Magic

Feynman was certainly a genius. Many people, including his biographer James Gleick, are satisfied to leave it at that. A magic trick, after all, is most dazzling when you don’t know how it is done. Perhaps this is why many accounts of the man have focused on his magic instead of his method.

Though Feynman was quite smart, his magic had its gaps. He excelled in math and physics but was abysmal in the humanities. His college grades in history were in the bottom fifth of his class, in literature in the bottom sixth, and his fine arts grades were worse than those of 93 percent of his fellow students. At one point, he even resorted to cheating on a test to pass. His intelligence, measured while he was in school, scored 125. The average college graduate has a score of 115, which puts Feynman only modestly higher. Perhaps, as has been argued afterward, Feynman’s genius failed to be captured in his IQ score, or it simply was a poorly administered test. However, for someone so celebrated for a mind beyond comprehension, these facts remind us that Feynman was mortal.

What about Feynman’s mental calculus? In this case, we have Feynman’s words himself for how he could compute so much faster than the abacus salesman or his mathematician colleagues. The cube root of 1,729.03? Feynman explained, “I happened to know that a cubic foot contains 1728 cubic inches, so the answer is a tiny bit more than 12. The excess, 1.03, is only one part in nearly 2000, and I had learned in calculus that for small fractions, the cube root’s excess is one-third of the number’s excess. So all I had to do was find the fraction 1/1728, and multiply by 4.” The constant e to the power of 1.4? Feynman revealed, “because of radioactivity (mean-life and half-life), I knew the log of 2 to the base e, which is .69315 (so I also knew that e to the power of .7 is nearly equal to 2).” To go to the power of 1.4, he’d just have to multiply that number against itself. “[S]heer luck,” he explained.3 The secret was his impressive memory for certain arithmetic results and an intuition with numbers that enabled him to interpolate. However, the lucky picks of his examiners allowed him to leave an impression of a magical ability to calculate.

How about the famous lock picking? Once again, it was magic, in the same sense as a magician performing well-practiced tricks. He obsessed over figuring out how combination locks worked. One day he realized that by fiddling with a lock when it was open, he could figure out the last two numbers on the safe. He would write them down on a note after he left the person’s office and then could sneak back in, crack the remaining number with some patience, and leave ominous notes behind.

Even his magical intuition for physics had its explanation: “I had a scheme, which I still use today when somebody is explaining something that I’m trying to understand: I keep making up examples.” Instead of trying to follow an equation, he would try to imagine the situation it described. As more information was given, he’d work it through on his example. Then whenever his interlocutor made a mistake, he could see it. “As they’re telling me the conditions of the theorem, I construct something which fits all the conditions. You know, you have a set (one ball)—disjoint (two balls). Then the balls turn colors, grow hairs, or whatever, in my head as they put more conditions on. Finally they state the theorem, which is some dumb thing about the ball which isn’t true for my hairy green ball thing, so I say, ‘False!’”4

Magic, perhaps, Feynman did not possess, but an incredible intuition for numbers and physics he certainly did. This might downplay the idea that his mind worked in a fundamentally different way from yours or mine, but it doesn’t negate the impressiveness of his feats. After all, even knowing the logic behind Feynman’s sleight of hand, I’m certain I wouldn’t have been able to calculate the numbers he did so effortlessly or follow some complex theory in my mind’s eye. This explanation doesn’t provide the satisfying “Aha!” that it would have had the magician’s trick been revealed as something trivial. Therefore, we need to dig deeper to an understanding of how someone such as Feynman could develop this incredible intuition in the first place.

Inside the Mind of the Magician

Psychological researchers have investigated the problem of how intuitive experts, such as Feynman, think differently about problems than novices do. In a famous study, advanced PhDs and undergraduate physics students were given sets of physics problems and asked to sort them into categories.5 Immediately, a stark difference became apparent. Whereas beginners tended to look at superficial features of the problem—such as whether the problem was about pulleys or inclined planes—experts focused on the deeper principles at work. “Ah, so it’s a conservation of energy problem,” you can almost hear them saying as they categorized the problem by what principles of physics they represented. This approach is more successful in solving problems because it gets to the core of how the problems work. The surface features of a problem don’t always relate to the correct procedure needed to solve it. The students needed much more trial and error to home in on the correct method, whereas the experts could immediately start with the right approach.

If the principles-first way of thinking of problems is so much more effective, why don’t students start there instead of attending to superficial characteristics? The simple answer may be that they can’t. Only by developing enough experience with problem solving can you build up a deep mental model of how other problems work. Intuition sounds magical, but the reality may be more banal—the product of a large volume of organized experience dealing with the problem.

Another study, this time comparing chess masters and beginners, offered an explanation of why this might be so.6 The memory for chess positions of experts and novices was tested by showing them a particular chess setup and then asking them to re-create it on an empty board. The masters could recall far more than the beginners. The new players needed to put down pieces one by one and were often unable to fully remember all the details of the position. The masters, in contrast, remembered the board in larger “chunks” with several pieces corresponding to a recognizable pattern put down at the same time. Psychologists theorize that the difference between grand masters and novices is not that grand masters can compute many more moves ahead but that they have built up huge libraries of mental representations that come from playing actual games. Researchers have estimated that having around 50,000 of these mental “chunks” stored in long-term memory is necessary to reach expert status.7 These representations allow them to take a complex chess setup and reduce it to a few key patterns that can be worked with intuitively. Beginners, who lack this ability, have to resort to representing each piece as a single unit and are therefore much slower.*

This facility of chess grand masters, however, is limited to the patterns that come from real chess games. Give beginners and experts a randomized chess board (one that doesn’t arise from normal play), and the experts no longer display the same marked advantage. Without the library of memorized patterns at their disposal, they have to resort to the beginner’s method of remembering the board piece by piece.

This research gives us a glimpse into how the mind of a great intuitionist such as Feynman operated. He, too, focused on principles first, building off examples that cut straight to the heart of what the problem represented rather than focusing on superficial features. His ability to do this was also built off an impressive library of stored physics and math patterns. His mental calculation feats seem impressive to us but were trivial to him, because he happened to know so many mathematical patterns. Like chess grand masters, when given real physics problems he excelled because he had built a huge library of patterns from real experiences with physics. However, his intuition, too, would fail him when the subject of his study wasn’t built on those assumptions. Feynman’s mathematician friends would test him on counterintuitive theorems from mathematics. His intuition there would fail when properties of the procedure (such as that an object can be cut into infinitely small pieces) defied the normal physical limitations that aided his intuition elsewhere.

Feynman’s magic was his incredible intuition, coming from years of playing with the patterns of math and physics. Could emulating his approach to learning enable someone else to capture some of that magic? Let’s look at some of Feynman’s hallmark approaches to learning and solving problems and try to reveal some of the magician’s secrets.

How to Build Your Intuition

Simply spending a lot of time studying something isn’t enough to create a deep intuition. Feynman’s own experience demonstrates this. On numerous occasions, he would encounter students who memorized solutions to a particular problem but failed to see how they applied outside the textbook domain. In one story, he tricked some of his classmates into believing that a French curve (a device for drawing curved lines) was special because, no matter how you hold it, the bottom is tangent to a horizontal line. This, however, is true of any smooth shape, and it is an elementary fact of calculus that his fellow classmates should have realized. Feynman saw this as an example of a particularly “brittle” way of learning things, since students didn’t really think about relating what they had learned to problems outside the textbook.

How, then, can someone avoid a similar fate—spending a lot of time learning something without really developing the flexible intuition for it that made Feynman famous? There’s no precise recipe for doing so, and a healthy dose of experience and smarts certainly helps. However, Feynman’s own account of his learning process offers some useful guidelines for how he did things differently.



要查看或添加评论,请登录

Jose R F Junior的更多文章

社区洞察

其他会员也浏览了