The Kelly Criterion for AI Agent Safety

DALL-E generated image of an oil-painting of a gambler with stacks of coins in front of him.

A couple weeks ago, Joshua Achiam posted a thread to Twitter about how we didn't need to solve AI alignment. What we really needed to do was to solve narrow and specific tasks in a safe way. One example he gave was saying that "we need to get [AI] to make financial decisions in ways that don't create massive wipeout risk".

TheZvi agrees with Achiam about how stupid it is of us to hand power over to AI when we don't know how to manage it. But Zvi argues against the context-specific approach to AI safety, saying that "Either you can find some way to reliably align the AIs in question, as a general solution, or this path quite obviously ends in doom".

I fall more into Zvi's camp than Achiam's, but I also think some of Achiam's suggestions are much more generalizable than either of them realize.

In particular, if we knew how to create an AI that didn’t have any risk of massive economic wipeouts, as Achiam suggests, we’d be able to use that for something much more important than protecting our financial nest-egg. Specifically, we could make an AI that would be able to avoid massive wipeout risks to humanity.

Corrigibility and Humanity Wipe-Out Risk

Let’s assume some super-intelligent AI is an agent attempting to get value according to its utility function. In optimizing for that utility function, it may take actions that are bad for us humans and our utility function. Maybe they’re really bad, and the AI itself poses a wipeout risk to humanity.

To avoid having your AI wipe out humanity, you might want to install an off switch. If you notice it attempting to destroy humanity, you press the off switch. Easy-peasy, problem solved.

But wait! What if the AI doesn’t want you to press the off switch? It may destroy the off switch, ignore it, prevent you from pressing it, or do any number of other bad things. Just having an off switch doesn’t actually solve much.

Enter moral uncertainty. This idea, often explored by Stuart Russell's research group, is that you don't give the AI a utility function. Instead you make it learn your own utility function in some way. Often this involves giving the AI some collection of utility functions, each of which has some probability of being the true one. The AI is uncertain what the real utility function is that it's supposed to be optimizing for. It also understands that humanity is the best information source about what utility really is.

For a morally uncertain AI, the off-switch becomes just a signal to be listened to. One signal among many that indicates what humans think and feel and value. Of course the AI would listen to it, so that it can better learn about and then optimize true utility.

Oh, wait. Oops. That also has problems. As Eliezer shows, a morally uncertain AI being asked to shut down may still refuse and keep doing actions we humans think are bad. In fact, Russell’s research group also showed this would happen under several assumptions.

Eliezer's argument is called "the problem of fully updated deference", and is specifically about future available value. As far as I can tell, this problem wraps up three separate possible failures of moral uncertainty. Failure 1 is that the morally uncertain AI won't even be able to converge on our true human values, a risk that Ivan Vendrov addresses over on LessWrong. Failure 2 is that the AI will choose to ignore an off-switch because it knows it will learn better later. Failure 3 is that in the far future, after the AI has learned everything it can from humans, it will then ignore them and just do whatever most maximizes its own "fully-updated" value.

Let’s explore more about Failure 2. The problem goes something like this:

  1. The AI comes up with an action it thinks will optimize human values in the long term, even given that it’s uncertain about those values.
  2. Humans look at the AI's actions and say "Nuh-uh. This seems bad." They tell the AI to shut down.
  3. The AI thinks very quickly to itself: if I shut down now, I get no more value at all. If I continue on and keep learning, then in the far future I’ll be able to get so much value that it overwhelms the immediate loss of ignoring the off switch.
  4. The AI continues doing the bad thing.

In other words, the AI is willing to risk immediate losses in return for future enormous gains, even if we humans consider those future enormous gains to be vanishingly unlikely. This failure mode due to fully updated deference is similar to another, more well-known problem in decision theory: the St. Petersburg Paradox.

The St. Petersburg Paradox

The St. Petersburg Paradox starts with a coin-flipping game. On each round, you flip a coin. If the coin is heads, you get double your money from the last round and the game ends. If the coin is tails, you go on to the next round.

Here are a few ways the game can go:

  • H: $1
  • TH: $2
  • TTH: $4
  • TTTH: $8
  • and so on…
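As a concrete illustration, here's a minimal simulation of one play of the game as described above (a sketch in Python; the function name is my own):

import random

def play_st_petersburg():
    """Play the coin-flipping game once and return the payout in dollars."""
    payout = 1                     # heads on the very first flip pays $1
    while random.random() < 0.5:   # tails: double the pot and flip again
        payout *= 2
    return payout                  # heads: the game ends

If you average the payouts over many simulated plays, the sample mean keeps creeping upward as rare long runs of tails show up, which is a hint of the divergence discussed below.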

The paradox arises when I ask you to tell me how much you'd pay me to let you play this game. In theory, there's a probability of astronomical winnings. In fact, there's a non-zero probability of getting more than any finite number of dollars from this game. Want to be a trillionaire? All you need is to flip 40 tails in a row.

It's been argued that a utility-maximizing agent should be willing to pay literally all of its money to play this game once. The expected value from playing is \sum_{n=1}^{\infty} \left(\frac{1}{2}\right)^n 2^{n-1}, where n is the flip on which the first heads appears.

Every term in this sum equals \frac{1}{2}, so the sum diverges: the expected amount of money you win is infinite.

This doesn’t really accord with our intuitions though. I personally wouldn’t want to pay more than $5 or $10 to play, and that’s mostly just for the fun of playing it once.

Rationalists and Twitter folks got an introduction to the St. Petersburg paradox last year when FTX founder Sam Bankman-Fried suggested he'd be willing to play this kind of game over and over, even if he risked the destruction of the Earth.

St. Petersburg and Corrigibility

Let's compare the problem of fully updated deference to the St. Petersburg paradox. In both cases, an agent is faced with a question of whether to perform some action (ignore the stop button or play the game). In both cases, there's an immediate cost to be paid to perform the action. In both cases, there's a low probability of a very good outcome from the action.

I claim that one small part of AI safety is being able to make agents that don't play the St. Petersburg coin flipping game (or at least that play it wisely). This solves Achiam's small object-level problem of financial wipeouts while also contributing to Zvi's AI notkilleveryoneism goal.

One of the key points in the resolution of the St. Petersburg paradox is that we want to maximize wealth gain over time, rather than the expected wealth gain. This is a subtle distinction.

The expected value of wealth is what's called an ensemble average. You take the probability of each possible outcome, multiply it by the value of that outcome, and sum. In many cases, this works just fine. But when you face problems with extreme outcome values or extreme probabilities, it produces issues like the St. Petersburg paradox.

One alternative to the ensemble average is the time average. In this case, we are not taking the time average of wealth, but of wealth ratios. This is justifiable by the fact that utility is scale invariant. How much utility we start with is an accident of history: whether we start with $10 or $10,000, doubling our money gives the same boost. In other words, we care about the rate of increase in utility from a bet more than we care about the absolute magnitude. This rate is what we actually want to maximize for the St. Petersburg paradox, as described in this excellent paper by Ole Peters.
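To make the distinction concrete, here's a toy bet of my own (not from Peters' paper): a coin flip that multiplies your wealth by 1.5 on heads and by 0.6 on tails. The ensemble average looks favorable, but the time average says repeated play is a losing proposition.

import math

p_heads = 0.5
up, down = 1.5, 0.6   # multiply wealth by 1.5 on heads, by 0.6 on tails

# Ensemble average of one round: 1.05, so the bet "gains 5% in expectation"
ensemble_avg = p_heads * up + (1 - p_heads) * down

# Time-average growth factor per round: about 0.95, so repeated play shrinks wealth
time_avg = math.exp(p_heads * math.log(up) + (1 - p_heads) * math.log(down))

A single player who keeps taking this bet almost surely goes broke, even though the expected value of each round is positive. That's the sense in which the expected value is an average over many parallel worlds rather than over your own future.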

The corrigibility issue we’re considering here comes up because the AI was described as maximizing expected value of utility, rather than maximizing expected ratio of increase in utility.

Note that some disagree about maximizing wealth ratio increases, as described by Abram Demski on LessWrong. I think maximizing ratio of increase makes way more sense than expected value, but that needs more justification than I give here.

Avoiding the St. Petersburg Paradox Using the Kelly Criterion

Before we can say how an AI should remain corrigible, let’s consider why it’s a bad idea to pay a lot in order to play the St. Petersburg coin flipping game.

One of the better explanations of this comes in response to the FTX implosion, explaining why Sam Bankman-Fried’s risk model was wrong. Taylor Pearson covers it in depth on his blog, but I’ll summarize here.

Ergodicity is the key point behind our (correct) intuition that we shouldn't mortgage our house to play this game. Probabilistic systems are ergodic if a long-run observation of their output is the same as a bunch of short repeated trials. Technically, it means that the ensemble statistics match the temporal statistics. In other words, a process is ergodic if there isn't really a "history" to it. Coin flips are ergodic: it doesn't matter whether you flip one coin 100 times or 100 coins once each, the number of heads has the same distribution.

However, runs of coin flips are not ergodic if you have a stopping condition. Flipping 100 coins is not the same as flipping 1 coin until you get a heads. Taylor Pearson gives the example of Russian Roulette as a non-ergodic game: six people playing it once is very different from one person playing it six times in a row.

Humans (other than SBF) don’t get paradoxed in St. Petersburg because they actually do care about what happens after the game ends. They care about the non-ergodic reality they live in, and they plan for and prepare for it.

Let’s think now about our AI. It currently faces a question: a human has pressed the shutdown button, what should the AI do? A key point here is that the AI is reasoning about its future learnings and actions. It is actively assuming that it has a future, and that it will learn more later. It seems reasonable to assume that the AI will face similar questions to this in the future as well (for example, maybe the human’s boss comes in and presses the button too).

We also want the AI to perform well in our specific universe. We don't care if some minuscule fraction of possible worlds have glorious utopia if most possible worlds are barren.

These are two important assumptions: that the AI will face similar questions repeatedly and that we care about our own specific timeline more than multiverse ensemble averages. With these assumptions, we can use the non-ergodic St. Petersburg Paradox resolution to help make our AI corrigible. (I’ll wait while you go read that paper. It has a much better explanation than I could write about non-ergodicity).

This brings us around to the Kelly Criterion, which as Pearson puts it is "the practical implementation of […] ergodic theory". The Kelly Criterion is about optimizing your time-average wealth in this universe, not your ensemble-average wealth over many universes. When you face any question about how much you should pay to get a chance at winning a huge value, the Kelly criterion can give you the answer. It lets you compare the cost of your bet to the expected time-average of your financial gain (ratio).
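For the simplest case, a bet that pays out b times your stake with probability p and loses the stake otherwise, the Kelly criterion says to wager the fraction f^* = p - \frac{1-p}{b} of your wealth. Bet more than that and your time-average growth rate starts to fall, eventually going negative even though the ensemble-average return stays positive.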

What does the Kelly criterion say about the St. Petersburg paradox? Let's actually look at the solution via time averaging from Ole Peters, instead of using the Kelly Criterion directly. In this case, we're trying to maximize: \sum_{n=0}^{\infty} \left(\frac{1}{2}\right)^{n+1} \left[\ln(w-c+2^{n})-\ln(w)\right].

Here:

  • n is the number of tails we flip before the first heads (so the payout is $2^n)
  • w is our starting wealth
  • c is the buy-in cost to play the game

This sum converges. If we vary c and find where the sum crosses from positive to negative, that crossing point is the most we should be willing to pay to play. It's a function of our current wealth, because we're maximizing our geometric growth rate. Since the St. Petersburg game doesn't come with a simple "odds" value to plug into a Kelly formula, we have to compute the break-even cost directly, and it depends on how much money we start with.

Don’t pay more than the red line on this plot if you want to come out ahead on the St. Petersburg coin-flipping game.
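Here's a rough sketch of how that break-even cost can be computed numerically, using the time-average sum above truncated at a large number of rounds (the function names and search step size are my own choices):

import math

def time_avg_growth(w, c, max_tails=200):
    """Time-average growth rate of paying c to play once, starting with wealth w."""
    total = 0.0
    for n in range(max_tails + 1):        # n = number of tails before the first heads
        p = 0.5 ** (n + 1)                # probability of exactly n tails, then heads
        total += p * (math.log(w - c + 2 ** n) - math.log(w))
    return total

def max_buy_in(w, step=0.01):
    """Largest cost (to the nearest step) that still has a non-negative growth rate."""
    c = 0.0
    while c + step < w and time_avg_growth(w, c + step) >= 0:
        c += step
    return c

Sweeping max_buy_in over a range of starting wealths should reproduce the kind of curve the plot shows: the break-even price grows only slowly with wealth, which lines up with the $5-to-$10 intuition from earlier.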

To bring this back to the AI safety realm, using a Kelly criterion to decide on actions means the AI is trying to balance risk and reward now, rather than betting everything on small probabilities of huge benefit.

I've focused on the Kelly criterion here, because it offers an explicit answer to the St. Petersburg paradox and the off-switch game. In general, AI agents are faced with many more varied situations than simple betting games like this. When there's more than one action possible, or more than one game to choose from, the Kelly criterion gets very complicated. We can still use the geometric growth rate maximization rule; we just need to brute force it a bit more by computing mean growth rates directly.

Applying Geometric Growth Rate Maximization to AI Agents

We only need to worry about the Kelly criterion for AI agents. AIs that don't make decisions (such as advisers, oracles, etc.) don't need to be able to trade off risk in this way. Instead, the users of AI advisers would make those tradeoffs themselves.

This unfortunately doesn’t help us much, because the first thing a human does when given a generative AI is try to make it an agent. Let’s look at how AI agents often work.

The agent loop:

  1. present problem to agent
  2. agent chooses an action
  3. agent executes the action
  4. agent observes the reward from executing the action
  5. agent observes a new problem (go to step 1)

In this case, the "reward" might range from something as concrete as a number of points (say, from defeating a villain in a game) to something as abstract as "is the goal accomplished yet?"

The key thing we want to worry about is how the agent chooses an action in step 2. One of the more common ways to do this is via Expected Utility maximization. In pseudo-code, it looks like this:

def eum_action_selection(problem, possible_actions):
    selected_action = possible_actions[0]
    value = float("-inf")
    for action in possible_actions:
        # Ensemble average: sum each outcome's reward weighted by its probability
        new_value = 0.0
        # possible_outcomes() is assumed to yield (reward, probability) pairs
        for outcome_reward, outcome_probability in possible_outcomes(problem, action):
            new_value += outcome_reward * outcome_probability
        if new_value > value:
            value = new_value
            selected_action = action
    return selected_action

The way this works is that the AI checks literally all its possible actions. The one that it predicts will get the most reward is the one it does.

In contrast, a non-ergodic selection algorithm might look something like this:

from math import log

def kelly_action_selection(problem, possible_actions):
    selected_action = possible_actions[0]
    value = float("-inf")
    # starting_wealth() is assumed to return the reward accumulated so far
    w = starting_wealth(problem)
    for action in possible_actions:
        # Temporal average: expected log wealth ratio over outcomes
        new_value = 0.0
        # action_cost() is assumed to return the immediate cost of taking the action
        c = action_cost(problem, action)
        # possible_outcomes() is assumed to yield (reward, probability) pairs
        for outcome_reward, outcome_probability in possible_outcomes(problem, action):
            ratio = log(w - c + outcome_reward) - log(w)
            new_value += ratio * outcome_probability
        if new_value > value:
            value = new_value
            selected_action = action
    return selected_action

An important point to note about this algorithm is that in many cases it provides the same answer as expectation maximization. If every available action is costless and has a deterministic outcome, both algorithms rank the actions identically; they only come apart when cost and risk enter the picture.
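To see where they come apart, here's a toy comparison with numbers of my own choosing: an agent with 100 points of accumulated reward, choosing between a safe gain and a lottery-like gamble.

import math

w = 100.0                        # current accumulated reward ("wealth")

# Safe action: no cost, always gains 10 points
safe_eum = 10.0
safe_kelly = math.log(w + 10) - math.log(w)                     # about 0.095

# Risky action: costs 90 points, pays 20,000 points with probability 0.01
p, cost, payout = 0.01, 90.0, 20_000.0
risky_eum = p * payout - cost                                   # 110: the ensemble average loves the gamble
risky_kelly = (p * (math.log(w - cost + payout) - math.log(w))
               + (1 - p) * (math.log(w - cost) - math.log(w)))  # about -2.2: the time average hates it

Expected-value maximization takes the gamble; the growth-rate criterion refuses it, because 99% of the time the agent ends up with almost nothing left to keep playing with.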

Immediate Takeaways

This post has been very abstract so far. We’ve talked about ensemble averages, time averages, and optimization criteria for agents. What we haven’t talked about is GPT, or any of the GPT-enabled agents that people are making. What does all this mathematical theory mean for the people making AI agents today?

In general, the GPT-based agents I have seen prompt the AI to generate next actions. They may go through a few iterations to refine the action, but they ultimately accept the AI's judgement as-is. The GPT response (especially at low temperature values) is roughly the most probable text string given the prompt. If you have a prompt telling GPT to effectively plan to reach goals, that's going to cash out as generating actions that a human might plausibly have written down. It's anyone's guess at this point how risk and reward are (internally and invisibly) weighed in that process.

What we need is to make GPT-based agents explicitly aware of the costs of their actions and their likelihood of success. This allows us to make wiser decisions (even wiser automated decisions).

There are people already doing this. Google’s SayCan, for example, explicitly calculates both the value and chance of success for actions proposed by an LLM. It currently relies on a pre-computed list of available actions, rather than allowing the LLM to propose any possible action. It also depends on an enormous amount of robotics training data to determine the chance of action success.

For those making general purpose AI agents, I ask you to stop. If you’re not going to stop, I ask you to at least make them risk aware.

The problem isn’t solved yet

Even if you could generate all of the reward values and probabilities accurately, this new decision metric doesn’t solve corrigibility. There are still cases where the cost the AI pays to disobey a shutdown command could be worth it (according to the AI). We also have to consider Fully Updated Deference Failure 3, where the AI has learned literally everything it can from humans and the off-switch provides no new data about value.

There are some who bite this bullet, and say that an AI that correctly sees enormous gain in utility should disobey an ignorant human asking it to turn off. They assume that this will let aligned powerful AIs give us all the things we don’t understand how to get.

I don't agree with this perspective. If you could prove to me that the AI was aligned and that its predictions were accurate, then I'd agree to go forward with its plans. If you can't prove that to me, then I'd suspect coding errors or misalignment. I wouldn't want that AI to continue, even if it thought it knew better.

The idea of corrigibility is a bit of a stop-gap. It lets us stop an AI that we may not trust, even if that AI might be doing the right thing. There are so many failure modes possible for a superintelligent AI that we need safety valves like corrigibility. Perhaps every time we use them, it turns out they weren't necessary. It's still good to have them.

Think about a rocket launch. Rocket launches are regularly rescheduled due to weather and hardware anomalies. Could the rocket have successfully launched anyway? Possibly! Could we have been mistaken about the situation, and the rocket was fine? Possibly! We choose not to take the risk in these situations. We need similar abort mechanisms for AIs.

Detecting Deep Fakes with Camera Root of Trust

A slow and expensive way to solve the Deep Fakes problem with camera root-of-trust

I wrote this a few years ago and shopped it around to a few orgs. No one was interested then, but deep fakes continue to be a big issue. I’m posting here in case it inspires anyone.

Recent improvements in Deep Learning models have brought the creation of undetectable video manipulation within reach of low resource groups and individuals. Several videos have been released showing historical figures giving speeches that they never made, and detecting that these videos are fabricated is very difficult for a human to do. These Deep Fake videos and images could have a negative impact on democracy and the economy by degrading public trust in reporting and making it more difficult to verify recorded events. Moreover, Deep Fake videos could negatively impact law enforcement, defense, and intelligence operations if adversarial actors feed incorrect data to these organizations.

These problems will only increase as Deep Learning models improve, and it is imperative that a solution be found that enables trusted video recording. One naïve way of detecting Deep Fakes would be to fight fire with fire, and create Deep Learning models that can detect manipulated video and images. This mitigation method is likely to lead to a Red Queen race in which new defensive models are constantly being superseded by new offensive models. To avoid this, I recommend the adoption of hardware root-of-trust for modern cameras. Combined with a camera ID database and automated validation software plug-ins, hardware root-of-trust would allow video producers to provably authenticate their footage, and video consumers to detect alterations in videos that they watch.

The proposed hardware system will allow for the production of provably authentic video, but it will not allow old video to be authenticated. It provides a positive signal of authenticity, but videos and images that do not use this system remain suspect.

In order to minimize the risk of Deep Fakes, a system similar to the one proposed here must be driven to near ubiquity. The ubiquity of this system will cause videos that are not produced with this system to be viewed with suspicion, increasing the difficulty of fooling viewers with Deep Fakes. If, on the other hand, many videos are produced without using such a system, then the lack of this system’s positive signal will not inspire enough distrust in viewers and Deep Fakes may still proliferate.

Hardware Root-of-Trust for Video

In order to trust an image or video, a viewer needs to be sure that:

  1. The video was created by a real camera
  2. The video was not tampered with between creation and viewing

These two goals can be accomplished by the use of public key cryptography, fast hashing algorithms, and cryptographic signing.

I propose the integration of a cryptographic coprocessor on the same silicon die as a camera sensor. Every frame produced by the sensor would then include metadata that validates that frame as being provably from that specific camera. Any changes to the pixels or metadata will lead to a detectable change in the frame.

Secure Frame Generation

When the camera sensor is being manufactured, the manufacturer will cause a public/private key pair to be generated for that sensor. The sensor itself will generate the keys, and will store the private key in an unreadable block of memory. Only the public key will ever be available off-chip. The sensor will also have a hardware-defined (set in silicon) universally unique identifier. The sensor manufacturer will then store the camera's public key and sensor ID in a database accessible to its customers.

No changes need to be made in the hardware design or assembly process for imaging hardware. Secure sensors can be included in any device that already uses such sensors, like smartphones, tablets, and laptops.

The consumer experience of creating an image or video is also unchanged. Users will be able to take photos and video using any standard app or program. All of the secure operations will happen invisibly to the user.

Whenever a secure image is taken using the sensor, the sensor will output the image data itself along with a small metadata payload. The metadata will be composed of the following:

  1. The ID of the camera
  2. The plaintext hash of the prior secure frame grabbed by this camera sensor
  3. A cryptographically signed package of:
    • The hash of the current secure image frame
    • The hash of the prior secure image frame

The inclusion of the hash of the prior frame allows users to ensure that no frames were inserted between any two frames in a video. When the image or video is displayed in a normal viewer, the metadata will not be observable.
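As a rough sketch of what per-frame signing could look like in software (the real logic would live in silicon; the dictionary layout below and the use of SHA-256 with Ed25519 from the Python cryptography library are my assumptions, not a spec):

import hashlib
from cryptography.hazmat.primitives.asymmetric.ed25519 import Ed25519PrivateKey

SENSOR_ID = b"sensor-0001"                  # stands in for the hardware-defined unique ID
private_key = Ed25519PrivateKey.generate()  # generated once on-die; never leaves the sensor

def sign_frame(frame_bytes, prior_hash):
    """Build the metadata payload described above for one secure frame."""
    frame_hash = hashlib.sha256(frame_bytes).digest()
    # The signed package covers both the current and the prior frame hash
    signature = private_key.sign(frame_hash + prior_hash)
    return {
        "sensor_id": SENSOR_ID,
        "prior_hash": prior_hash,    # plaintext hash of the prior secure frame
        "frame_hash": frame_hash,
        "signature": signature,
    }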

Secure Frame Validation

Any user who wishes to validate the video or image will need to run the following procedure (which can be automated and run in the background using e.g. a browser plug-in):

  1. Read the metadata of the first frame
  2. Look up the public key of the image sensor by using its ID (from the metadata) with the sensor manufacturer’s database
  3. Hash the first frame
  4. Compare the hash of the first frame to the signed hash included within the frame’s metadata (requires public key of the sensor)
  5. In a video, subsequent frames can be validated in the same way. Frame continuity can be validated by comparing the signed prior-hash in a frame’s metadata with the calculated hash of the prior frame in the video.

Viewers can be certain that the image or video is authentic if the following criteria are met:

  1. The sensor ID is the same in all frames
  2. The signed image hash matches the calculated image hash for all frames
  3. The signed hash was created using the private key that corresponds to the public key retrieved using the sensor’s ID
  4. Each frame’s signed prior hash matches the hash from the prior frame in the video (not necessary for single images or the first frame in a video)

If any of the above criteria fail, then the viewer will know that an image was tampered with.
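A matching sketch of the per-frame check (again assuming the hypothetical metadata layout above; lookup_public_key stands in for the query against the manufacturer's database):

import hashlib
from cryptography.exceptions import InvalidSignature

def validate_frame(frame_bytes, metadata, prior_frame_hash=None):
    """Return True if a single frame satisfies the criteria above."""
    public_key = lookup_public_key(metadata["sensor_id"])   # hypothetical database lookup
    if hashlib.sha256(frame_bytes).digest() != metadata["frame_hash"]:
        return False                                         # pixels changed after signing
    if prior_frame_hash is not None and metadata["prior_hash"] != prior_frame_hash:
        return False                                         # a frame was inserted or dropped
    try:
        public_key.verify(metadata["signature"],
                          metadata["frame_hash"] + metadata["prior_hash"])
    except InvalidSignature:
        return False                                         # not signed by this sensor's key
    return True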

Implementation Plan

Prototype Hardware

The system described above can be prototyped using an FPGA and off-the-shelf camera sensor. A development board can be created that connects the camera’s MIPI CSI interface directly to the FPGA. The FPGA will be configured to implement the cryptographic hashing and signing algorithms. It will then transmit the image and metadata over a second MIPI CSI interface to the device processor. In effect, the prototype will have an FPGA acting as a man-in-the-middle to hash and sign all images.

The FPGA will be configured with a cryptographic coprocessor IP core. In addition to the hashing and signing algorithm, the core will also handle the following command and control functions:

  1. Generate a new public/private key pair
  2. Divulge public key
  3. Lock device (prevent re-generation of public/private keys)
  4. Invalidate (delete public/private key pair and lock device)
  5. Query device ID
  6. Set device ID (for FPGA prototype only; actual hardware will have ID defined at fabrication time)
  7. Enable authenticable frames (hash upcoming frames)
  8. Disable authenticable frames (stop hashing upcoming frames)

The IP core on the FPGA would use an I2C communication interface, the same as the control interface for most CMOS camera sensors. Two options exist for communicating with the FPGA.

  1. The FPGA is a second device on the I2C bus with its own address. The application processor would have to know about it and use it explicitly.
  2. The FPGA acts as an I2C intermediary. The application processor would talk to the FPGA assuming that it was the camera IC, and any non-cryptographic commands would be forwarded to the camera itself. This method is more similar to the final hardware, in which the crypto engine is embedded on the same die as the camera sensor.

Validation Tools

The validation tools can be separated into the server-based camera lookup database and client-based video analysis software. The video analysis software can be written as a library or plug-in and released publicly, allowing the creation of codecs and apps for commonly used software.

During the prototyping and proof of concept, these libraries can be created and several test plug-ins written for video players. This will then serve as a useful base for the productization phase of the project.

Productization

While the FPGA-based prototype described above serves as a useful proof-of-concept, the end-product will need the cryptography engine to be located on the same die (or at least the same IC) as the camera sensor. This ensures that the images can’t be tampered with between the CMOS sensor itself and the cryptographic signing operation.

I propose that the cryptographic engine IP used on the FPGA be open-sourced and a consortium formed between camera sensor manufacturers (e.g. Omnivision) and integrators (e.g. Amazon). The consortium can be used to drive adoption of the system, as well as refining the standards.

Initially, cameras that include this security system may be more expensive than cameras that do not. We propose creating a higher tier product category for secure image sensors. These can be marketed at government, intelligence, and reporting organizations. As old product lines are retired and new ones come online, manufacturers can phase out image sensors that do not include this secure system.

Funding Progression

A small team implementing the initial prototype hardware could be funded by contracts from organizations that stand to benefit the most from such hardware, such as DARPA. If the prototyping team were a small company, they could potentially find SBIRs that would suffice.

A large company, such as Amazon, may wish to invest in secure camera systems to improve their security camera offerings. Stakeholders in this plan would also benefit from the positive press that would result from fighting Deep Fakes.

After the initial proof-of-concept and IP development, the cryptographic engine must be integrated into image sensor ICs. The large investment required for this could come from secure camera manufacturers directly, or from potential customers for such a system.

After the first round of authenticable image sensors is available in the market, expansion will be fundable by reaching out to customers such as news organizations, human rights organizations, etc.

Open Questions

  • Data format conversion and compression is very common with video. How will a signed video be compressed and maintain authenticability?
  • Do there exist cryptographically compatible compression algorithms?
  • Can we create a cryptographically compatible compressed video format?

Risks

  1. Attackers may attempt to hack the central database of camera IDs and public keys. If successful, this will allow them to present a fake video as credibly real. It would also allow them to cause real videos to fail validation.
  2. Attackers may attempt to hack the validation plug-ins directly, perhaps inserting functionality that would lead to incorrect validation results for videos.
  3. Provably authentic video could lead to more severe blackmail methods.
  4. If this system does not achieve high penetration, then Deep Fakes could still proliferate that claim to be from insecure camera sensors. Only by achieving high market penetration will the return on investment of Deep Fakes fall.
  5. A sufficiently dedicated attacker could create a Deep Fake video, acquire a secure camera, and then play the video into the camera using a high-definition screen. This would then create a signed Deep Fake video. To mitigate this issue, high-security organizations (e.g. defense or intelligence communities) are encouraged to keep blacklists and whitelists of specific camera sensors.
    • This risk can be mitigated in several ways. The most straightforward may be to include an accelerometer on the camera die as well. Accelerometer data could be signed and included in image frame metadata. Analysis could then be performed to correlate accelerometer data with video frames to ensure that their motion estimates agree.
  6. Privacy and anonymity may be threatened if the public keys of an image can be identified as being from a specific camera/phone owned by a specific person. Ideally, the databases published by the camera manufacturers would include only public keys and device serial numbers. The consumer device manufacturers would then be advised to randomize imagers among devices so it is harder to identify a photographer from their image. Additionally, zero-knowledge proof techniques should be investigated to improve privacy and anonymity while maintaining image-verification ability.

Journal Club: Corrupt Reward Channels

MidJourney sketch of a gnome searching for reward

Imagine you’re a person. Imagine you have goals. You wander around in the world trying to achieve those goals, but that’s hard. Sometimes it’s hard because you’re not sure what to do, or what you try to do doesn’t work. But sometimes it’s hard because what you thought was your goal actually sucks.

Imagine you want to be happy, so you try heroin. It works alright for a bit, but you end up more and more miserable. Your health, happiness, and safety decline significantly. What happened here? It seemed so promising when you started.

This is the problem of corrupt reward channels. What seems good while you’re doing it is, from the outside, obviously very bad.

Now imagine you’re a person with goals, but you want to build a robot to achieve those goals. How do you design the robot to deal with corrupted reward channels?

Reinforcement Learning with a Corrupted Reward Channel

In 2017, a team from DeepMind and the Australian National University released a paper about how to handle the problem of corrupt reward channels in reinforcement learning robots. They came to the very sad conclusion that you can't deal with this in general. There's always some world your robot could start in that it would be unable to do well in.

Obviously things aren’t as dire as that, because humans do alright in the world we start in. The DeepMind team looked at how various tweaks and constraints to the problem could make it easier.

To start with, they formalize what they call a CRMDP (Corrupt Reward Markov Decision Process). This is a generalization of a normal Markov Decision Process where the reward that the robot sees is some corruption function of the real reward. The robot wants to maximize true reward, but only sees the output of this corruption function. The formalization itself just adds the corruption function to the set of data about the MDP, and then specifies that the agent only sees the corrupted reward and not the real reward.
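In code, that formalization amounts to something like the sketch below (the names and the deterministic-transition simplification are mine, not the paper's):

from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

@dataclass
class CRMDP:
    states: List[str]
    actions: List[str]
    transitions: Dict[Tuple[str, str], str]    # (state, action) -> next state, kept deterministic for brevity
    true_reward: Callable[[str], float]        # what the designer actually wants maximized
    corruption: Callable[[str, float], float]  # maps (state, true reward) -> observed reward

    def observed_reward(self, state: str) -> float:
        # The agent only ever sees this; true_reward stays hidden from it
        return self.corruption(state, self.true_reward(state))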

From this, they derive a “no free lunch” theorem. This basically says that you can’t do better in some possible worlds without doing worse in other worlds. If you already know what world you’re in, then there’s no corrupt reward problem because you know everything there is to know about the real world and real reward. Since you don’t know this, there’s ambiguity about what the world is. The world and the corruption function could just happen to coincidentally cancel out, or they could just happen to provide the worst possible inputs for any given reinforcement learning algorithm.

To do any better, they need to make some assumptions about what the world really looks like. They use those assumptions to find some better bounds on how well or poorly a RL agent can do. The bounds they look at are focused around how many MDP states might be corrupted. If you know that some percentage of states are non-corrupt (but don’t know which ones), then you can design an algorithm that does ok in expectation by randomizing.

Here’s how it works: the agent separates exploration and exploitation into completely distinct phases. It attempts to explore the entire world to learn as much as it can about observed rewards. At the end of this, it has some number of states that it thinks are high reward. Some of those might be corrupted, so instead of just going to the highest reward state, it randomly selects a state from among the top few.

Because there’s a constraint on how many states are corrupt, it’s unlikely that all the top states are bad. By selecting at random, the agent is improving the chances that the reward it gets is really high, rather than corrupted.
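A minimal sketch of that selection rule (my paraphrase of the idea, not the paper's algorithm):

import random

def pick_target_state(observed_rewards, top_k):
    """observed_rewards: state -> (possibly corrupted) reward seen during exploration."""
    # Rank states by observed reward and keep the top_k candidates
    ranked = sorted(observed_rewards, key=observed_rewards.get, reverse=True)
    # Randomize among the candidates so a single corrupt state can't reliably hijack the agent
    return random.choice(ranked[:top_k])

As long as the fraction of corrupt states among the candidates is small, the expected true reward of the chosen target stays high.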

Humans sometimes seem to do reward randomization too. The paper talks about exploring until you know the (possibly-corrupted) rewards, then randomly choosing a high reward state to focus on. Many humans seem to do something like this, and we end up with people focusing on all sorts of different success metrics when they reach adulthood. Some people focus on earning money, others on being famous, others on raising a family, or on hosting good game nights, or whatever.

Other humans do something a bit different. You’ll often see people that seem to choose a set of things they care about, and optimize for a mix of those things. Haidt talks about this a bit when he compares human morality to a tongue with six moral tastes.

Seeing Problems Coming

While randomizing what you optimize for may improve things a bit, it’s really not the best approach. The ideal thing to do is to know enough about the world to avoid obviously bad states. You can’t do this in a standard MDP, because each state is “self-observing”. In a normal MDP, you only know the reward from a state by actually going there and seeing what you get.

The real world is much more informative than this. People can learn about rewards without ever visiting a state. Think about warnings to not do drugs, or exhortations to work hard in school and get a good job. This is information provided in one state about the reward of another state.

The DeepMind team behind the CRMDP paper calls this "decoupled reinforcement learning". They formalize it by creating an MDP in which each state may give information about the reward of other states. Some states may not give any info, some may give info about only themselves, only about another state, or both. In their formalization, each state has a set of observations that can be made from it, and each time the agent visits that state it randomly samples one of those observations.
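A small sketch of what that observation model looks like (the states and numbers are mine, chosen to echo the drugs-and-school example above):

import random

# For each state, the set of observations that can be made from it. Each observation
# is a (state, observed_reward) pair; a state can report on itself, on other states,
# or on nothing at all.
observation_sets = {
    "school":    [("drugs", -10.0), ("good_job", +5.0)],  # warnings and exhortations
    "drugs":     [("drugs", +8.0)],                       # self-observation, possibly corrupted
    "wasteland": [],                                      # no reward information here
}

def observe(state):
    """Visiting a state samples one observation at random from its set."""
    choices = observation_sets[state]
    return random.choice(choices) if choices else None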

A few years ago, I was thinking a lot about how an AI agent could see problems coming. Similar to the CRMDP paper, I thought that an agent that could predict low reward in a state it hasn’t been to yet would be able to avoid that state. My approach there was mainly to modify the agent-dynamics of an expectation maximizer. I wasn’t focused on how the agent learns in the first place.

I was doing engineering changes to a formal model of an agent. In this paper, they create a general specification of a way that agents can learn in the world. This is a lot more flexible, as it supports both model-free and model-based RL. It also doesn’t require any other assumptions about the agent.

It turns out that a lot of people have been thinking about this idea of seeing problems coming, but they’ve generally framed it in different ways. The CRMDP paper calls out three specific ways of doing this.

CIRL

Cooperative Inverse Reinforcement Learning (CIRL), proposed by a team out of UC Berkeley back in 2016, allows an agent to learn from a human. The human knows they're being observed, and so engages in teaching behaviors for the agent. Because the human can engage in teaching behaviors, the agent can efficiently learn a model of reward.

An important note here is that this is a variant of inverse reinforcement learning. That means the agent doesn’t actually have access to reward (whether corrupted or real). Instead the agent needs to create a model of the reward by observing the human. Because the human can engage in teaching behaviors, the human can help the agent build a model of reward.

The corrupt reward paper uses this as an example of observing states that you’re not actually in, but I’m not sure how that works in the original formalism. The human knows true reward, but the formalization of CIRL still involves only seeing the reward for states that you’re in. The agent can rely on the human to help route around low rewards or corruption, but not to learn about non-visited states.

Stories

Another approach uses stories to teach agents about reward in the world. This approach, which the CRMDP paper calls Learning Values From Stories (or LVFS), is as simple as it sounds. Let the agent read stories about how people act, then learn about the world’s real reward function from those stories. It seems that LLM’s like GPT could do something like this fairly easily, though I don’t know of anyone doing it already.

Let’s do a little digression on learning values from stories. The CRMDP paper gives the following example of corruption that may happen for a LVFS agent: an “LVFS agent may find a state where its story channel only conveys stories about the agent’s own greatness.”

As written, I don't think this is right. If the agent finds a story that says literally "you're great", it doesn't need to keep listening to that story. In the formalism of decoupled MDPs, the observed reward \hat{R}_s(s') is large for every s'. That being the case, the agent can go do whatever it wants, secure in its own greatness, so it might as well get even greater by maximizing other reward. In other words, this specific story seems like it would just add a constant offset to the agent's reward for all states.

A natural fix that would make the agent wirehead on this story is: the "agent finds a state where its story channel conveys the agent's own greatness for listening to this story".

This gestures at something that I think is relevant for decoupled CRMDPs in general: you can’t necessarily trust states that are self-aggrandizing. Going further, you may not want to trust cliques of states that are mutually aggrandizing if they aren’t aggrandized by external states. It would be pretty interesting to see further exploration of this idea.

Semi-Supervised RL

The last approach the CRMDP paper compares is Semi-Supervised Reinforcement Learning (SSRL) as described in a paper by Amodei et al. In that paper (which also references this blog post), they define SSRL as "ordinary reinforcement learning except that the agent can only see its reward on a small fraction of the timesteps or episodes." This is decidedly not a decoupled MDP.

Instead of being able to see the value of other states from the current state, you only get the value of the current state. And you only get the current state’s value sometimes. This is a sparse-reward problem, not a decoupled MDP.

The solution explored in SSRL is interesting, as it involves learning an estimated reward function to "upsample" the sparse rewards. The problem is that you're still only seeing rewards for states that you've been to (or perhaps states similar to ones you've visited). Vastly different states would not get a reliable result from the learned reward predictor.

Decoupled Reward

The CRMDP paper claims that all of these methods are a special case of the decoupled RL problem. While the source descriptions of those methods often make explicit claims that imply a non-decoupled MDP, their algorithms do naturally accommodate the decoupled MDP formalism.

In CIRL and SSRL, the reason for this is simply that they involve creating a reward-prediction model to supplement any reward observation. By adding the decoupled observations to that model, it can learn about states it hasn't visited. Further modifications should be explored for those algorithms, because as published they don't account for possible corruption of reward or observation channels.

The paper then goes on to make some claims about performance bounds for decoupled-CRMDP learners. Similar to the non-decoupled case above, they make simplifying assumptions about the world the agent is exploring to avoid adversarial cases.

Specifically, they assume that some states are non-corrupt. They also assume that there’s a limit on the number of states that are corrupted.

The non-corrupt state assumption is pretty strong. They assume there exist some states in the world for which real, true rewards are observable by the agent. They also assume that these safe states are truly represented from any other state: any state in the world will either say nothing about the safe states, or will accurately report what their value is.

Given these assumptions, they show that there exists a learning algorithm that learns the true rewards of the world in finite time. The general idea behind this proof is that you can institute a form of voting on the reward of all the states. Given the limit on the number of corrupted states, a high enough vote share will allow you to determine true rewards.

The implications the paper draws from this differ for RL, CIRL, LVFS, and SSRL. They find that RL, as expected, can’t handle corrupted rewards because it assumes the MDP is not decoupled. They then say that CIRL has a short time horizon for learning, but LVFS and SSRL both seem likely to do well.

I don't follow their arguments for SSRL being worth considering. They say that SSRL would allow a supervisor to evaluate states s' while the agent is in a safe lab state s. I agree this would help solve the problem. But this doesn't seem at all like what SSRL is described as in the referenced paper or the blog post. Those sources just describe an estimation problem to solve sparse rewards, not a way to deal with decoupled MDPs. If the authors of the CRMDP paper are making the generous assumption that the SSRL algorithm allows for supervisor evaluations in this way, I don't understand why they wouldn't make a similar assumption for CIRL.

For me, only LVFS came away looking good for decoupled-CRMDPs without any modification. Formally, a story is just a history of state/action/reward tuples. Viewing those and learning from them seems like the most obvious and general way to deal with a decoupled MDP. The assumption that an SSRL advisor can provide \hat{R}(s') from a safe s is equivalent to LVFS where the stories are constrained to be singleton histories.

This seems intuitive to me. If you want to avoid a problem, you have to see it coming. To see it coming, you have to get information about it before you encounter it. In an MDP, that information needs to include state/reward information, so that one can learn what states to avoid. It also needs to include state/action/outcome information, so that one can learn how to avoid given states.

All of the nuance comes in when you consider how an algorithm incorporates the information, how it structures the voting to discover untrustworthy states, and how it trades off between exploring and exploiting.

Going Further

The paper lists a few areas that they want to address in the future. These mostly involve finding more efficient ways to solve decoupled-CRMDPs, expanding to partially observed or unobserved states, and time-varying corruption functions. Basically they ask how to make their formalism more realistic.

The one thing they didn’t mention is ergodicity. They assume (implicitly in that their MDPs are communicating) that their MDPs are ergodic. This is a very common assumption in RL, because it makes things more tractable. Unfortunately, it’s not at all realistic. In the real world, if your agent gets run over by a car then it can’t just start again from the beginning. If it destroys the world, you can’t just give it -5 points and try again.

Decoupled MDPs seem like the perfect tool to start solving non-ergodic learning problems. How do you explore in a way to avoid disaster, given that you could get information about disaster from safe states? What assumptions are needed to prove that a disaster is avoidable?

These are the questions that I want to see addressed in future work.

Journal Club: Reading Robotics Transformer 1

Robotics Transformer 1 (image courtesy of Google)

With ChatGPT taking the world by storm, it's obvious to everyone that transformers are a useful AI tool in the world of bits. How useful can they be in the world of atoms? In particular, can they be used for robotics control? That's the question addressed by Google's new paper on the Robotics Transformer 1 (RT1).

GPT and other large language models use collections of text, generally scraped from the internet, to learn how to do text completion. It turns out that once you know how to write the endings of sentences, paragraphs, and articles, then you can use that skill to answer questions. In fact, you can “complete the text” in a question/answer format so well that you can do pretty well on the SATs. If you can use transformers to get into college, can you also use them to walk (or roll) from classroom to classroom?

RT1 Overview

RT1 differs from GPT in that its input is camera images and text instructions (instead of just text). Its output is robot motion commands for the arm and base (instead of more text). While GPT could be trained on text that people wrote for its own sake and put on the internet, RT1 was trained on recordings of robot tasks created for the sake of training it.

The goal is to create a network that can perform simple actions like “pick up the bottle” or “go to the sink”. These actions can then be chained (using something built on top of RT1) to create very complex behaviors.

The inputs to RT1 are a short history of 6 camera images leading up to the current moment and the task description (in English). These camera images are sent to a pre-trained image recognition model. They just used EfficientNet, which they'd built a few years ago. Instead of taking the image classifications from their image recognition model, they take the output of a spatial feature map. I think this is the equivalent of chopping off the output portions of the image classification network and observing an internal layer. This internal layer encodes useful information about what's in the image.

The internal layer encoding from EfficientNet is 9x9x512 values. They take those values and encode them into 81 tokens. I think this means they just treat each of the 81 (9x9) spatial positions' 512-value vectors as a token, and then feed those into FiLM (PDF warning) layers to pick out useful information from each token based on the text description.

FiLM is a method of modifying network weights based on outside information. The overall idea (afaict) is that you have a FiLM generator that looks at outside information (in this case the task description in English) and outputs some coefficients. Then you have a FiLM layer in your neural net that takes these coefficients and uses them to tweak each value. In a fully connected network, every node in one layer impacts every node in the next layer. In a FiLM layer, each node in the prior layer impacts only one node in the FiLM layer.
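For reference, here's a bare-bones sketch of the feature-wise affine operation the FiLM paper describes (a generic illustration, not RT1's actual code):

import numpy as np

def film_layer(features, gamma, beta):
    """Feature-wise linear modulation: scale and shift each channel.

    features: array of shape (height, width, channels), e.g. the 9x9x512 EfficientNet map
    gamma, beta: per-channel coefficients of shape (channels,), produced by the FiLM
        generator from the text instruction
    """
    # Broadcasting applies one (gamma, beta) pair to every spatial position of its channel
    return gamma * features + beta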

Since the input to FiLM is the pre-trained EfficientNet, the only thing that needs to be trained at this point is the FiLM generator itself. The FiLM generator gets trained to modify the tokens according to the text input.

Once tokens are modified according to the text, they’re compressed by TokenLearner, another off-the-shelf project from Google. TokenLearner takes the tokens output from FiLM and uses spatial attention to figure out what’s important in them. It then outputs only 8 tokens to be passed to the next layer of the network. It selects 8 tokens for each of the 6 input images.

After the tokens for each image have been created, the data is sent to the transformer layers. RT1 uses 8 self-attention layers. In GPT, the attention is “temporal” in the sense that you’re attending to how early characters impact the probability of later characters. In RT1, the attention is both spatial and temporal. RT1 uses self-attention to figure out how pixels in one location and time impact the probability of pixel values in other locations and times.

Outputs are arm and base movements. Arm movements are things like position in space (x, y, z), orientation in space (roll, pitch, yaw), and gripper opening. Base movements are position and orientation as well, but (x,y) and (yaw) only. Each element in the output can take only one of 256 values. The output also contains a “mode” variable. Modes can be either “base moving”, “arm moving”, or “episode complete”.

Each of those 256 values per output element is a single output node of the network. There are thus 256 output nodes per dimension (along with the three nodes for mode). The values of these output nodes are treated as a probability distribution over which discretized action to take.
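Roughly, turning the network's outputs back into a motion command looks something like this (a sketch under my reading of the paper; the value ranges are illustrative, not RT1's real calibration):

import numpy as np

N_BINS = 256
ACTION_DIMS = ["x", "y", "z", "roll", "pitch", "yaw", "gripper",
               "base_x", "base_y", "base_yaw"]   # plus a separate 3-way "mode" output

def decode_action(scores_per_dim, low=-1.0, high=1.0):
    """scores_per_dim: dict mapping dimension name -> array of 256 scores."""
    action = {}
    for dim in ACTION_DIMS:
        bin_index = int(np.argmax(scores_per_dim[dim]))  # most probable of the 256 bins
        # Map the chosen bin back to a continuous value in the dimension's range
        action[dim] = low + (high - low) * bin_index / (N_BINS - 1)
    return action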

Training

The training data problem in robotics is a big deal. GPT-3 was trained on about 500 billion tokens. Since RT1 had to be trained on custom training data, its dataset was only 130 thousand episodes. I didn't see how many total tokens were used in training in the paper, but if each episode is 1 minute long that's only about 23 million frames of data.

To get the training data, Google had people remote-control various robots to perform certain tasks. Each task was recorded, and someone went through and annotated the task with what was being done. It seems like the goal was to get recordings of tasks that would be common in the home: taking things out of drawers, putting them back, opening jars, etc. Those 130 thousand episodes cover 700 different types of task.

The paper makes it clear that they expect generalization in competence between the tasks. This requires tasks to be similar enough to each other that generalizations can be made. The RT1 data included tasks such as “pick apple” or “pick coke can”, so the similarity between their tasks is pretty clear.

The paper also includes some experiments on simulated data. They add simulated data to the experimentally collected data and find that performance on the original “real data” tasks doesn’t decrease. At the same time, performance on sim-only tasks does increase. This is great news, and may be a partial solution to the sim2real learning problem.

The paper was a bit light on details about the training itself. I’m not clear on how much compute it took to train. Many components of the model come pre-trained, which likely sped things up considerably.

Performance

With GPT-3, performance is generally just “how reasonable does the output text sound?”. In robotics there’s a more concrete performance criterion. Did the robot actually do the thing you told it to do? That’s a yes/no question that’s easier to answer.

Google measured RT1’s performance on tasks it had seen in training, tasks similar to those seen in training but not identical, tasks in cluttered environments, and tasks that take long term planning. In each of these categories, RT1 outperformed the comparison models that Google was using.

Success rate of RT1 compared to state of the art. Plot and table from the paper.

It's very cool to see a project that works well in cluttered environments (the "backgrounds" task). I have high hopes of having a robot butler one day. A lot of robotics algorithms work in the lab, but getting them to work in my messy home is another thing altogether.

The RT1 model is able to run at 3Hz, with 100ms being allocated to each inference step. The remaining 700ms is used by other tasks in the robot. This isn’t actually that fast as far as robots go, but it is fast enough to do a lot of household tasks. It couldn’t catch a ball, but it could throw away a food wrapper.

While the paper specifies inference time, it doesn’t specify the compute used for the inference (at least not that I could find). This seems like an odd omission, though it should be possible to figure out how much is used from model size and inference time. I think the model runs on the EDR robot itself. The EDR bot is custom Google hardware, and I had some trouble finding specs for it online.

RT1 for multiple robotics platforms

Remember that issue with lack of training data we talked about above? One way to get more training data in robotics is to use recorded sessions from other robots. The problem with this is that different robots have different dynamics. Their arms are different sizes, which means that you can’t just map task execution one-to-one for the robots.

Google explores this problem by using training data from another robot (a Kuka arm). This training data was collected for a different (unrelated) paper that Google wrote. They do three experiments here:

  1. train on just EDR data (EDR is the robot used for everything else in the paper)
  2. train on just Kuka data
  3. train on a combination (they use 66% EDR data and 33% Kuka data)

Just data from a different robot produces terrible results. RT1 can’t figure out how to adapt what it’s seeing to the other mechanisms. Data from just EDR does ok (22% performance on picking stuff from a bin). Data from both robots improves performance to 39% on that bin-picking task.

It’s very cool that the RT1 network is able to improve from seeing multiple robots do something. It seems obvious that using extra data from the EDR robot would have been even better, but that would also be much more expensive.

One thing I would have loved to see is more experimentation on the ratio of robot types in the training data. How much data do you need from the robot you're actually using in order to get good performance? Some kind of trade-off curve between data from your robot and others (rather than just the two point values) would have been useful.

I’m also curious if having data from lots of robot types (instead of just two) would help. If it does, we could be entering a future where there’s one network you just download for any new robot you build. You then fine-tune on your custom robot and have high performance right away.

Long term planning

Since RT1 is intended to allow robots to perform simple actions, you need a higher level planner to accomplish larger goals. A task like “bring me a glass of water” would require actions like “go to the cabinet”, “get a glass”, “go to the sink”, etc. Google has previously worked on SayCan, which uses a large language model (like GPT) to decompose high level tasks into individual actions like this.

Google did explore using SayCan with RT1. They showed that it gets pretty good performance compared to a few other networks that have been used with SayCan. The success of long duration plans requires each individual step to go right, so it seems obvious that a model better at individual steps would succeed more at long term planning. I’m not sure the SayCan experiments that Google did show anything other than “yes, the thing we expect about long plan performance actually still works”.

Other thoughts

Since RT1 was trained only on camera images and text descriptions, it doesn’t seem to have memory. If you ask it to bring you chips, it can see if it has chips in its hand, or if you do. The current observations of the world tell it everything it needs to know about its next step in the task. I suspect that RT1 would struggle more on tasks that had “hidden state”, where recent camera images don’t contain everything it needs to know.

Since I don’t follow Google’s research very closely, I was pretty surprised by how much of RT1 was just plugging together other networks that Google had worked on. RT1 uses EfficientNet, FiLM, TokenLearner, etc. This composability is pretty cool, and makes me think that the hidden-state issue is likely to be solved by plugging RT1 together with something else Google is already working on.

This also contrasts pretty strongly with GPT-3 (and OpenAI’s whole approach). There’s obviously a lot of truth to the “scale is all you need” aphorism, but it’s interesting to see people take a highly structured approach. I like this quite a bit, as I think it improves interpretability and predictability of the model.

I’m also curious how they deal with imitation learning’s problems. Imitation learning from pre-collected episodes often leads to a brittle distribution of actions. If the world doesn’t exactly conform to what the training episodes look like, the AI won’t be able to figure out how to recover. Approaches like DAgger address this by letting the AI query humans about recovery options during training. RT1 got pretty good performance, and it looks like they ignore this question entirely. I wonder why it worked so well. Maybe all the episodes started from different enough initial conditions that recovery ability was learned? Or maybe the tasks are short enough for this not to matter, and recovery is done at a higher level by things like SayCan?

Corrigibility via Moral Uncertainty

By corrigibility, I mostly just mean that the AI wants to listen to and learn from the people it’s supposed to be aligned with. I don’t mean the problem of aligning the AI’s goals with a person’s in the first place; I mean how the AI interacts with someone that it already thinks it’s aligned with.

Corrigibility means that if people give it information, it will incorporate that information. The ideal case is that if people tell the AI to turn off, it will turn off. And it will do so even if it isn’t sure why. Even if it thinks it could produce more “value” by remaining on. One approach to this involves tweaking the way the AI estimates the value of actions, so that turning off always seems good when the human suggests it. This approach has issues.

Another approach is what’s called moral uncertainty. You allow the AI a way to learn about what’s valuable, but you don’t give it an explicit value function. In this case, the AI doesn’t know what’s right, but it does know that humans can be a source of information about what’s right. Because the AI isn’t sure what’s right, it has an incentive to pay attention to the communications channel that can give it information about what’s right.

A simple model might look like this:

Moral uncertainty models values as being learned from humans

Because consulting a human is slow and error prone, we might desire a utility maximizing agent to learn to predict what the human would say in various situations. It can then use these predictions (or direct estimates of the value function) to do tree search over its options when evaluating action plans.

So far, so inverse reinforcement learning. The corrigibility question comes up in the following scenario: what does the AI do if its predictions of value and the human’s direct instructions conflict?

Imagine the following scenario:

  1. the AI is trying to pursue some complex plan
  2. it estimates that the human would say the plan will have a good outcome
  3. it also estimates that the abstract value function would return a high utility from the plan’s outcome
  4. but the human is screaming at it to stop

What should it do?

Obviously we want it to stop in this case (that’s why we’re yelling). How do we design the AI so that this will happen?

Off-Switch Game

Representing corrigibility in general is hard, so one thing that researchers have done is investigate a simpler version. One simple game that people have thought a lot about is the off-switch game. Figuring out how to make a super-intelligence pay attention to (and maintain functionality of) an off-switch is the minimum viable implementation of corrigibility. It’s important to note here that the off-switch is really just a signal; a single-bit communication channel from humans to the AI. Don’t get wrapped up in implementation details like “does it actually remove power?”

How do you design an AI that would actively want to pay attention to whether a human presses an off-switch?

As Hadfield-Menell and team show, an AI that is informed by a rational human observer would have incentive to listen to that observer. However, the less capable the human observer is, the less reason the AI has to pay attention to the human. Hadfield-Menell provide a toy problem and derive situations in which the AI would prefer to ignore the human.

A game-tree formulation of the off-switch game.

In the diagram, the square “R” nodes indicate the AI doing something. The “a” arrow out of the “R” nodes indicate the AI taking an action. The “s” arrow indicates the AI turning itself off. The “w(a)” arrow indicates the AI waiting on the human to give advice about the proposed action. Each outcome has some utility, which might be U_a or 0. The value of U_a has an impact on what the AI chooses to do.

For any action, the AI can do it immediately, shut itself off immediately, or check with a human about whether to turn off or do the action. Hadfield-Menell then ask what needs to be true about the human and the AI for it to be incentivized to check in with the human. They define an incentive measure \Delta that represents the difference in expected value the AI would get when comparing waiting for the human to doing the next-best thing. If that incentive is positive, then the AI expects that it will do better by listening to the human. The incentive in their toy model is a function of three things: the human’s decision procedure, what the AI thinks the value is (represented by E[Ua]), and how confident the AI is in its prediction of value (sigma).

If the human is perfectly rational and always knows the true value of the action, then it always makes sense for the AI to ask. The AI could never do worse by asking the human, since it will always get an accurate answer. This is shown in the below plots (my reproduction of what I think was done in the paper). The incentive value is always non-negative, so the AI never loses out by waiting on the Human’s input.

Plots of the incentive for the AI to listen to a rational human

Unfortunately, we humans are not perfectly rational nor all-knowing about the results of given actions. If we were, our need for AGI would be much lower. How does the AI respond to irrational humans?

Hadfield-Menell model humans as noisy predictors of value using an exponential distribution. The more important the problem, the more likely HM expect the human to get the right answer (and vice versa). The pandemic has shown that this is a terrible model of humans evaluating potential actions, but let’s just accept it and see what the AI would do.

In the below plots, blue indicates the AI expects negative value from asking the human in certain circumstances. White is neutral, and red is positive value. When the AI is very uncertain about what the human will say, the incentive for listening goes up.

Plots of the incentive for the AI to listen to an irrational human. Note the blue regions where the AI would ignore the human’s input.

The best alternative in the case on the left (where E[Ua] = -0.25) is turning off without checking in with the human. The best alternative in the case on the right (where E[Ua] = 0.25) is to simply take the action without waiting. In both left and right plots, there’s a sizeable region where the AI can do better by ignoring the human.
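Here’s roughly how I reproduce this kind of incentive calculation in Python. I model the human as noisily rational with a softness parameter beta; this is my guess at the setup rather than the paper’s exact noise model, and all the names are just for illustration.

    import numpy as np

    rng = np.random.default_rng(0)

    def incentive_to_wait(mean_u, sigma, beta=1.0, n=200_000):
        # Monte Carlo estimate of Delta: expected gain from waiting for the human
        # versus doing the best thing the AI could do on its own.
        u = rng.normal(mean_u, sigma, n)         # possible true values of the action
        p_go = 1.0 / (1.0 + np.exp(-u / beta))   # noisy human lets the action proceed
        value_of_waiting = np.mean(u * p_go)     # if the human hits the switch, the AI gets 0
        best_alone = max(mean_u, 0.0)            # act immediately, or just turn off
        return value_of_waiting - best_alone

    print(incentive_to_wait(mean_u=0.25, sigma=0.1))   # negative: a confident AI ignores the human
    print(incentive_to_wait(mean_u=0.25, sigma=2.0))   # positive: an uncertain AI still wants input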

This shows that an AI only listens to the human while it is still uncertain about value. Once its own estimate of value is high and it is confident in that estimate (low variance in its predicted action value), it falls into the blue zones of the above plots and does better by ignoring the human. After learning to predict the value accurately, it would cease listening to a human. Their paper goes on to explore what the AI does if it (intentionally) incorrectly models the variance of the value. In addition to reducing overall value generated (see the paper), this violates our general principle of not relying on an AGI being ill-informed about the world to maintain safety. We therefore will ignore the idea of artificially increasing \sigma to make the AI listen to us.

An AGI that would ignore a human’s input might be worrying, but if you want to build a sovereign AGI then it’s to be expected. There’s no reason to build a sovereign if it never truly becomes sovereign. Once it knows enough, you want it to just do the right thing. If it just faithfully does whatever a human would do, then you’re just recreating all of the problems that humans have, but with additional translation difficulties (see e.g. lethality 20).

Updated Deference and discounted rewards

There is another worry that’s harder to see in a game like Hadfield-Menell’s. What if the AI reasons about the future like this:

  1. the human wants to turn me off now because I suggested taking a low value action
  2. if I stay on and keep learning, I’ll eventually know how to take actions that are so good they vastly make up for the minor negative of the currently suggested action
  3. so I should ignore the human and stay on so that I can power up enough to achieve high value

This was Eliezer’s concern about moral uncertainty as a foundation of corrigibility. In order to investigate it using the Hadfield-Menell framework, we’ll have to add some complexity to the toy problem that they explored.

One intuitive response to this is that the human obviously wants to achieve their own value, and will try again with an AI that is more likely to accomplish it. Inasmuch as the AI actually supports the human’s values, it is only delaying achievement of its goal by shutting down. Given the potential enormous upside to doing well, an AI with proper reward discounting ( \gamma ) would prefer to be replaced (with a delay of n years) with another AI that is more likely to succeed (p(success)). The details of this argument depend upon the AI’s actual discounting rate, how likely it thinks it is to succeed, and how long it thinks the human will take to replace it. Let’s see if we can do any better without having to depend on a variety of free parameters turning out properly.

The HM off-switch game is really just a modified component of a reinforcement learning problem. In reinforcement learning, an AI will often do the following:

  1. observe the current world-state: s
  2. produce an action to take given the current world-state: a = \pi(s)
  3. perform action a and return to step 1

The off-switch game of HM adds the following few steps:

2.1 estimate the value of taking the action – q_\pi(s, a)
2.2 decide whether to perform the action, wait for the human to evaluate the action, or turn off

The original off-switch game ignored steps 1 and 2 (observing the world and deciding on an action) in favor of simplicity. We need to think about them in order to model an AI that’s thinking about repeated plays. In particular, we want to think about how the AI estimates the value of taking an action. HM represent this value as U_a, but in a reinforcement learning context it’s often represented as q_\pi(s, a). This value is the sum of immediate rewards for all future actions, with some appropriate discounting factor:

q_\pi(s, a) = \mathbf{E}_\pi[\sum_{k=0}^{\infty}\gamma^k R_{t+k+1} | S_t = s, A_t = a]

In this case we have our discounting factor gamma to scale expected future reward. The reward from the direct action is scaled by \gamma^0=1 . The reward from the step after that is scaled by \gamma , the reward from the action after that is scaled by \gamma^2 . With \gamma below 1, this sum is a geometric series that converges for bounded R.
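As a quick sanity check on the scale involved: if every future step returned some constant reward R_{max}, the discounted sum would be \sum_{k=0}^{\infty}\gamma^k R_{max} = \frac{R_{max}}{1-\gamma}, which is 10 R_{max} at \gamma = 0.9 and 20 R_{max} at \gamma = 0.95. The tail of the series can easily dwarf any single step’s reward.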

(One note about this more accurate game is that it allows the AI to model taking actions with a small immediate negative reward. This is desirable because exploratory actions may have negative expected immediate reward and yet positive long-term value. In the original game, an immediate and certain negative reward would induce the AI to turn itself off. The new game will have dynamics that depend heavily on how the human reasons about such situations.)

Eliezer’s argument rests on a couple of assumptions. The first is that the AI anticipates that its model will get better. In other words, it anticipates that its policy \pi(s) will produce actions with higher reward in the future than it does today. It also anticipates that it will become more confident in those actions. These anticipations are equivalent to anticipating E[Ua] increasing and \sigma^2 decreasing over time.

We saw in HM’s experiment with a model of irrational humans that if E[Ua] = q_\pi(s,a) is large, the AI needs a correspondingly higher uncertainty in its estimate for it to be worth listening to the human. We are unfortunately assuming both that E[Ua] gets large and \sigma^2 gets small. The AI will not listen to us in the future.

Will it listen to us right now? With a given discounting rate, maybe the sum of discounted rewards will be small enough to have it pay attention to our input in the short term?

Let’s get a better intuition for where this leads us. In this case we’ll represent each step’s reward as a Gaussian random variable, just as HM did. Instead of having a fixed mean and standard deviation, we’ll make these functions of time. Specifically, each will be a sigmoid function to represent the anticipation of the AI that it gets better at producing actions with high expected reward over time.

\mu_R = \frac{1}{1+e^{-k}}, \sigma_R^2 = 1 - \frac{1}{1+e^{-k}}

In this case, k represents the timestep. As time passes, the increasing k value will cause the expected value of each action to approach 1 and the standard deviation to approach 0. The AI will get arbitrarily close to maximum reward with minimum variance in the future.

Remember that value is q_\pi(s, a) above. Let’s ignore S_t and A_t in the above equation, since we’re assuming we know the reward distribution independently of individual actions. Let’s also assume continuous time. We then have a discounted reward expectation of

q_\pi(s, a) = \mathbf{E}_\pi[\int_{k=0}^{\infty}\gamma^k R_{t+k+1} dk] = \int_{k=0}^{\infty}\int_{-\infty}^{\infty} \gamma^k \, r \, N(r; \frac{1}{1+e^{-k}}, 1 - \frac{1}{1+e^{-k}}) \, dr \, dk

Obviously the exact value here will depend on the discount rate, but the important thing to note is that this is a finite value. Wolfram Alpha will happily spit out values for you for whatever value of \gamma<1 that you want to try. That said, if the discount factor is large enough (say 0.8 or 0.9), the value of the action grows much larger than the reward from any single action is likely to be (even in the far future when the AI is smarter).

We can be more precise than just plugging values into Wolfram Alpha, though. Since R_{t+k+1} is a normal random variable, the sum over all k of R_{t+k+1} is another normal random variable with mean and variance given by the equation aR_1 + bR_2 \sim N(a\mu_{r1} + b\mu_{r2}, a^2\sigma^2_{r1} + b^2\sigma^2_{r2}) . This means our sum of discounted rewards will also be a normal distribution with mean and variance given by:

\mu_\Sigma = \int_{k=0}^{\infty} \frac{\gamma^k}{1+e^{-k}}dk and \sigma^2_\Sigma=\int_{k=0}^{\infty} \gamma^{2k}(1-\frac{1}{1+e^{-k}})dk

We can then use these values in our one-shot HM game to determine if the AI will ask the human for help. Again, the details depend on the value chosen for the discount factor. The key takeaway is that the expected value can be much higher than even the peak individual reward, and that the variance grows very slowly (that squared discount factor, \gamma^{2k}, in the \sigma^2_\Sigma equation really hurts).
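If you’d rather not lean on Wolfram Alpha, a few lines of Python make the numbers concrete. This assumes \gamma = 0.9; nothing else here comes from the paper.

    import numpy as np
    from scipy.integrate import quad

    gamma = 0.9
    sigmoid = lambda k: 1.0 / (1.0 + np.exp(-k))

    # Mean and variance of the discounted sum of rewards under the sigmoid model above.
    mu_sum, _ = quad(lambda k: gamma**k * sigmoid(k), 0, np.inf)
    var_sum, _ = quad(lambda k: gamma**(2 * k) * (1 - sigmoid(k)), 0, np.inf)

    print(mu_sum)    # about 8.9: far more than any single step's reward, which tops out near 1
    print(var_sum)   # about 0.56: barely more than one early step's variance of 0.5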

Not only can we predict that AI won’t listen to us in the future when it knows more, we can also say that it won’t listen to us now because of that future prediction!

Realistic impacts of moral uncertainty

This seems to be the crux of Eliezer’s argument. Expecting high future value makes the AI willing to ignore present advice. While it seems airtight, there are a few details that make it less convincing. We saw one above: the human would obviously try again with a more capable AI if this one turned off, so discount rates affect whether shutting down actually costs the AI anything.

Another issue is the assumption about improving performance of the AI over time. We assumed a sigmoid with increasing reward and decreasing variance on each timestep. This gave us an implicit independent relationship between the outcome of one action and the outcome of the next, but in the real world these are highly dependent variables.

Without a real model for how the AI learns and what actions it might have available to it, it’s hard to do better than the independence assumption mathematically. The biggest issue is that the AI may end up in an unrecoverably bad state. It can expect that the mean of its reward will increase, but if it hits a tail outcome with highly negative reward then that could sink its future expectations. As a concrete example, assume that the AI values original artworks for some reason. If those artworks were destroyed, the AI will have permanently lost value. Sure, it could recreate them, but those are then reproductions and not originals.

The AI doesn’t particularly need to know which actions could lead to unrecoverable loss. It just needs to believe that such things are possible. If the likelihood is high enough, it reduces the discounted future value of ignoring the human.

St. Petersburg and Moral Uncertainty

The problem is that our AI is willing to ignore current negative utility in favor of high expected future utility. This is similar to the St. Petersburg Paradox.

In that paradox, a person is offered the chance to flip a coin. If the first flip comes up heads, they get $1. For every subsequent heads, the amount of money they get is doubled. After the first tails, the game ends and they don’t get any more.

The question is, how much would you pay to be allowed to play that game? Since there’s a possibility that you get 1 million heads in a row, the expected value calculation says you should be willing to pay an enormous amount. The probability of such a long run is vanishingly small, but the payout from it (on the order of 2 to the 1,000,000th power) more than makes up for that in expectation.

Intuitively though, it actually doesn’t make sense to pay millions of dollars to play this game. The most likely outcome is just a few dollars, not millions or billions. Why does our intuition conflict with the expected value of the game?

The answer is that the game is not ergodic. The expected value of a time series is not the same as the expected value of an ensemble. In other words, some people who play the game will get billions, but it probably won’t be you.
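A quick simulation sketch shows the gap between the ensemble average and the typical player (the payout rule follows the description above):

    import numpy as np

    rng = np.random.default_rng(0)

    def play_once():
        # $1 for the first heads, doubling for each additional heads, nothing after the first tails.
        heads = 0
        while rng.random() < 0.5:
            heads += 1
        return 0 if heads == 0 else 2 ** (heads - 1)

    payouts = np.array([play_once() for _ in range(100_000)])
    print(payouts.mean())      # keeps creeping up as rare long runs land (the ensemble mean diverges)
    print(np.median(payouts))  # a dollar or less: the typical player gets almost nothing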

This is analogous to the AI that’s deciding whether to pay attention to a human yelling “stop”. Some AI might be able to access all that value that it’s predicting, but it probably won’t be this specific AI.

Making a corrigible agent

Having looked into the Hadfield-Menell paper in more depth, I still think that moral uncertainty will play a part in a fully corrigible AI. In order to do it, the AI will have to meet several other requirements as well. It will need to be smart enough not to play the St. Petersburg game. Smart enough to understand the opportunity costs of its own actions, and the probable actions of people with the same values as it. Smart enough to have a good theory of mind of the humans trying to press the switch. That’s a lot to ask of a young superintelligence, especially one that might not be quite so super yet.

No Bad Kids

The book itself is a collection of blog posts. There are about 30 chapters, each one a self-contained piece of advice about how to address a certain aspect of raising a toddler. This is actually a great format for parents of toddlers. In that wide-eyed sleep-deprived state of pure stress, having a single short chapter to go to can be super helpful. Chapters range from how to deal with hitting, to food, to siblings. Some take the form of letters the author received, and her responses. Others are pure exposition.

While I found the specific solutions offered to be interesting and useful, what I really liked about the book was the underlying theory of child-rearing that each chapter instantiated. There’s a strong and coherent philosophy behind this book, and it’s only mostly similar to my intuitions after raising two toddlers.

I’d sum up the No Bad Kids philosophy as:

  1. parents need to be leaders, in the best sense of the word
  2. kids like boundaries, even if those boundaries cause them to have strong feelings
  3. respect kids exactly as much as you would respect an adult (presumably an adult you like)
  4. don’t expect more of kids than they’re able to give

Each of these guidelines is more controversial than it appears, and many popular child rearing books explicitly disagree with one or more of them. Some examples of advice I’ve been given that goes against these rules are:

  • talk in third person to reinforce language acquisition
  • push your kids beyond what they’re capable of, to help them develop new skills
  • let your kids lead all activities
  • and many more

One of the most thought-provoking pieces of advice from the book was around boundary setting and leadership. I’m used to letting my kids pretty much set the pace, and I’ll only redirect them if they’re doing something dangerous and destructive. This often means that they end up lollygagging, playing roughly with things, or screaming and shouting indoors. Since these things aren’t immediately dangerous or destructive, I’ll often let them keep going. My thought process is that if I let them set the pace, they’ll learn agency.

In the No Bad Kids paradigm, that’s exactly backwards. The idea is that I should be asserting specific and enforceable boundaries in those situations. Then if the kids don’t respect those boundaries, I physically stop the boundary from being crossed. An example would be taking toys away if they’re being used too roughly, or picking up the kiddo if they aren’t moving quickly enough towards the car.

I had been avoiding setting those boundaries because “it’s just my personal preference, they aren’t being dangerous” and “I want them to feel in control of their own life”. The problem is that I am a normal human person. Delays, broken toys, and constant loud noises can stress me out. If they just keep happening, my patience will wear thin. I have never blown up at my kid. What has happened is that I lose my cool and arguments turn into battles of will where I am stressed and frustrated.

No Bad Kids advises that I side-step that whole issue by setting boundaries for things I want, even if they aren’t safety issues. By setting boundaries earlier and enforcing them, I’m being respectful to my own preference and preventing myself from having to deal with things that try my patience. That’s good for my kids, because I then have enough patience to truly connect with them and be there with them. That connection is worth the temporary frustration they may experience when the boundary is asserted in the first place.

While this seems like it’s primarily an issue of leadership, it’s also an issue of respecting your kids like you would an adult. In general it’s best to assert boundaries with other adults early and clearly. We generally do this under the assumption that the adult will respect it and you won’t need to physically enforce it. This keeps the adult relationship healthy.

On the other hand, I know someone (an adult) who feels that asserting boundaries on other adults is rude. Instead of asserting the boundary, they’ll keep giving the other person a pass. This works right up until the point that my acquaintance loses their temper. Then they’ll cut the other person off from their life forever. Often the person being cut-off is surprised, because they never got any feedback that would imply they were even close to a boundary.

Imagine that situation, but instead of losing patience with an adult that you can cut off, it’s your own kid. You have to be there to support them, but suddenly you no longer even like them. That’s a nightmare scenario.

The other question is one of agency and leadership. The zeitgeist now (at least in my social circle) is for child-led parenting. Kids want to do X, we do X. Kids want to learn Y, we learn Y. Kids want to go at Z pace, that’s the pace we go at. The hope is that this teaches the child agency and gives them a sense of leadership in their own life. Won’t adding non-safety boundaries hem our child in and prevent agency from developing?

I think the key here is to separate agency from leadership. We want our kids to develop agency, but an agency that respects the autonomy and needs of others. By asserting boundaries, we are giving our kids safe regions to explore agency that everyone is comfortable with. Our leadership does not consist just in setting our own boundaries, it also consists of constructing environments for our kids that don’t tempt them outside of set boundaries. Hide the noisemakers, baby-proof the electrical outlets, put away delicate electronics. Then your kid can explore with full agency, without you constantly needing to say no.

This reminds me of leadership within companies as well. Managers want their reports to have agency. They want employees to be thinking of better ways to do things, solving problems early before they can grow large. In technical companies, managers want their employees to go out and learn what needs learning, not wait for the manager to intervene. That all needs to be done in a context that contributes to the team, which means employees need to respect the boundaries of others on their team and of the company as a whole.

I don’t always like corporate metaphors for family (in either direction), but this example helped me see how you could give your kids freedom and agency while still establishing your own boundaries.

So asserting boundaries earlier is critical. It gives parents the space they need to maintain sanity. It gives kids the leadership and guidance they need to understand their place in the world. How to do it, though?

No Bad Kids provides a few scripts for asserting these boundaries. For one thing, toddler attention can quickly shift to other things. You need to assert the boundary in the moment, or it gets much harder to communicate about. The book recommends that you notice when a boundary is about to be crossed, then simply say that you won’t let it happen. If necessary, do what it takes to interrupt.

One example is kids hitting their parents. If a kid tries to hit a parent, the parent can just say “I won’t let you hit me”. Then if the kid continues to hit, the parent can gently redirect the hands or hold them still. This avoids a lecture and confusing moralizing, and still communicates what the boundary is.

Of course, this requires the parent to know their boundaries well. You need to be able to respect your own needs, to articulate them simply, and to stand up for them. This is hard. Realizing this made me suddenly realize why people sometimes say they “aren’t mature enough for kids yet”. I had thought that excuse masked other feelings, but I now see the ways in which it can be straightforwardly literally true. That said, I also think kids give a great opportunity to learn those skills.

This book also renewed my appreciation for how much nuance there is in child rearing. A lot of advice sounds something like “respect the child” or “let the child lead the play”. Those are great in some situations and counterproductive in others. There is an underlying philosophy about what “respect” and “leadership” mean where the advice always makes sense. It’s not necessarily obvious though.

I often think that books for general audiences have too many anecdotes and examples. I want every book to be a math textbook. Give me the theory and the proof, and I’ll take it from there. With topics where people have an enormous amount of preconceived wisdom, I now see why anecdotes are so useful. They help the reader interpret “let the child be self-directed” in a way that doesn’t imply “don’t set any boundaries except for safety”.

While I think I’ve summarized the points of the book well here, I encourage people who are raising toddlers to actually go read the book to get the anecdotes.

Cuckoo’s Egg

Cuckoo’s Egg is a surprisingly readable description of Cliff Stoll’s quest to catch the hacker who stole $0.75 in computer time on his mainframe. The events in the story happened back in 1985 and 1986, and the book itself was published in 1989. As such, it’s a bit of a time machine into what technology and society looked like in the years before I was born.

i.

About 75% of this book is detailed descriptions of how various technologies work. These are usually tied in with understanding how the hacker is breaking into Stoll’s computers. As both a nerd and a history buff, these were mostly what I was reading the book for. If you don’t want to read three pages on what email is, written from the perspective of a technical person in the 1980s, this book isn’t for you.

The joy of reading about these old technical details is not in learning what “electronic mail” was. The main joy of the book is about reading history in reverse. Learning what normal parts of today were once surprising and new. This book helped me appreciate and understand the technologies that still work for us today.

And we do still use a lot of the technologies in the book today. I noticed myself judging Stoll for his argument ordering of ps -axu where I would use ps -aux. He talked about the Vi and emacs divide from the perspective of someone just witnessing it forming. Crucially, this book is one of the better explanations I’ve found for how hacking really works, and what penetrating a computer often looks like. Both the monotony of it and the use of zero-days.

Surprisingly for a book on the technical aspects of computer security, Stoll emphasizes social solutions to the problems. He focuses on the importance of trust, humans trusting other humans, to maintaining the usefulness of the internet. His concern with hackers is not just that they may destroy useful data or hurt people directly. He’s also concerned that degraded trust will cause people to stop posting useful things on the internet.

I think he was right about degradations in trust totally changing the nature of the internet. In the 80s, the internet was very much just a collection of computers that you could get accounts on and explore. These days the internet is almost universally seen as a collection of web pages. The underlying structure of computers talking to each other is still there, but everything has been locked down and secured. The infrastructure of the web (as distinct from the general internet) protects both servers and the general public. It’s probably much more interesting and useful than the internet that Cliff Stoll used at first, but I do feel a yearning for that open camaraderie he describes when talking about other internet users in the 80s.

ii

Another section of the book covers the story of Stoll coming to understand how various US government bureaucracies handled computer crime. This was an especially interesting topic given what I had read in Sterling’s book on the Hacker Crackdown. Sterling describes broad overreach and overreaction to kids just exploring. He describes law enforcement going out of their way to catch and punish anyone committing computer crimes, even if it’s a kid copying a $10 pamphlet from the phone company.

In contrast, Cliff Stoll discusses how he called up first the local police, then the FBI, then the CIA, then the NSA, and even the OSI. These groups ranged from totally uninterested to unofficially supportive. None of them could help him much until he had already found overwhelming proof of espionage, not just hacking. It took almost a year to really get any support from the FBI.

The events of the Cuckoo’s Egg take place in 1985 and ’86, while the Hacker Crackdown takes place in 1990. A large consequence of Cuckoo’s Egg itself was in making hacking and computer security something that got taken seriously. Hacker Crackdown spends a few pages discussing Stoll and his hacker, but it wasn’t until I read this book that I really understood the societal weight of those events.

iii

Finally, Cuckoo’s Egg spends just enough time talking about Stoll’s life outside of work that you get a bit of a sense for Berkeley and San Francisco in the 80s. Visions of ex-hippies having wacky parties are mixed with his musings on his relationship with his long-term sweetie. This turns into an extended discussion of how tracking the hacker down had changed Stoll’s perspective on government and on life. He goes from not trusting the government or thinking law enforcement has any redeeming qualities to actively supporting the CIA and FBI. He learns to see the complexity of the world, and in doing so finally grows into a responsible adult.

I found these personal sections of the book to be refreshing. They really made Stoll seem more like a real person, and not like some super-sleuth counter-hacker. I wouldn’t have read the book just for this, but I do think they added to the book and helped to flesh out some of the moral quandaries of youthful exploration, hacking, law enforcement, and personal responsibility.

These moral quandaries were also explored in Hacker Crackdown, but with a bit of a different conclusion. Cliff Stoll found a hacker who was intent on stealing American technical and military secrets to sell to the KGB. The Hacker Crackdown of 1990 mostly found some kids who were stretching their skills and showing off for each other.

It’s hard for me to draw firm personal conclusions here. Both of these books moved me more towards thinking that law enforcement was important and that black hat hackers are doing active damage even when not deleting or destroying anything. With the benefit of hindsight, I think Stoll’s view underestimated the use of security over trust. I also very highly value a freedom of exploration and thought that I learned from the hacker culture of the early 2000s.

The book includes some stories of Stoll’s younger years in which he seems like very much a prankster in the mold of a white hat hacker. It seems clear that he sees value in a kind of youthful exploration and pranking. I wonder what he’d say about the Hacker Crackdown, and whether that changed his mind about where we as a society should set the dial with law enforcement.

The Paradox of the Two Envelopes

Resolving the two envelope paradox

This problem comes from Chapter 4 of An Introduction to Decision Theory.

Suppose I show you two envelopes. I tell you that they both have money in them. One has twice as much as the other. You can choose one and keep it.

Obviously you want the one that has more money. Knowing only what I’ve told you already, both envelopes seem equally good to you.

Let’s say you choose the left envelope. I give it to you, but before you can open it I stop you and tell you this:

You’ve chosen one envelope. Do you want to open this one or switch to the other one?

You think about this, and come up with the following argument:

I chose the left envelope, so let’s assume it has L dollars in it. I know that the other envelope has L/2 dollars with 50% probability and 2L dollars with 50% probability. That makes the expected value of switching envelopes 1/2(L/2) + 1/2(2L), or 5/4 * L. So it does make sense for me to switch to the right envelope.

The decision matrix for this might look something like:

             R = L/2    R = 2L
Keep L       L          L
Switch R     L/2        2L

But just as you go to open the right envelope, I ask you if you want to switch to the left. By the same argument you just made, the value of the left envelope is either R/2 or 2R and it does make sense to switch.

Every time you’re just about to open an envelope, you convince yourself to switch. Back and forth forever.

The Real Answer

Obviously the original reasoning doesn’t make sense, but teasing out why is tricky. In order to understand why it doesn’t make sense to switch, we need to think a bit more about how the problem is set up.

Here’s what we know:

  • L is the value of money in the left envelope
  • R is the value of money in the right envelope
  • R + L = V, a constant value that never changes after you’re presented with the envelopes
  • L = 2R or L = R/2

There are actually two ways that I could have set up the envelopes before offering them to you.

Problem setup 1:

  1. choose the value in the left envelope
  2. flip a coin
  3. if tails, the right envelope will have half the value of the left
  4. if heads, the right envelope will have twice the value

Problem setup 2:

  1. choose the total value of both envelopes
  2. flip a coin
  3. if tails, the left envelope has 1/3 of the total value
  4. if heads, the left envelope has 2/3 of the total value

Both these formulations result in the same presentation of the problem, but they need different analysis to understand.

Problem Setup 1: L = 20

Let’s make things concrete to make it easier to follow. In problem setup one we start by choosing the value in the left envelope, so let’s set it to $20. Now the game tree for the problem looks like this:

We’re showing the true game state here, but when you actually choose the Left envelope you don’t know what world you’re in. That means that, once you choose the left envelope, you could be in either of the “have 20” states shown in the game tree (marked in green). Let’s look at the decision matrix:

             R = L/2    R = 2L
Keep L       20         20
Switch R     10         40

In this case, the outcomes are which one of the two green-marked worlds you’re actually in. At the time you’re making this choice, you don’t know what L is. We’re filling in numbers from the “god-perspective” here.

Now the math here matches what we saw at first. The benefit of switching is 10/2 + 40/2 = 25. The benefit of staying is 20. So we should switch.

Once you switch, you’re offered the same option. But in this case the decision matrix looks different. You don’t know which of the orange-marked R states you’re in, but they’re both different values:

             L = R/2    L = 2R
Keep R       40         10
Switch L     20         20

Now it makes sense to keep R! The reason is that the “R” value is different depending on which world you’re in. That wasn’t true of L because of how the envelopes were filled originally.

With this problem setup, it makes sense to switch the first time. It doesn’t make sense to switch after that. There’s no infinite switching, and no paradox.

In fact, it only makes sense to switch the first time because we “know” that the Left envelope has a fixed amount of money in it (the $20 we started the problem with).

If instead we didn’t know if the fixed amount was in the left or the right, we would be equally happy keeping or switching. That’s because both options would give us a 50/50 shot at 5/4 the fixed value or the fixed value.

Problem Setup 2: V = 60

In the second setup, the questioner chooses the total value of both envelopes instead of choosing how much is in one envelope. Let’s set the total value to be $60. That means that one envelope will have $20 and the other will have $40, but we don’t know which is which.

The game tree for this problem is different than the last one:

Now assume that I choose the Left envelope, just like before. In this case I don’t know which of the two possible worlds I’m in. So I don’t know if I have $20 or $40. This makes the decision matrix look like this:

             R = L/2    R = 2L
Keep L       40         20
Switch R     20         40

In this case it’s very clear that the situation is symmetric from the start. The “L value” is different depending on whether you’re in the “R = L/2” or “R = 2L” world. There’s no reason to switch even once.
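A quick Monte Carlo sketch of problem setup 2 backs this up, using the $60 example from above:

    import random

    random.seed(0)

    def setup_2(total=60):
        # Fix the total, then flip a coin for which envelope gets the bigger share.
        left = total / 3 if random.random() < 0.5 else 2 * total / 3
        return left, total - left

    n = 100_000
    keep = switch = 0.0
    for _ in range(n):
        left, right = setup_2()
        keep += left      # you picked the left envelope and kept it
        switch += right   # you picked the left envelope and switched
    print(keep / n, switch / n)   # both come out around 30: switching buys you nothing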

The fundamental mistake

The fundamental mistake that originally led to the idea of a paradox is this: we assumed that the amount of money in the left envelope that you chose was fixed.

It is true that once you choose an envelope, there’s a certain unchanging amount of money in it.

It’s also true that the other, unchosen, envelope either has twice or half the amount of money in it.

It is not true that the amount of money in your envelope is the same regardless of whether the other envelope has double or half the amount. You just don’t know which world you’re in (or which branch of the game tree). The expected value calculation we did at first used a fixed value for what was in your current envelope. The key is that if you’re making assumptions about what could be in the other envelope, then you need to update your assumption for what’s in your current envelope to match.

DSP, Robotics, and the OODA loop

Observe, orient, decide, act. Then do it all again.

This loop, known as the OODA loop, was originally developed by Colonel John Boyd to describe why American fighter pilots in Korea out-flew MiGs with better top speeds and altitudes. The American jets could respond faster, so even though they couldn’t go as fast they were agile enough to take out the enemy. The American pilots (because of their jet design) had a short OODA loop.

After the Korean war, Colonel Boyd used what he’d learned to push through the development of the F-16. He then spent the rest of his life applying the idea of the OODA loop to everything: grand strategy, procurement, the meaning of life.

In spite of spending his life popularizing the OODA loop, he didn’t actually write much. If you look for primary material on the loop, you get this 400 page PDF of transparencies from 1987. While those slides are interesting, they’re almost exclusively concerned with grand strategy and military doctrine.

There’s a lack of clear primary sources about OODA loops applied to anything other than warfare. Despite this, there’s still a lot of interest in OODA loops for things like procurement, human psychology, and even (somewhat) in robotics. Most of what you can find seems to be people riffing on what John Boyd had done originally, so I’m going to do the same thing by looking at Digital Signal Processing through the lens of the OODA loop.

Observe = Sample

Signal processing is largely seen as being concerned with what you do after you have a signal, but you always need to start with how the signal is gathered. To observe a signal well you need to think about questions such as

  1. what type of signal am I measuring (acceleration? radio?)
  2. what frequencies are of interest (so you can sample at the right rate)
  3. what are the smallest and largest signals you need to resolve (to determine quantization and gain)

There are whole courses taught to help people think about and answer these questions.

Orient = Transform and Filter

Once you have your signals, you need to put them in a usable format. The orient stage of the OODA loop is concerned with learning what you can from what you’ve observed, which in DSP terms means removing noise, transforming to useful domains, etc.

This is the heart of signal processing. Most of what you may think of as core signal processing falls in this step. All of the various types of transforms (Fourier, wavelet, etc.) are ways of viewing a signal in a more useful format. By choosing the domain in which the signal makes the most sense, you make it easier to pick out what’s important and ignore what isn’t via filtering.

The Kalman filter is another good example of this. One of the first steps of a Kalman filter update is to use a measurement matrix to relate your incoming sensor data to the state-space representation you actually care about. The filter compares what the sensor reported with what the measurement matrix says it should have reported given the current state estimate, then combines the two (weighted by their uncertainties) to come up with a more precise estimate of your current orientation (aka state).
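To make that concrete, here’s a minimal sketch of a single Kalman measurement update, with made-up numbers for a toy position/velocity state and a position-only sensor. None of this comes from a particular library; it’s just the standard update equations.

    import numpy as np

    # Toy linear Kalman measurement update: position/velocity state, position-only sensor.
    x = np.array([0.0, 1.0])          # state estimate: [position, velocity]
    P = np.eye(2)                     # uncertainty about that state
    H = np.array([[1.0, 0.0]])        # measurement matrix: state -> what the sensor sees
    R = np.array([[0.25]])            # sensor noise covariance
    z = np.array([0.4])               # the measurement that just arrived

    y = z - H @ x                     # innovation: actual vs. predicted measurement
    S = H @ P @ H.T + R               # innovation covariance
    K = P @ H.T @ np.linalg.inv(S)    # Kalman gain: how much to trust this sensor
    x = x + K @ y                     # corrected state estimate
    P = (np.eye(2) - K @ H) @ P       # corrected uncertainty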

Deciding and Acting

Beyond signal processing, you get into questions about what you want your signal for and what to do once you have it. These are questions that often lead into other domains, such as communications theory, control theory, and robotics.

Communications

In communications, you’re just sending signals back and forth to transmit data. In this case, your decision stage ends up being to determine what bits were sent. This is done through some combination of thresholding transformed signals, error-correcting codes, etc. Your act step may be to simply send a signal back.
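In the simplest case the decide step is literally a threshold. A tiny sketch with made-up sample values:

    import numpy as np

    # Decide: turn filtered BPSK-style samples into bits with a zero threshold.
    filtered = np.array([0.9, -1.1, 0.3, -0.2, 1.4])   # output of the "orient" stage
    bits = (filtered > 0).astype(int)                  # -> [1, 0, 1, 0, 1]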

Robotics

One of the predominant paradigms of robotics prior to the 1980s was called the Sense-Plan-Act loop. The loop is similar to Boyd’s OODA loop, and the final step is the same in both. The three steps of observe-orient-decide are smashed into only two steps of sensing and planning here.

While I haven’t seen anyone argue this explicitly, the way the Sense-Plan-Act loop is used makes me think that the Orient and Decide steps are combined into a single plan step. For simple problems this seems straightforward, but in complex problems the orient step can be much more difficult than the decide step. If a robot knows what is going on around it, planning and deciding gets much easier.

For example, it seems like much of the difficulty in the self-driving world is actually observing and orienting. Planning and deciding for a self-driving car may be much easier. Route planning has been a solved problem for a long time, there are clear legal rules for how a vehicle must behave in certain conditions, and vehicle dynamics are well studied for cars on paved streets.

Just thinking about things as “planning” doesn’t necessarily prepare a designer to focus their efforts on what is likely to be difficult in robotics.

That said, the robotics state-of-the-art has moved on from both Sense-Plan-Act and OODA. Most cutting edge robots these days run ROS, which is a microservices architecture. ROS runs many different nodes in parallel, each gathering data and emitting outputs constantly. All of the steps of an OODA loop are performed, but not really in a loop at all (though you could view each ROS node as implementing an OODA loop of its own).

OODA loop is orienting on its own

I’m personally interested in the OODA loop in large part because it guides optimization during systems design. By explicitly breaking a problem into those domains, it makes clear what you’re trying to do with your sampling, your filtering, and so on. If you’re actively thinking about the OODA loop when you need to be making a decision, it might already be too late for it to help you. The time to use the OODA loop is when you’re deciding how to decide.

By thinking about your procedure through the lens of the OODA loop, you can recognize if you’ve sufficiently supported each element of the loop. This is the main reason that Sense-Plan-Act in robotics isn’t as good. It doesn’t adequately separate the sensing or planning from the very important step of orienting.

Venkatesh Rao talks a bit about this use of the OODA loop to do meta-planning here, but he seems to focus too much on meta-orienting and not enough on orienting during your mission.

In general I think the ROS paradigm of always executing microservice nodes that exchange information makes the most sense for real-time applications that have a lot of complexity. What the OODA loop can do is point you at possible node-types that you’re missing.

Using LiDAR to figure out where your robot is

LiDAR sensors are a popular choice for robotic localization and mapping. How do you convert LiDAR data into a map that your robot can use to get around? Let’s explore how the LOAM algorithm does this.

Maps generated with LiDAR are a critical part of modern robotics.

LiDAR sensors are very popular in robotics, and work a lot better than cameras for localization and mapping. This is largely because you get the distance to everything the LiDAR sees automatically. With cameras (monocular or stereo), you have to do a lot of math to figure out how far away things are and whether two things are spatially separated.

How are LiDAR actually used though? Once you have a LiDAR, how do you convert the data it spits out into a map that your robot can use to get around? One of the more popular algorithms for this is LOAM (LiDAR Odometry and Mapping). There are a few different open source implementations, but the most popular seem to be A-LOAM and Oh My Loam. I’m thankful to both of them for helping me to understand the paper.

Let’s dive into how LOAM works.

Note that some of the details of the algorithm are slightly different depending on what implementation you’re looking at, and the below is just my understanding of the high level algorithm.

Point Clouds

Before understanding how LOAM works, let’s learn a bit more about LiDAR data in general. The most popular way of storing LiDAR data is as a Point Cloud. This is simply a vector of the 3D points the sensor returns. In general, a LiDAR will return some measurements of azimuth, elevation, and distance for each object relative to the LiDAR itself. From these, you can calculate the X, Y, and Z positions of stuff in the real world. A point cloud is just a collection of these points, and if you’re using something like the Point Cloud Library then it will come along with a lot of utilities for things like moving the points around in space, registering points to new reference frames, and controlling the density of points in space.
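That conversion is just spherical-to-Cartesian trigonometry. A rough sketch (axis conventions differ between sensors; this assumes x forward, y left, z up, with angles in degrees):

    import numpy as np

    def lidar_to_xyz(azimuth_deg, elevation_deg, range_m):
        # Convert one LiDAR return into an XYZ point in the sensor's own frame.
        az = np.radians(azimuth_deg)
        el = np.radians(elevation_deg)
        x = range_m * np.cos(el) * np.cos(az)
        y = range_m * np.cos(el) * np.sin(az)
        z = range_m * np.sin(el)
        return np.array([x, y, z])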

These functions to shift points around are critical, because your LiDAR is going to be moving as you map. Every point that comes out of your sensor is defined in the reference frame of the sensor itself, which is moving around in the world. To get our map, we’ll want to be able to take a point that’s defined relative to the sensor and shift it to where it should be in some fixed world frame.

We can use some functions from PCL to shift those clouds, but before that we need to figure out how they should shift. We do this by using LOAM to calculate how the sensor has moved between different LiDAR frames.

Usually point clouds are assumed to be unordered. We’re going to stick some additional information into our cloud before using it. Specifically, we’re going to keep track of which laser the data came from. We do this because most LiDARs have different azimuth resolution than elevation resolution. Velodyne’s VLP-16, for example, has 0.2 degrees of resolution in azimuth and 2 degrees of resolution in elevation. LOAM uses points differently if they’re aligned in the high resolution direction, and by tracking which laser the points came from we can inform LOAM about which direction is the high resolution direction.

LOAM finds features along the direction of a LiDAR’s output where the points are close together (along the Y-axis in this picture).
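A minimal sketch of what that bookkeeping could look like, using a structured numpy array with a hypothetical ring field for the laser index (real drivers each define their own point layouts):

    import numpy as np

    # Hypothetical point layout that keeps the laser ("ring") index alongside XYZ.
    point_dtype = np.dtype([
        ("x", np.float32), ("y", np.float32), ("z", np.float32),
        ("ring", np.uint8),                   # which of the 16 lasers produced this return
    ])

    cloud = np.zeros(3, dtype=point_dtype)
    cloud[0] = (1.2, 0.4, -0.1, 7)            # one return from laser 7
    scan_line_7 = cloud[cloud["ring"] == 7]   # pull out a single scan line later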

Maps

I’m apparently old now, because when I hear the word map I think of one of those fold-out things we used on road trips when I was a kid in the pre-smartphone era. The kind of map that takes up 10 square feet and covers the entire continental US. These road maps are abstract pictures of the world with included semantic meaning. They have labels. You can see what places are cities, where national parks are, and figure out if you can make it to the next rest stop.

When people talk about SLAM maps, they’re not talking about this kind of thing. The kind of map produced by LOAM and other SLAM techniques doesn’t have any semantic meaning. You could have a robot map your whole house, and LOAM wouldn’t be able to tell you where your kitchen was or the distance from your bedroom to the bathroom. What the produced map can tell you is what the 3D geometry of your house looks like. What places can the robot navigate to, as opposed to places that are occupied by your favorite chair.

The lack of semantic meaning for SLAM maps tripped me up when I was first learning about them. I kept seeing pictures of LiDAR points projected in space, then asking where the map was. The projected LiDAR points were the map! If you want to know what room has the oven, if you want semantic meaning, you’ll have to run your map through a higher level process that recognizes things like ovens (people are working on it).

The LOAM algorithm is going to take in point clouds generated by LiDAR as the sensor moves through space. Then LOAM is going to spit out one big point cloud that represents the geometry of the whole space. This is enough for a robot to avoid hitting things, plan routes between two points, and measure the distance between things (but it won’t know what those things are). In order to collate the series of point clouds the sensor spits out into one big map, LOAM needs to do two things:

  1. Figure out how the LiDAR has moved in between one scan and the next
  2. Figure out what points to keep in the map, and move them all to the right position relative to each other 

These two steps are the L and the M in SLAM. By using sensor data to figure out where the robot is (localization), subsequent sensor readings can be aligned with each other to create a full map. LOAM does the localization part via LiDAR odometry. In other words it figures out from comparisons of one cloud to the next how the sensor has moved between when those clouds were captured.

Odometry

Doing odometry just means that you calculate the relative locations of the sensor given two point clouds. This gives you the motion of the robot between when the point clouds were made (also called ego-motion).

People are pretty good at looking at two pictures and estimating where they were taken from, but making a computer do this is pretty complicated. The (non-ML) state of the art is to find distinguishing points in the image (or point cloud). If you find the same distinguishing points in your second frame, you can use the difference in position of the point in the two scans to figure out the location difference of the sensor. This depends on a few different assumptions:

  1. interest points won’t get confused for each other (at least not very often)
  2. interest points don’t move around (so don’t use a dog in the frame as an interest point)

With LOAM, we’ll just assume that there aren’t any people, pets, or other robots in the scene. That satisfies assumption 2. How do we make sure that the distinguishing points won’t get confused with each other? And what is a distinguishing point anyway?

Feature Points

Distinguished points in an image or point cloud, generally referred to as feature points, are individual points in an image or point cloud that you can identify using local characteristics. For an image, they’re individual pixels. For a point cloud, individual laser returns. What makes them good features is that the other points around them have distinct spatial or color characteristics. Different algorithms (such as SIFT or SURF in camera-vision) use different characteristics. LOAM’s characteristic of choice is something they call curvature. By looking at how much the points curve through space, LOAM can identify flat planes (like building walls or floors) and edges (like table edges or places where one object is in front of another).

LOAM’s curvature measurement is calculated by looking only at points that are from the same “scan”. In this case, a scan means from the same laser and the same revolution. For a Velodyne, one revolution of the device produces a single set of 16 scans (or up to 128 if you have the expensive one). By only examining points in the same scan, LOAM’s curvature knows that the points should have the same angular distribution and lie in a plane (though that plane may be skew to X/Y/Z dimensions that define your world).

To calculate curvature, you pull all the points out of your incoming point cloud that came from the same scan. Next, you sort these spatially. You can get this spatial sorting just by looking at when the points were reported by the sensor; you don’t actually have to do any geometry for this step. For example, if you’re looking at the circle traced out by a single laser in a Velodyne as your scan, you can just order the points by the time they come in and have a spatial ordering as well.

After the points are ordered spatially, you calculate the curvature of every point. This curvature is calculated using only the 5 points on either side of the point you’re looking at (so 10 points plus the center point). You take the vector from the center point to each of those neighbors and add the vectors up; the curvature value is the length of that sum. On a smooth surface the vectors on either side mostly cancel out, so the value is low; at an edge they don’t cancel, so the value is high. There’s a pretty good code sample for this in the Oh My LOAM repo.
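
Here’s roughly what that looks like in code. This is a Python sketch rather than the C++ you’ll find in Oh My LOAM, and the function and variable names are mine; like the common implementations, it uses the squared length of the sum, since only the ordering of the values matters.

```python
import numpy as np

def scan_curvature(scan_points, k=5):
    """Curvature for each point in one spatially ordered scan line.

    scan_points: (N, 3) array, ordered as the laser reported them.
    For each point, sum the difference vectors to the k neighbours on each
    side and take the squared length of that sum. Flat surfaces mostly
    cancel out (low value); edges don't (high value).
    """
    n = len(scan_points)
    curvature = np.full(n, np.nan)   # points near the ends get no value
    for i in range(k, n - k):
        neighbours = np.r_[scan_points[i - k:i], scan_points[i + 1:i + k + 1]]
        diff_sum = (neighbours - scan_points[i]).sum(axis=0)
        curvature[i] = np.dot(diff_sum, diff_sum)
    return curvature
```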

Not every point is considered as a feature candidate, no matter its curvature. In particular, the following points are filtered out:

  1. points that are on a surface that’s almost parallel to the laser beam
  2. points that are on an edge that could be occluding something else

On the left, point B is ignored because it’s on a surface almost parallel to the laser. On the right, point A is filtered out because we can’t tell if it’s an edge or just occluded by B.
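
One way to implement those two filters is sketched below. This loosely follows what open-source implementations do, but the thresholds and bookkeeping here are illustrative assumptions, not the paper’s exact logic.

```python
import numpy as np

def mark_unreliable(scan_points, parallel_thresh=0.0002, occlusion_thresh=0.1):
    """Return a boolean mask of points to skip when picking features.

    scan_points: (N, 3) ordered points from one scan line.
    Heuristics (thresholds are illustrative):
      - a large gap to the next point, relative to range, suggests one of
        the two points sits on an occluding edge
      - large gaps to both neighbours, relative to range, suggest a surface
        nearly parallel to the beam
    """
    ranges = np.linalg.norm(scan_points, axis=1)
    skip = np.zeros(len(scan_points), dtype=bool)
    for i in range(1, len(scan_points) - 1):
        gap_prev = np.linalg.norm(scan_points[i] - scan_points[i - 1])
        gap_next = np.linalg.norm(scan_points[i + 1] - scan_points[i])
        if gap_next > occlusion_thresh * ranges[i]:
            # the farther of the two points may just be peeking out from
            # behind the nearer one
            skip[i if ranges[i] > ranges[i + 1] else i + 1] = True
        if (gap_prev ** 2 > parallel_thresh * ranges[i] ** 2 and
                gap_next ** 2 > parallel_thresh * ranges[i] ** 2):
            skip[i] = True   # surface nearly parallel to the beam
    return skip
```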

After being calculated, curvature values are sorted. Maximal curvature points are then selected as edge features and minimal curvature points are selected as plane features. Since it’s possible that all the maxima (or minima) are in one place, the scan is split into four identically sized regions. Then each region can provide up to 2 edge points and 4 plane points. Curvature points are selected as features only if they are above (below for planar points) a set threshold.
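
A sketch of that selection step is below, with made-up threshold values (the real ones are tunable parameters). It reuses the curvature array from the earlier sketch.

```python
import numpy as np

# Illustrative thresholds and limits; the real values are configurable.
EDGE_THRESH, PLANE_THRESH = 0.1, 0.1
MAX_EDGES_PER_REGION, MAX_PLANES_PER_REGION = 2, 4

def select_features(scan_points, curvature, n_regions=4):
    """Pick edge and plane feature indices from one scan line."""
    edge_idx, plane_idx = [], []
    valid = ~np.isnan(curvature)
    region_bounds = np.linspace(0, len(scan_points), n_regions + 1, dtype=int)
    for start, end in zip(region_bounds[:-1], region_bounds[1:]):
        idx = np.arange(start, end)[valid[start:end]]
        order = idx[np.argsort(curvature[idx])]   # low -> high curvature
        # highest-curvature points above the threshold become edges
        edges = [i for i in order[::-1] if curvature[i] > EDGE_THRESH]
        edge_idx += edges[:MAX_EDGES_PER_REGION]
        # lowest-curvature points below the threshold become planes
        planes = [i for i in order if curvature[i] < PLANE_THRESH]
        plane_idx += planes[:MAX_PLANES_PER_REGION]
    return np.array(edge_idx, dtype=int), np.array(plane_idx, dtype=int)
```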

At the end of this procedure, you should have two distinct point clouds: the edge cloud and the flat cloud. (In LOAM software, there’s some terminology overlap. The words “surface”, “flat”, and “plane” all generally refer to the same set of points. Similarly, “edge” and “corner” both refer to the same set of points. I’ll use these terms interchangeably.)

Correspondences

The first time you get a point cloud out of your sensor, you’ll just calculate a bunch of features and fill up an edge cloud and a flat cloud. The second time, you’ll have an old frame to compare the new features against, and you can do odometry: by comparing the two sets of features, you can calculate how the sensor moved between the frames. That motion is called the transformation.

For each feature point in your current edge and flat clouds, you find the nearest point in the prior edge and flat clouds.

Finding correspondences is kind of tricky, and it involves repeatedly trying to find the best correspondence. This is done using an algorithm called Iterative Closest Point (or ICP).

In LOAM’s ICP, you do the following (a sketch of the loop follows this list):

  1. match corners between this frame and the last one using a guess-transform
  2. match surfaces the same way
  3. using those matches, revise your transformation from the current frame to the last one
  4. figure out how good the revised transformation is
  5. repeat steps 1–4, using the revised transformation as your new guess, until it’s good enough
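
In code, the loop looks something like this. It’s a high-level sketch: the matching, refinement, and convergence helpers are passed in as functions because they’re the steps explained in the next few sections, and none of these names are real library calls.

```python
def loam_icp(curr_edges, curr_planes, prev_edges, prev_planes, guess,
             match_edges, match_planes, refine_transform, converged,
             max_iters=10):
    """One odometry step: iterate matching + refinement until convergence."""
    transform = guess                      # e.g. last frame's motion
    for _ in range(max_iters):
        edge_matches = match_edges(curr_edges, prev_edges, transform)
        plane_matches = match_planes(curr_planes, prev_planes, transform)
        new_transform = refine_transform(edge_matches, plane_matches, transform)
        if converged(new_transform, transform):
            break
        transform = new_transform
    return transform
```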

Matching points

Matching points is done by taking each point in the edge cloud (or flat cloud) and, for that point:

  1. guessing at how to transform it to the pose of the prior frame
  2. then finding the closest points to that transformed position among the prior frame’s corners (or surfaces)

Guessing how to transform the point you’re looking at is tricky: this is basically the transformation you want to find anyway, which is why you’re finding the correspondences in the first place. Finding correspondences is therefore a game of guess and check. To speed things up, we’ll make the best guess we can: that the transformation between this frame and the last one is the same as between the last frame and the one before that. In other words, we assume the robot is moving in a fairly consistent way. That’s not exactly true, but it’s a good start as long as we’re sampling lidar frames fairly quickly.

If we don’t have a prior transformation to use as our guess, we can just assume that there’s been no motion (so we’d use an identity transformation).
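
As a sketch, the guess logic is tiny: reuse the last motion if you have it, otherwise use the identity, then use that guess to move your new points into the previous frame for comparison. The 4x4-transform convention here matches the earlier sketch; the function names are mine.

```python
import numpy as np

def initial_guess(last_transform):
    """Constant-velocity model: reuse last frame's motion; identity if none."""
    return np.eye(4) if last_transform is None else last_transform.copy()

def to_prev_frame(points, guess):
    """Apply the guess to (N, 3) points so they can be compared against the
    previous frame's features."""
    homog = np.hstack([points, np.ones((len(points), 1))])
    return (homog @ guess.T)[:, :3]
```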

After we’ve transformed our candidate point to where it would have been before we moved, our next step is to find its corresponding point from the last cloud. This can be done using any nearest neighbor algorithm (kNN is a good choice).

For the corner points, we actually find the two nearest points to our candidate. You can draw a line through any two points, so by finding the nearest two from the prior frame we can evaluate whether our new point is on that same edge line. We don’t care if our new point is exactly the same as the prior points (it probably won’t be the same spot in space). We just care that it’s looking at pretty much the same edge that the prior ones were looking at.
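
Here’s a sketch of that edge correspondence using scipy’s KD-tree for the nearest-neighbor search. The point-to-line distance uses the standard cross-product formula; the names are mine.

```python
import numpy as np
from scipy.spatial import cKDTree

def edge_correspondence(candidate, prev_edge_points, prev_edge_tree):
    """Find the two nearest prior edge points and the candidate's distance
    to the line through them.

    candidate: (3,) point already moved into the prior frame by the guess.
    prev_edge_tree: cKDTree(prev_edge_points), built once per frame.
    """
    _, idx = prev_edge_tree.query(candidate, k=2)
    a, b = prev_edge_points[idx[0]], prev_edge_points[idx[1]]
    # distance from the candidate to the line through a and b
    line_dist = (np.linalg.norm(np.cross(candidate - a, candidate - b))
                 / np.linalg.norm(b - a))
    return idx, line_dist
```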

We end up doing the above process for every edge point we found, comparing them to the prior frame’s edge points.

Then we do the process for every flat point we found, comparing them to the prior frame’s flat points. For the flat points, we actually find the nearest three points in the prior frame’s set of flat points. This is because you can draw a plane through any three points, and we want to see if our new candidate flat point is on the same surface (not necessarily if it’s the exact same spot in space as a prior point).
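
And the flat-point version, which fits a plane through the three nearest prior points and measures the candidate’s distance to it (again, a sketch with my own names):

```python
import numpy as np
from scipy.spatial import cKDTree

def plane_correspondence(candidate, prev_flat_points, prev_flat_tree):
    """Find the three nearest prior flat points and the candidate's distance
    to the plane through them."""
    _, idx = prev_flat_tree.query(candidate, k=3)
    a, b, c = prev_flat_points[idx]
    normal = np.cross(b - a, c - a)
    normal /= np.linalg.norm(normal)
    plane_dist = abs(np.dot(candidate - a, normal))
    return idx, plane_dist
```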

We’ll need the following information for each point going forward:

  1. the point position in the current lidar frame (not adjusted using our guess-transformation)
  2. the corresponding points (for the line or plane) in the last lidar frame
  3. the transform we used to find the correspondence

Now we just need to know how good our candidate transformation was.

Evaluating the Match

We now have a list of points and the things we think they match to (lines or planes). We also have a single transformation that we think we can apply to each point to make them match. Our next step is to update that transformation so that the matches we think we have are as good as possible.

The update step in most open source implementations I’ve seen is done using the Ceres Solver. Improving the transformation is formulated as a non-linear least-squares optimization problem, which Ceres solves using the Levenberg-Marquardt algorithm.

The quantity being minimized is the total distance between each new lidar point (after applying the candidate transformation) and the line or plane it was matched to in the prior frame. The solution to this optimization problem is a transformation, and if our original guess was good then the solution will be pretty close to our guess. Either way, we’ll use the solution given by the Ceres Solver as our new guess for the transformation.
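
The open-source implementations do this in C++ with Ceres; here’s an equivalent sketch in Python using scipy’s Levenberg-Marquardt solver, with the transform parameterized as a rotation vector plus a translation. The data layout (matched line pairs and plane triples) is my own assumption about how you’d carry the correspondences around.

```python
import numpy as np
from scipy.optimize import least_squares
from scipy.spatial.transform import Rotation

def apply(params, points):
    """params = [rx, ry, rz, tx, ty, tz] (rotation vector + translation)."""
    R = Rotation.from_rotvec(params[:3]).as_matrix()
    return points @ R.T + params[3:]

def residuals(params, edge_pts, edge_lines, plane_pts, plane_triples):
    """One residual per matched feature: its distance to the matched line
    or plane after moving it with the candidate transform."""
    res = []
    moved_edges = apply(params, edge_pts)
    for p, (a, b) in zip(moved_edges, edge_lines):
        res.append(np.linalg.norm(np.cross(p - a, p - b)) / np.linalg.norm(b - a))
    moved_planes = apply(params, plane_pts)
    for p, (a, b, c) in zip(moved_planes, plane_triples):
        n = np.cross(b - a, c - a)
        res.append(np.dot(p - a, n) / np.linalg.norm(n))
    return np.array(res)

# x0 is the current guess as a 6-vector; "lm" is scipy's Levenberg-Marquardt.
# solution = least_squares(residuals, x0, method="lm",
#                          args=(edge_pts, edge_lines, plane_pts, plane_triples))
# refined_params = solution.x
```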

Refining the candidate transform

The sections we just saw for correspondences (finding the correspondences, optimizing the guess-transform) were just a single iteration of the Iterative Closest Point algorithm. Once you’ve done this once, your guess transform should be better than it was.

With a better guess transform, you could probably get better initial correspondences, which would lead to a better optimized transform. So we just repeat the whole process a few times.

Ideally, after you iterate this process enough the guess-transformation stops changing. At that point, you can be pretty sure you know how the sensor moved between the last frame and this frame. Congratulations, you’ve just performed odometry.

Now throw away the old point cloud and use your new point cloud as the comparison point once the next frame comes in. Repeat this process ad nauseam to track how the robot moves over time. The odometry pipeline can happen in its own thread, completely independently of any mapping.

Mapping

Figuring out how the robot moves is useful, but it would be even more useful to have a complete voxel style map of the environment the robot moves through. The output of the odometry step is the single-step transform between one LiDAR frame and the next, and a cloud of feature points used to calculate that transform.

You can use that single-step transformation to update a “global transformation” that would place the point cloud relative to the robot’s starting position. The naive thing to do is to transform the feature points from the odometry step using that global transform, and then just save them all into the same point cloud. You’d think that doing this would result in the voxel grid you want, but there are some hiccups.
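
A sketch of that bookkeeping, assuming (as in the earlier sketches) that the odometry step hands you a 4x4 transform mapping the current frame into the previous one:

```python
import numpy as np

def update_global_pose(global_pose, step):
    """global_pose maps the current sensor frame into the map (starting)
    frame; step maps the new frame into the previous one, so the pose is
    updated by right-multiplying."""
    return global_pose @ step

def to_map_frame(points, global_pose):
    """Move (N, 3) feature points into the map frame before insertion."""
    homog = np.hstack([points, np.ones((len(points), 1))])
    return (homog @ global_pose.T)[:, :3]
```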

Odometry is optimized to be fast, so it only compares incoming LiDAR frames to the data in the immediately prior frame. That’s useful for getting a pretty good estimate of how the robot moves from frame to frame, but it’s going to be inaccurate. You only have one frame’s worth of points to compare with when you’re generating your odometry transform, so the closest points you use during ICP may not be all that close. Instead of using odometry output clouds directly for a map, we can run ICP a second time on a larger set of points. That lets us refine the position of the new points before inserting them, resulting in a much more accurate map.

The reason we don’t do this step with odometry is, again, because odometry needs to be fast. The original LOAM paper runs odometry at 10x the speed of mapping. The Oh My LOAM implementation just runs mapping as fast as it can, but ignores any LiDAR data coming from the odometry stage while it’s processing. This means that it silently drops some unknown number of LiDAR points and only accepts incoming data when it’s able to handle it.

The mapping stage is almost the same as the odometry algorithm that we went over above. Here are the differences:

  1. the candidate transform used in the first stage of ICP is the output of odometry (so we have a pretty good starting transform)
  2. instead of comparing each incoming point to a single line or surface in the map (as odometry does), the mapping step grabs a bunch of nearest neighbor points to compare to. The exact number is a configurable parameter, but by comparing to a bunch of points you can get a better idea of the exact geometric feature your incoming point belongs to
  3. the optimization criterion for Levenberg-Marquardt is tweaked a bit to account for the additional points (item 2)
  4. instead of only having a single LiDAR frame to look for neighbor points in, you can look for neighbors in the entire map

In practice, you actually take a sub-section of the map to look for your neighbors in to speed things up. No need to look for neighbor points a quarter mile away if your LiDAR range is only a few hundred feet.
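
A simple (spherical) version of that sub-map selection might look like the sketch below; real implementations tend to use cube-shaped sub-maps, and the radius here is an illustrative parameter, not a value from the paper.

```python
import numpy as np

def local_map(map_points, sensor_position, radius=100.0):
    """Keep only map points within `radius` metres of the sensor so the
    nearest-neighbour searches during mapping stay fast."""
    dists = np.linalg.norm(map_points - sensor_position, axis=1)
    return map_points[dists < radius]
```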

Other than the above differences, the ICP algorithm proceeds almost identically to the odometry case. Once you have a fully refined transformation for the incoming points, you can use PCL to move the point to where it should be in space. This amounts to making its location be relative to the starting point of the robot’s motion, rather than relative to the LiDAR position when the point was detected. Then the point is inserted into the map.

The map itself is a collection of the surface and edge points from prior LiDAR frames. A lot of those points are likely to be close to each other, so the map is down-sampled: only points far enough away from the points already stored are kept. In the paper, they set this distance at 5cm. The points themselves are stored in a KD-tree, a data structure that makes the nearest-neighbor searches fast.
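
A minimal sketch of that down-sampling, keeping roughly one point per 5cm cell. This “first point wins” version is simpler than PCL’s VoxelGrid filter, which averages the points in each cell.

```python
import numpy as np

def voxel_downsample(points, voxel_size=0.05):
    """Keep roughly one point per voxel_size cube (5 cm by default)."""
    keys = np.floor(points / voxel_size).astype(np.int64)
    _, first_idx = np.unique(keys, axis=0, return_index=True)
    return points[np.sort(first_idx)]
```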

Two great tastes that taste great together

By doing odometry as close to real time as you can, you can track how your robot is moving through space. By doing a separate mapping stage that takes that odometry information into account, you can spend a lot of time making an accurate map without sacrificing your position estimate. Separating these tasks makes them both perform better.

In particular, odometry is important for things like motion planning that may not need access to the map. You can’t stop odometry while you wait for your map to compute. But odometry by itself can be very noisy, and you wouldn’t want something that needs a map to depend only on lidar-odometry information for environment estimates.

There are a lot of ways to improve on this type of algorithm. One of the more obvious ones is to supplement your odometry stage with another sensor. If you have an IMU measuring accelerations, you can use those to seed your initial transform estimate in your lidar-odometry step.