Bayesian Generalization
Bayesian Generalization
How do you learn a concept from a handful of examples? You see three numbers that fit a hidden rule β or a few bentos with a golden sticker β and somehow you know which other things fit too. This chapter shows that the same Bayes’ rule you already know becomes a model of human generalization once you make a single shift: a hypothesis is a set.
Try it yourself
A companion notebook builds the number game and the size principle interactively:
π Open in Colab: 07_generalization.ipynb
The one new idea is that the unknown you reason about is no longer a number (a mean $\mu$) or a yes/no fact (is the taxi blue?), but a set β a rule about which things share a property. Everything else β Bayes’ rule, the posterior, the predictive distribution β is machinery you already have.
This chapter is long, so it’s split into four parts. Work through them in order:
The four parts
- Setup & the Framework β the golden-sticker story, the keystone shift from “which event?” to “which set?”, Shepard’s law as the target to aim for, and the framework named (hypothesis space, prior, likelihood, posterior; the membership matrix).
- The Number Game & the Size Principle β generalization as a posterior-weighted vote; weak vs. strong sampling; the size principle; and Tenenbaum’s number game, where one example gives graded generalization and three snap to a rule.
- Continuous Concepts & Shepard’s Law β the rectangle game: the same framework over infinitely many interval hypotheses, where Shepard’s exponential law of generalization emerges from the model rather than being assumed.
- No Free Lunch & Summary β why a learner that assumes nothing learns nothing, so the prior is unavoidable; the chapter summary; practice exercises; and references.
graph LR
P1[1. Setup & framework] --> P2[2. Number game & size principle]
P2 --> P3[3. Continuous & Shepard's law]
P3 --> P4[4. No Free Lunch & summary]
classDef part fill:#2c7fb8,stroke:#333,stroke-width:2px,color:#fff
class P1,P2,P3,P4 partThe shift in one sentence
A hypothesis in this chapter is a rule, and a rule is a set β the set of things the rule says have the property. Hold onto “a hypothesis is a set”; everything else follows from it.