Conditioning and Inference

From Simulation to Inference

So far, we’ve used GenJAX to generate outcomes — simulating what could happen.

Now we’ll learn to infer — reasoning backwards from observations to causes.

This is the heart of probabilistic programming!


📓 Interactive Notebook Available

Prefer hands-on learning? This chapter has a companion Jupyter notebook (Open in Colab) that walks through Bayesian inference interactively with working code, visualizations, and exercises. You can work through the notebook first, then return here for the detailed explanations, or use them side by side!


Recall: Conditional Probability

From the probability tutorial, remember conditional probability:

“Given that I observed $B$, what’s the probability of $A$?”

Written: $P(A \mid B)$

Meaning: Restrict the outcome space to only outcomes in $B$, then calculate the probability of $A$ within that restricted space.

Formula: $P(A \mid B) = \frac{P(A \cap B)}{P(B)} = \frac{|A \cap B|}{|B|}$

📘 Foundation Concept: Conditioning as Restriction

Recall from Tutorial 1, Chapter 4 that conditional probability means restricting the outcome space:

$$P(A \mid B) = \frac{|A \cap B|}{|B|}$$

The key idea: Cross out outcomes where $B$ didn’t happen, then calculate probabilities in what remains.

Tutorial 1 example: “At least one tonkatsu” given “first meal was tonkatsu”

  • Original space: {HH, HT, TH, TT}
  • Condition: First meal is T → Restrict to {TH, TT}
  • Event: At least one T → In restricted space: {TH, TT}
  • Probability: 2/2 = 1 (both remaining outcomes have tonkatsu!)

What GenJAX does:

  • Tutorial 1: Manually cross out outcomes and count
  • Tutorial 2: Code filters simulations or uses ChoiceMap to restrict

The logic is identical — conditioning = restricting possibilities to match observations!
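
To make the parallel concrete, here is a minimal sketch (plain Python, no GenJAX) of the same restrict-and-count logic for the Tutorial 1 example; the variable names are just for illustration:

# Outcome space from Tutorial 1: two meals, T = tonkatsu, H = something else
outcomes = ["HH", "HT", "TH", "TT"]

# Condition: cross out outcomes where the first meal was not tonkatsu
restricted = [o for o in outcomes if o[0] == "T"]

# Event: at least one tonkatsu, evaluated inside the restricted space
event = [o for o in restricted if "T" in o]

print(len(event) / len(restricted))  # 2/2 = 1.0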

← Review conditional probability in Tutorial 1, Chapter 4


The Taxicab Problem: A Real Inference Challenge

Let’s apply these ideas to a real problem from the probability tutorial.

Scenario: Chibany witnesses a hit-and-run at night. They say the taxi was blue. But:

  • 85% of taxis are green, 15% are blue
  • Chibany identifies colors correctly 80% of the time

Question: What’s the probability it was actually a blue taxi?

Why This Is Surprising

Most people’s intuition says: “Chibany is 80% accurate, so probably 80% chance it’s blue.”

But the answer is only about 41%!

Why? Because most taxis are green. Even with 80% accuracy, there are more green taxis misidentified as blue than there are actual blue taxis correctly identified.

This is base rate neglect — ignoring how common something is in the population.

Let’s see how GenJAX helps us solve this!


The Generative Model

First, we express the taxicab scenario as a GenJAX generative function:

import jax
import jax.numpy as jnp
from genjax import gen, flip

@gen
def taxicab_model(base_rate_blue=0.15, accuracy=0.80):
    """Generate the taxi color and what Chibany says.

    Args:
        base_rate_blue: Probability a taxi is blue (default 0.15)
        accuracy: Probability Chibany identifies correctly (default 0.80)

    Returns:
        True if taxi is blue, False if green
    """

    # True taxi color
    is_blue = flip(base_rate_blue) @ "is_blue"

    # Model: What does Chibany say?
    # Probability of saying "blue" depends on the true color:
    # - If blue: says "blue" with probability = accuracy (correct identification)
    # - If green: says "blue" with probability = (1 - accuracy) (mistake)
    says_blue_prob = jnp.where(is_blue, accuracy, 1 - accuracy)
    says_blue = flip(says_blue_prob) @ "says_blue"

    return is_blue

What this encodes:

  1. Prior: Taxis are blue 15% of the time (base rate)
  2. Likelihood: How observation (“says blue”) depends on true color
    • If blue: says “blue” 80% of the time (correct)
    • If green: says “blue” 20% of the time (mistake)
  3. Complete model: Joint distribution over true color and observation (factorized below)
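
In equation form, the @gen function defines the joint distribution as prior times likelihood:

$$P(\text{is\_blue}, \text{says\_blue}) = \underbrace{P(\text{is\_blue})}_{\text{prior}} \cdot \underbrace{P(\text{says\_blue} \mid \text{is\_blue})}_{\text{likelihood}}$$
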
📘 Foundation Concept: Bayes’ Theorem in Code

Recall from Tutorial 1, Chapter 5 that Bayes’ Theorem updates beliefs with evidence:

$$P(H \mid E) = \frac{P(E \mid H) \cdot P(H)}{P(E)}$$

In the taxicab problem:

  • Hypothesis (H): Taxi is blue
  • Evidence (E): Chibany says “blue”
  • Question: $P(\text{blue} \mid \text{says blue})$ = ?

Tutorial 1 approach (by hand):

  1. Calculate $P(\text{says blue} \mid \text{blue}) \cdot P(\text{blue}) = 0.80 \times 0.15 = 0.12$
  2. Calculate $P(\text{says blue} \mid \text{green}) \cdot P(\text{green}) = 0.20 \times 0.85 = 0.17$
  3. Calculate $P(\text{says blue}) = 0.12 + 0.17 = 0.29$
  4. Apply Bayes: $P(\text{blue} \mid \text{says blue}) = \frac{0.12}{0.29} \approx 0.41$

Tutorial 2 approach (GenJAX):

  1. Define the generative model (prior + likelihood)
  2. Specify observation (says blue)
  3. Let GenJAX compute the posterior automatically!

The structure is identical:

  • is_blue = flip(0.15) → Prior: $P(\text{blue})$
  • says_blue_prob = jnp.where(is_blue, accuracy, 1 - accuracy) → Likelihood: $P(\text{says blue} \mid \text{blue})$
  • GenJAX conditioning → Computes posterior: $P(\text{blue} \mid \text{says blue})$

Key insight: GenJAX does all the Bayes’ Theorem algebra for you! You just write the generative story (prior + likelihood), and conditioning gives you the posterior.

← Review Bayes’ Theorem in Tutorial 1, Chapter 5


Three Approaches to Inference

GenJAX provides three ways to compute conditional probabilities:

Approach 1: Filtering (Rejection Sampling)

Generate many traces, keep only those matching the observation.

Pseudocode:

1. Generate many traces
2. Keep only traces where observation is true
3. Among those, count how many satisfy the query
4. Calculate the ratio

This is Monte Carlo conditional probability — exactly what we did by hand with sets!

Approach 2: Conditioning with generate()

GenJAX has built-in support for specifying observations. We provide a choice map with the observed values, and GenJAX generates traces consistent with those observations.

Approach 3: Full Inference (Importance Sampling, MCMC)

More advanced methods (beyond this tutorial). These are more efficient when observations are rare.

This chapter focuses on Approach 1 and 2 — the most intuitive methods.

📐→💻 Math-to-Code Translation

How Bayesian inference translates to GenJAX:

Math Concept | Mathematical Notation | GenJAX Code
--- | --- | ---
Prior | $P(H)$ | flip(0.15) @ "is_blue"
Likelihood | $P(E \mid H)$ | jnp.where(is_blue, accuracy, 1 - accuracy)
Evidence | $P(E)$ | GenJAX computes automatically
Posterior | $P(H \mid E) = \frac{P(E \mid H) \cdot P(H)}{P(E)}$ | Result of conditioning
Observation | $E$ = "says blue" | ChoiceMap.d({"says_blue": True})
Inference query | $P(\text{is\_blue} \mid \text{says\_blue})$ | Weighted mean of posterior samples

Three equivalent inference approaches:

Approach | Mathematical Idea | GenJAX Implementation
--- | --- | ---
1. Filtering | Sample from the joint, keep only samples matching $E$ | Filter traces where says_blue == 1
2. generate() | Weighted samples targeting $P(H \mid E)$ | model.generate(key, observation, args)
3. importance() | Weighted (importance) sampling | target.importance(key, n_particles)

Key insights:

  • Generative model = Prior + Likelihood — The @gen function encodes both
  • Conditioning = Computing posterior — GenJAX does the Bayes’ theorem math
  • All three methods compute the same thing — They just differ in efficiency
  • Base rates matter! — Prior P(H) heavily influences posterior P(H|E)

Approach 1: Filtering (Rejection Sampling)

Let’s solve the taxicab problem by generating many scenarios and filtering to the observation.

Step 1: Generate Many Scenarios

# Generate 100,000 scenarios
import jax
import jax.numpy as jnp

key = jax.random.key(42)
keys = jax.random.split(key, 100000)

def run_scenario_vec(k):
    trace = taxicab_model.simulate(k, (0.15, 0.80))
    choices = trace.get_choices()
    return jnp.array([choices['is_blue'], choices['says_blue']])

scenarios = jax.vmap(run_scenario_vec)(keys)
is_blue = scenarios[:, 0]
says_blue = scenarios[:, 1]

Step 2: Filter to Observation

Observation: Chibany says “blue”

# Keep only scenarios where Chibany says "blue"
import jax.numpy as jnp

observation_satisfied = says_blue == 1

n_says_blue = jnp.sum(observation_satisfied)
print(f"Scenarios where Chibany says blue: {n_says_blue} / {len(scenarios)}")

Output (example):

Scenarios where Chibany says blue: 29017 / 100000

Why ~29%?

  • $P(\text{says Blue}) = P(\text{Blue}) \cdot P(\text{says Blue} \mid \text{Blue}) + P(\text{Green}) \cdot P(\text{says Blue} \mid \text{Green})$
  • $= 0.15 \times 0.80 + 0.85 \times 0.20 = 0.12 + 0.17 = 0.29$

Step 3: Count True Positives

Among scenarios where they say “blue”, how many are actually blue?

# Both says blue AND is blue
import jax.numpy as jnp

both_blue = observation_satisfied & (is_blue == 1)

n_actually_blue = jnp.sum(both_blue)
print(f"Scenarios where taxi IS blue: {n_actually_blue} / {n_says_blue}")

Output (example):

Scenarios where taxi IS blue: 12038 / 29017

Step 4: Calculate Posterior

prob_blue_given_says_blue = n_actually_blue / n_says_blue
print(f"\nP(Blue | says Blue) ≈ {prob_blue_given_says_blue:.3f}")

Output:

P(Blue | says Blue) ≈ 0.415

Only 41.5%! Even though Chibany is 80% accurate, there’s less than 50% chance the taxi was actually blue!

The Base Rate Strikes!

Why so low?

Even though Chibany is 80% accurate, most taxis are green (85%). So even with only a 20% error rate on green taxis, there are more green taxis misidentified as blue than there are actual blue taxis correctly identified!

The numbers:

  • Blue taxis correctly identified: $0.15 \times 0.80 = 0.12$ (12%)
  • Green taxis incorrectly identified: $0.85 \times 0.20 = 0.17$ (17%)

More false positives than true positives!

This is why the posterior is only 41.5% ≈ 12/(12+17).
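
As a single Bayes' rule fraction, using the two numbers above:

$$P(\text{Blue} \mid \text{says Blue}) = \frac{0.12}{0.12 + 0.17} = \frac{0.12}{0.29} \approx 0.414$$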

The Filtering Pattern

Conditional probability via filtering:

  1. Generate many traces
  2. Filter to observations (keep only matching traces)
  3. Count queries among filtered traces
  4. Divide to get conditional probability

This is rejection sampling — the simplest form of inference!
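
Putting the four steps together, the whole filtering computation fits in one helper. This is a sketch that assumes taxicab_model is defined as above; the function name estimate_posterior_by_filtering is just for illustration:

import jax
import jax.numpy as jnp

def estimate_posterior_by_filtering(key, n=100_000):
    """Estimate P(Blue | says Blue) by simulating, filtering, and counting."""
    keys = jax.random.split(key, n)

    def one_scenario(k):
        choices = taxicab_model.simulate(k, (0.15, 0.80)).get_choices()
        return choices["is_blue"], choices["says_blue"]

    is_blue, says_blue = jax.vmap(one_scenario)(keys)

    # Keep only scenarios matching the observation, then count the query among them
    return jnp.sum(is_blue & says_blue) / jnp.sum(says_blue)

print(estimate_posterior_by_filtering(jax.random.key(0)))  # ≈ 0.41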


Approach 2: Using generate() with Observations

Now let’s use GenJAX’s built-in conditioning. This is usually more convenient!

Creating a Choice Map and Generating Conditional Traces

A choice map is a dictionary specifying values for named random choices. We use it to condition the model on observations:

import jax
import jax.numpy as jnp
from genjax import ChoiceMap

# Specify that Chibany says "blue"
# Note: flip() returns boolean values, so we use True/False
observation = ChoiceMap.d({"says_blue": True})

# Generate 10,000 traces conditional on observation
key = jax.random.key(42)
keys = jax.random.split(key, 10000)

def run_conditional(k):
    trace, weight = taxicab_model.generate(k, observation, (0.15, 0.80))
    return trace.get_retval(), weight  # Return both value and weight

results = jax.vmap(run_conditional)(keys)
is_blue_samples = results[0]  # The sampled values
weights = results[1]  # The importance weights (log probabilities)

# IMPORTANT: Use importance sampling with weights!
# Normalize weights (convert from log space and normalize)
normalized_weights = jnp.exp(weights - jnp.max(weights))
normalized_weights = normalized_weights / jnp.sum(normalized_weights)

# Calculate weighted posterior probability
prob_blue_posterior = jnp.sum(is_blue_samples * normalized_weights)
print(f"P(Blue | says Blue) ≈ {prob_blue_posterior:.3f}")

Output:

P(Blue | says Blue) ≈ 0.414

Same answer! Both methods work — generate() is just more convenient.

⚠️ Critical: Always Use the Weights!

When using generate() with conditioning, you must use the importance weights returned!

Why? When GenJAX generates traces conditional on observations, different traces have different probabilities. The weight tells you how likely each trace is. Simply averaging without weights gives you the prior (what you believed before seeing the evidence), not the posterior (what you should believe after seeing the evidence).

Correct approach:

# Return BOTH value and weight
trace, weight = model.generate(key, observation, args)
# Then use weighted average with normalized weights

Incorrect approach (gives wrong answer):

# Only returning value, ignoring weight
trace, weight = model.generate(key, observation, args)
return trace.get_retval()  # ❌ Discarding weight!
# Then simple average → gives prior, not posterior!

This is the essence of importance sampling - a fundamental inference technique in probabilistic programming.
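
In formula form, the weighted average above is the self-normalized importance sampling estimate. With posterior samples $h_i$ (here, the sampled is_blue values) and weights $w_i = \exp(\text{weight}_i)$:

$$\hat{P}(\text{Blue} \mid \text{says Blue}) = \frac{\sum_i w_i \, h_i}{\sum_i w_i}$$

Subtracting the maximum log-weight before exponentiating, as the code does, rescales numerator and denominator by the same constant, so it leaves the estimate unchanged while avoiding numerical overflow.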

generate() vs simulate()

simulate(key, args):

  • Generates a trace with all choices random
  • No observations specified
  • Returns just the trace

generate(key, observations, args):

  • Generates a trace consistent with observations
  • Specified choices take given values
  • Unspecified choices are random
  • Returns (trace, weight) where weight = log probability of observations

When to use which:

  • Forward simulation (no observations): Use simulate()
  • Conditional sampling (some observations): Use generate() (see the sketch below)
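
As a quick side-by-side sketch, using the taxicab_model, key, and observation defined earlier in this chapter:

# Forward simulation: every choice is sampled, nothing is observed
trace = taxicab_model.simulate(key, (0.15, 0.80))

# Conditional generation: "says_blue" is fixed to the observed value,
# "is_blue" is still sampled, and a log importance weight is returned
trace, weight = taxicab_model.generate(key, observation, (0.15, 0.80))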

Theoretical Verification

Let’s verify our simulation against the exact Bayes’ theorem calculation:

# Prior
P_blue = 0.15
P_green = 0.85

# Likelihood
P_says_blue_given_blue = 0.80
P_says_blue_given_green = 0.20

# Evidence (total probability of saying blue)
P_says_blue = (P_blue * P_says_blue_given_blue +
               P_green * P_says_blue_given_green)

# Posterior (Bayes' theorem)
P_blue_given_says_blue = (P_says_blue_given_blue * P_blue) / P_says_blue

print(f"=== Bayes' Theorem Calculation ===")
print(f"P(Blue) = {P_blue}")
print(f"P(says Blue | Blue) = {P_says_blue_given_blue}")
print(f"P(says Blue | Green) = {P_says_blue_given_green}")
print(f"P(says Blue) = {P_says_blue}")
print(f"\nP(Blue | says Blue) = {P_blue_given_says_blue:.3f}")

Output:

=== Bayes' Theorem Calculation ===
P(Blue) = 0.15
P(says Blue | Blue) = 0.8
P(says Blue | Green) = 0.2
P(says Blue) = 0.29

P(Blue | says Blue) = 0.414

Perfect match! GenJAX simulation ≈ 0.415, Bayes’ theorem exact = 0.414


Visualizing Prior vs Posterior

Let’s visualize how evidence changes our beliefs:

import matplotlib.pyplot as plt

# Prior: before observation
prior_blue = 0.15
prior_green = 0.85

# Posterior: after observation
posterior_blue = prob_blue_posterior  # From simulation
posterior_green = 1 - posterior_blue

# Plot
fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(12, 5))

categories = ['Green', 'Blue']
colors = ['#4ecdc4', '#6c5ce7']

# Prior
ax1.bar(categories, [prior_green, prior_blue], color=colors)
ax1.set_ylabel('Probability', fontsize=12)
ax1.set_title('Prior: Before Chibany Speaks', fontsize=14, fontweight='bold')
ax1.set_ylim(0, 1)
ax1.axhline(y=0.5, color='gray', linestyle='--', alpha=0.5)

for i, prob in enumerate([prior_green, prior_blue]):
    ax1.text(i, prob + 0.05, f'{prob:.1%}', ha='center', fontsize=14, fontweight='bold')

# Posterior
ax2.bar(categories, [posterior_green, posterior_blue], color=colors)
ax2.set_ylabel('Probability', fontsize=12)
ax2.set_title('Posterior: After Chibany Says "Blue"', fontsize=14, fontweight='bold')
ax2.set_ylim(0, 1)
ax2.axhline(y=0.5, color='gray', linestyle='--', alpha=0.5)

for i, prob in enumerate([posterior_green, posterior_blue]):
    ax2.text(i, prob + 0.05, f'{prob:.1%}', ha='center', fontsize=14, fontweight='bold')

plt.tight_layout()
plt.show()

print(f"\n📊 Belief Update:")
print(f"   Before: P(Blue) = {prior_blue:.1%}")
print(f"   After:  P(Blue | says Blue) = {posterior_blue:.1%}")
print(f"   Change: +{(posterior_blue - prior_blue):.1%}")

Key insight: Evidence increased our belief in blue from 15% to 41%, yet it still falls short of 50% because the base rate is so strong!

Prior vs. posterior probability distributions


Exploring Base Rate Effects

Let’s see how changing the base rate affects the answer.

Scenario 1: Equal Taxis (50% blue, 50% green)

import jax
import jax.numpy as jnp
from genjax import ChoiceMap

observation = ChoiceMap.d({"says_blue": True})

def run_equal_base(k):
    trace, weight = taxicab_model.generate(k, observation, (0.50, 0.80))
    return trace.get_retval(), weight

key = jax.random.key(42)
keys = jax.random.split(key, 10000)
results_equal = jax.vmap(run_equal_base)(keys)
weights_equal = jnp.exp(results_equal[1] - jnp.max(results_equal[1]))
weights_equal = weights_equal / jnp.sum(weights_equal)
prob_equal = jnp.sum(results_equal[0] * weights_equal)

print(f"If 50% blue: P(Blue | says Blue) = {prob_equal:.3f}")

Output:

If 50% blue: P(Blue | says Blue) = 0.800

Now it’s 80%! When base rates are equal, accuracy dominates.
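
The hand calculation confirms it; with equal priors, the base-rate terms cancel and only the accuracy remains:

$$P(\text{Blue} \mid \text{says Blue}) = \frac{0.80 \times 0.50}{0.80 \times 0.50 + 0.20 \times 0.50} = \frac{0.40}{0.50} = 0.80$$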

Scenario 2: Mostly Blue (85% blue, 15% green)

import jax
import jax.numpy as jnp

def run_mostly_blue(k):
    trace, weight = taxicab_model.generate(k, observation, (0.85, 0.80))
    return trace.get_retval(), weight

results_mostly_blue = jax.vmap(run_mostly_blue)(keys)
weights_mostly_blue = jnp.exp(results_mostly_blue[1] - jnp.max(results_mostly_blue[1]))
weights_mostly_blue = weights_mostly_blue / jnp.sum(weights_mostly_blue)
prob_mostly_blue = jnp.sum(results_mostly_blue[0] * weights_mostly_blue)

print(f"If 85% blue: P(Blue | says Blue) = {prob_mostly_blue:.3f}")

Output:

If 85% blue: P(Blue | says Blue) = 0.958

Now it’s about 96%! When most taxis are blue, seeing “blue” is strong evidence.
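
Again, the exact calculation matches:

$$P(\text{Blue} \mid \text{says Blue}) = \frac{0.80 \times 0.85}{0.80 \times 0.85 + 0.20 \times 0.15} = \frac{0.68}{0.71} \approx 0.958$$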

Visualizing the Effect

import jax
import jax.numpy as jnp
import matplotlib.pyplot as plt
from genjax import ChoiceMap

observation = ChoiceMap.d({"says_blue": True})

# Test different base rates
base_rates = jnp.linspace(0.01, 0.99, 50)
posteriors = []
key = jax.random.key(42)

for rate in base_rates:
    def run_with_rate(k):
        trace, weight = taxicab_model.generate(k, observation, (float(rate), 0.80))
        return trace.get_retval(), weight

    keys = jax.random.split(key, 1000)
    results = jax.vmap(run_with_rate)(keys)
    samples, weights = results[0], results[1]

    # Use importance sampling
    normalized_weights = jnp.exp(weights - jnp.max(weights))
    normalized_weights = normalized_weights / jnp.sum(normalized_weights)
    posteriors.append(jnp.sum(samples * normalized_weights))

# Plot
plt.figure(figsize=(10, 6))
plt.plot(base_rates, posteriors, linewidth=2, color='#6c5ce7')
plt.axhline(y=0.5, color='gray', linestyle='--', alpha=0.5, label='50% threshold')
plt.axvline(x=0.15, color='red', linestyle='--', alpha=0.7, label='Original problem (15%)')
plt.scatter([0.15], [0.414], color='red', s=100, zorder=5)

plt.xlabel('Base Rate: P(Blue)', fontsize=12)
plt.ylabel('Posterior: P(Blue | says Blue)', fontsize=12)
plt.title('How Base Rates Affect Inference\n(Chibany 80% accurate)', fontsize=14, fontweight='bold')
plt.grid(alpha=0.3)
plt.legend(fontsize=10)
plt.tight_layout()
plt.show()

The graph shows a smooth, monotonically increasing curve, steepest at low base rates and flattening as it approaches 1:

  • Low base rates (e.g., 1%): Even with a positive witness, the posterior stays low (~4%)
  • Medium base rates (e.g., 50%): The posterior equals the witness’s accuracy (~80%)
  • High base rates (e.g., 99%): The posterior approaches certainty (~99.7%)

The curve shows: Even with 80% accuracy, the posterior depends heavily on the base rate! (The closed-form expression below makes this precise.)
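
For a fixed accuracy of 80%, the plotted curve has a simple closed form as a function of the base rate $b$ (the simulation just adds sampling noise on top of it):

$$P(\text{Blue} \mid \text{says Blue}) = \frac{0.80\,b}{0.80\,b + 0.20\,(1 - b)}$$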

How base rates affect inference

The Lesson

Base rates matter enormously in real-world inference!

Medical tests, fraud detection, witness testimony — all require considering:

  1. How accurate is the test/witness? (likelihood)
  2. How common is the condition/crime? (prior/base rate)

Ignoring base rates leads to wrong conclusions.

This is called base rate neglect — a common cognitive bias.


Key Concepts

Prior Distribution

What we believe before seeing data.

  • Generated by simulate() (no observations)
  • Represents our initial uncertainty

Posterior Distribution

What we believe after seeing data.

  • Generated by generate() with observations
  • Represents updated beliefs incorporating evidence

Bayes’ Theorem in Action

GenJAX automatically handles the math:

$$P(\text{hypothesis} \mid \text{data}) = \frac{P(\text{data} \mid \text{hypothesis}) \cdot P(\text{hypothesis})}{P(\text{data})}$$

You just:

  1. Define the generative model (encodes $P(\text{data} \mid \text{hypothesis})$ and $P(\text{hypothesis})$)
  2. Specify observations (the data)
  3. Generate conditional traces (GenJAX computes the posterior)

No manual Bayes’ rule calculation needed!

The Power of Generative Models

When you write a generative function, you’re specifying:

  • Prior: The distribution of random choices before observations
  • Likelihood: How observations depend on hidden variables
  • Joint distribution: The complete probabilistic model

GenJAX handles the inference automatically!


Complete Example Code

Here’s everything together for easy copying:

import jax
import jax.numpy as jnp
from genjax import gen, flip, ChoiceMap
import matplotlib.pyplot as plt

@gen
def taxicab_model(base_rate_blue=0.15, accuracy=0.80):
    """Taxicab problem generative model."""
    is_blue = flip(base_rate_blue) @ "is_blue"

    # What Chibany says depends on true color
    says_blue_prob = jnp.where(is_blue, accuracy, 1 - accuracy)
    says_blue = flip(says_blue_prob) @ "says_blue"

    return is_blue

# Observation: Chibany says "blue"
observation = ChoiceMap.d({"says_blue": True})

# Generate posterior samples
key = jax.random.key(42)
keys = jax.random.split(key, 10000)

def run_inference(k):
    trace, weight = taxicab_model.generate(k, observation, (0.15, 0.80))
    return trace.get_retval(), weight

results = jax.vmap(run_inference)(keys)
is_blue_samples = results[0]
weights = results[1]

# Use importance sampling with weights
normalized_weights = jnp.exp(weights - jnp.max(weights))
normalized_weights = normalized_weights / jnp.sum(normalized_weights)
prob_blue = jnp.sum(is_blue_samples * normalized_weights)

print(f"=== TAXICAB INFERENCE ===")
print(f"Base rate: 15% blue")
print(f"Accuracy: 80%")
print(f"Observation: Says 'blue'")
print(f"\nP(Blue | says Blue) ≈ {prob_blue:.3f}")

Interactive Exploration

📓 Interactive Notebook: Bayesian Learning

Want to explore Bayesian learning in depth with interactive examples? Check out the Bayesian Learning notebook (Open in Colab), which covers:

  • The complete taxicab problem with visualizations
  • Sequential Bayesian updating with multiple observations
  • Interactive sliders to explore different base rates and accuracies
  • How base rates affect posterior beliefs

This notebook lets you experiment with the concepts you just learned!


Exercises

Exercise 1: Higher Accuracy

What if Chibany were 95% accurate instead of 80%?

Task: Modify the code to use accuracy=0.95 and calculate the posterior.

Solution
import jax
import jax.numpy as jnp

def run_high_accuracy(k):
    trace, weight = taxicab_model.generate(k, observation, (0.15, 0.95))
    return trace.get_retval(), weight

key = jax.random.key(42)
keys = jax.random.split(key, 10000)
results_high_acc = jax.vmap(run_high_accuracy)(keys)
weights_high_acc = jnp.exp(results_high_acc[1] - jnp.max(results_high_acc[1]))
weights_high_acc = weights_high_acc / jnp.sum(weights_high_acc)
prob_high_acc = jnp.sum(results_high_acc[0] * weights_high_acc)

print(f"With 95% accuracy: P(Blue | says Blue) = {prob_high_acc:.3f}")

Expected: ≈ 0.77 (77%)

Much higher! Accuracy matters, but even at 95%, base rates still pull it below 100%.

Theoretical: $$P = \frac{0.95 \times 0.15}{0.95 \times 0.15 + 0.05 \times 0.85} = \frac{0.1425}{0.1850} \approx 0.770$$


Exercise 2: Opposite Observation

What if Chibany said “green” instead of “blue”?

Task: Calculate $P(\text{Blue} \mid \text{says Green})$

Solution
# Observation: says "green"
import jax
import jax.numpy as jnp

observation_green = ChoiceMap.d({"says_blue": False})

def run_says_green(k):
    trace, weight = taxicab_model.generate(k, observation_green, (0.15, 0.80))
    return trace.get_retval(), weight

key = jax.random.key(42)
keys = jax.random.split(key, 10000)
results_green = jax.vmap(run_says_green)(keys)
weights_green = jnp.exp(results_green[1] - jnp.max(results_green[1]))
weights_green = weights_green / jnp.sum(weights_green)
prob_blue_given_green = jnp.sum(results_green[0] * weights_green)

print(f"P(Blue | says Green) = {prob_blue_given_green:.3f}")

Expected: ≈ 0.042 (4.2%)

Very low! If Chibany (80% accurate) says “green”, it’s very likely green.

Theoretical: $$P = \frac{0.20 \times 0.15}{0.20 \times 0.15 + 0.80 \times 0.85} = \frac{0.03}{0.71} \approx 0.042$$


Exercise 3: Two Witnesses

What if two independent witnesses both say “blue”?

Task: Extend the model to include two witnesses, both 80% accurate. Calculate the posterior.

Solution
import jax
import jax.numpy as jnp
from genjax import gen, flip, ChoiceMap

@gen
def taxicab_two_witnesses(base_rate_blue=0.15, accuracy=0.80):
    """Two independent witnesses."""
    is_blue = flip(base_rate_blue) @ "is_blue"

    # Witness 1
    witness1_prob = jnp.where(is_blue, accuracy, 1 - accuracy)
    witness1 = flip(witness1_prob) @ "witness1"

    # Witness 2 (independent)
    witness2_prob = jnp.where(is_blue, accuracy, 1 - accuracy)
    witness2 = flip(witness2_prob) @ "witness2"

    return is_blue

# Both say "blue"
observation_two = ChoiceMap.d({"witness1": True, "witness2": True})

def run_two_witnesses(k):
    trace, weight = taxicab_two_witnesses.generate(k, observation_two, (0.15, 0.80))
    return trace.get_retval(), weight

key = jax.random.key(42)
keys = jax.random.split(key, 10000)
results_two = jax.vmap(run_two_witnesses)(keys)
weights_two = jnp.exp(results_two[1] - jnp.max(results_two[1]))
weights_two = weights_two / jnp.sum(weights_two)
prob_two = jnp.sum(results_two[0] * weights_two)

print(f"P(Blue | both say Blue) = {prob_two:.3f}")

Expected: ≈ 0.74 (74%)

Much higher! Two independent pieces of evidence are much stronger.

Theoretical: $$P(\text{both say Blue} \mid \text{Blue}) = 0.80^2 = 0.64$$ $$P(\text{both say Blue} \mid \text{Green}) = 0.20^2 = 0.04$$ $$P = \frac{0.64 \times 0.15}{0.64 \times 0.15 + 0.04 \times 0.85} = \frac{0.096}{0.130} \approx 0.738$$

Two witnesses push us above 50% despite the low base rate!


What You’ve Learned

In this chapter, you learned:

  • Conditional probability — restriction to observations
  • Filtering approach — rejection sampling for inference
  • generate() function — conditioning with choice maps
  • Prior vs Posterior — beliefs before and after data
  • Bayes’ theorem in action — automatic Bayesian update
  • Base rate effects — why priors matter enormously
  • Real inference problems — the taxicab scenario

The key insight: Probabilistic programming lets you encode assumptions (generative model) and ask questions (conditioning) without manual Bayes’ rule calculations!


Why This Matters

Real-world applications:

  1. Medical diagnosis: Test accuracy + disease prevalence → probability of disease
  2. Fraud detection: Transaction patterns + fraud base rate → probability of fraud
  3. Spam filtering: Email features + spam base rate → probability of spam
  4. Criminal justice: Witness accuracy + crime base rate → probability of guilt

All follow the same pattern:

  • Define generative model (how data arises)
  • Observe data
  • Infer hidden causes

GenJAX makes this systematic and scalable.


Next Steps

You now know:

  • How to build generative models
  • How to perform inference with observations
  • How to interpret posterior probabilities
  • Why base rates matter

Next up: Chapter 6 shows you how to build your own models from scratch!


← Previous: Understanding Traces | Next: Building Your Own Models →