Homework 1: simulation and probability
Instructions: Please answer the following questions and submit your work by editing this Jupyter notebook and submitting it on Canvas. Questions may involve math, programming, or neither, but you should make sure to explain your work: i.e., you should usually have a cell with at least a few sentences explaining what you are doing.
import numpy as np
rng = np.random.default_rng()
1. Probabilities and expectations
Find the following quantities, both by (i) math and by (ii) simulation, for the independent random variables $$\begin{aligned} N &\sim \text{Poisson}(\text{mean}=2) \\ T &\sim \text{Exponential}(\text{mean}=5) \\ X &\sim \text{Normal}(\text{mean}=10, \text{sd}=3) . \end{aligned}$$
For instance, if asked to find the probability that $N=0$ then you might consult Wikipedia, and so report that (i) this is $$\mathbb{P}\{N = 0\} = e^{-\lambda} = e^{-2}$$ and, to verify this, (ii) check it by simulation:
N = rng.poisson(lam=2, size=1000000)
print(f"Simulation: {np.mean(N == 0)}, theory: {np.exp(-2)}")
Simulation: 0.135645, theory: 0.1353352832366127
Note that you should include a comparison between the numerical value produced by your mathematical expression and the value obtained by simulation, and refer to any properties of random variables used in the calculations.
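As a starting point for the simulations, here is a minimal sketch of how the three random variables might be drawn with NumPy. It assumes the usual NumPy parameterizations: `exponential` takes `scale`, which is the mean (not the rate), and `normal` takes `scale`, which is the standard deviation (not the variance). The seed and the sample size `n` are arbitrary choices made for this illustration only.

```python
import numpy as np

# The seed is an arbitrary choice, used only to make this sketch reproducible.
rng = np.random.default_rng(seed=123)
n = 1_000_000  # number of independent draws of each variable

N = rng.poisson(lam=2, size=n)           # Poisson with mean 2
T = rng.exponential(scale=5, size=n)     # Exponential: scale is the mean, so mean 5
X = rng.normal(loc=10, scale=3, size=n)  # Normal: loc is the mean, scale is the sd

# Sanity checks against the stated parameters.
print(f"mean of N: {N.mean():.3f} (stated mean: 2)")
print(f"mean of T: {T.mean():.3f} (stated mean: 5)")
print(f"mean of X: {X.mean():.3f}, sd of X: {X.std():.3f} (stated: 10 and 3)")
```

Because the three arrays are drawn separately from the same generator, they can be treated as independent, so expressions such as `T + X` or `N * T` give samples of the corresponding combinations.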
a. The probability that $T < 4$.
b. The standard deviation of $T + X$.
c. The expected value of $2X + 1$.
d. The expected value of $NT$ (i.e., the product of $N$ and $T$).
2. Ascending sums
For each $i \ge 1$, let $D_i$ be a random number drawn independently and uniformly from $\{1, 2, 3, 4, 5, 6\}$. Let $$ K = \min\{ k \ge 1 \;:\; D_k + D_{k+1} = 7 \} , $$ i.e., $K$ is defined by the fact that $D_{K}$ and $D_{K+1}$ are the first pair of adjacent rolled numbers that add up to 7. Finally, let $$ X = \sum_{i=1}^K D_i . $$
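To make the definition concrete, the following sketch evaluates $K$ and $X$ for one fixed, made-up sequence of rolls. It is only an illustration of the definition, not a simulation: the helper name `first_seven_pair` and the example sequence are hypothetical.

```python
def first_seven_pair(rolls):
    """Return (K, X) for a fixed sequence of rolls.

    K is the 1-based index of the first roll that, together with the
    next roll, sums to 7; X is the sum of the first K rolls.
    Returns None if no adjacent pair in the sequence sums to 7.
    """
    for k in range(1, len(rolls)):
        # rolls is 0-indexed, so D_k is rolls[k - 1] and D_{k+1} is rolls[k].
        if rolls[k - 1] + rolls[k] == 7:
            return k, sum(rolls[:k])
    return None


# Hypothetical sequence of rolls, chosen by hand for this example.
example_rolls = [3, 5, 3, 6, 1, 4]
k_val, x_val = first_seven_pair(example_rolls)
print(f"K = {k_val}, X = {x_val}")
```

Here the adjacent sums are 8, 8, 9, and then 6 + 1 = 7, so $K = 4$ and $X = 3 + 5 + 3 + 6 = 17$. Note that $D_{K+1}$ (the 1 in this sequence) is not included in the sum.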
a. Describe in words how to simulate $X$ using fair dice.
b. Write a function to simulate $X$ (in python).
The function should have one argument, `size`, that determines the number of independent samples of $X$ that are returned.
c. Make a plot describing the distribution of $X$, and estimate its mean (by simulating at least $10^5$ values).