For \( Y \in \mathbb{R}^n \), \( X \in \mathbb{R}^{n \times p} \), and \( \beta \in \mathbb{R}^p \), consider a regression model \[ Y = X \beta + \epsilon, \] where \( \epsilon \) has an \( n \)-dimensional multivariate normal distribution with zero mean vector and identity covariance matrix. Let \( I_p \) denote the identity matrix of order \( p \). For \( \lambda>0 \), let \[ \hat{\beta}_n = (X^T X + \lambda I_p)^{-1} X^T Y, \] be an estimator of \( \beta \). Then which of the following options is/are correct?
Step 1: Unbiasedness of \( \hat{\beta}_n \) The estimator \( \hat{\beta}_n = (X^T X + \lambda I_p)^{-1} X^T Y \) is biased because the regularization term \( \lambda I_p \) adds bias. Therefore, \( \hat{\beta}_n \) is not an unbiased estimator of \( \beta \).
Thus, Option (A) is incorrect.
Step 2: Positive Definiteness of \( X^T X + \lambda I_p \)
Since \( X^T X \) is positive semi-definite and \( \lambda I_p \) is a positive definite matrix for \( \lambda>0 \), the matrix \( X^T X + \lambda I_p \) is positive definite.
Thus, Option (B) is correct.
Step 3: Distribution of \( \hat{\beta}_n \)
Since \( \epsilon \sim \mathcal{N}(0, I_n) \), the estimator \( \hat{\beta}_n \) is a linear function of the normally distributed vector \( Y \), and hence \( \hat{\beta}_n \) follows a multivariate normal distribution.
Thus, Option (C) is correct.
Step 4: Variance of \( \hat{\beta}_n \)
The variance of \( \hat{\beta}_n \) is given by: \[ {Var}(\hat{\beta}_n) = (X^T X + \lambda I_p)^{-1}. \]
Thus, Option (D) is correct. Final Answer:
The correct answers are \( \boxed{B, C} \).
Let \( (X_1, X_2, X_3) \) follow the multinomial distribution with the number of trials being 100 and the probability vector \( \left( \frac{3}{10}, \frac{1}{10}, \frac{3}{5} \right) \).
Then \( E(X_2 | X_3 = 40) \) equals:
Let \( X \) follow a 10-dimensional multivariate normal distribution with zero mean vector and identity covariance matrix. Define \( Y = \log_e \sqrt{X^T X} \) and let \( M_Y(t) \) denote the moment generating function of \( Y \) at \( t \), \( t>-10 \). Then \( M_Y(2) \) equals _________ (answer in integer).
Let \( x_1 = 0, x_2 = 1, x_3 = 1, x_4 = 1, x_5 = 0 \) be observed values of a random sample of size 5 from \( {Bin}(1, \theta) \) distribution, where \( \theta \in (0, 0.7] \). Then the maximum likelihood estimate of \( \theta \) based on the above sample is ________ (rounded off to two decimal places).
An electricity utility company charges ₹7 per kWh. If a 40-watt desk light is left on for 10 hours each night for 180 days, what would be the cost of energy consumption? If the desk light is on for 2 more hours each night for the 180 days, what would be the percentage-increase in the cost of energy consumption?
In the context of the given figure, which one of the following options correctly represents the entries in the blocks labelled (i), (ii), (iii), and (iv), respectively?

A bag contains Violet (V), Yellow (Y), Red (R), and Green (G) balls. On counting them, the following results are obtained:
(i) The sum of Yellow balls and twice the number of Violet balls is 50.
(ii) The sum of Violet and Green balls is 50.
(iii) The sum of Yellow and Red balls is 50.
(iv) The sum of Violet and twice the number of Red balls is 50.
Which one of the following Pie charts correctly represents the balls in the bag?