Chi-squared test of independence

All Questions
gaokao 2020 Q18 12 marks
A student interest group randomly surveyed the air quality level and the number of people exercising in a certain park on each of 100 days in a certain city. The organized data is shown in the table below (unit: days):
\backslashbox{Air Quality Level}{Number of Exercisers}[0, 200](200, 400](400, 600]
1 (Excellent)21625
2 (Good)51012
3 (Slight Pollution)678
4 (Moderate Pollution)720

(1) Estimate the probability that the air quality level on a given day in the city is 1, 2, 3, or 4 respectively;
(2) Find the estimated value of the average number of people exercising in the park on a given day (use the midpoint of each interval as the representative value for data in that interval);
(3) If the air quality level on a given day is 1 or 2, the day is called ``good air quality''; if the air quality level is 3 or 4, the day is called ``poor air quality''. Based on the given data, complete the following $2 \times 2$ contingency table, and based on the contingency table, determine whether there is 95\% confidence to conclude that the number of people exercising in the park on a given day is related to the air quality level of the city on that day?
gaokao 2022 Q20 12 marks
20. (12 points)
A medical team conducted a study on the relationship between a certain endemic disease in a region and the hygiene habits of local residents (hygiene habits are classified as either good or not sufficiently good). Among patients with the disease, 100 cases were randomly surveyed (called the case group), and among people without the disease, 100 people were randomly surveyed (called the control group). The following data were obtained:
Not Sufficiently GoodGood
Case Group4060
Control Group1090

(1) Can we conclude with 99\% confidence that there is a difference in hygiene habits between the group with the disease and the group without the disease?
(2) From the population of the region, one person is randomly selected. Let $A$ denote the event ``the selected person has not sufficiently good hygiene habits'' and $B$ denote the event ``the selected person has the disease''. The ratio $\frac { P ( B \mid A ) } { P ( \bar { B } \mid A ) }$ to $\frac { P ( B \mid \bar { A } ) } { P ( \bar { B } \mid \bar { A } ) }$ is a measure of the risk level of the disease associated with not sufficiently good hygiene habits. Let this measure be denoted as $R$.
(i) Prove that $R = \frac { P ( A \mid B ) } { P ( \bar { A } \mid B ) } \
gaokao 2025 Q15 13 marks
To study the relationship between a certain disease and ultrasound examination results, 1000 people who had undergone ultrasound examination were randomly surveyed, yielding the following contingency table:
\backslashbox{Category}{Ultrasound Result}NormalAbnormalTotal
Has Disease20180200
No Disease78020800
Total8002001000

(1) Let $P$ denote the probability that a person with abnormal ultrasound results has the disease. Find the estimated value of $P$.
(2) Based on the significance level $\alpha = 0.001$ for independence testing, analyze whether the ultrasound examination result is related to having the disease.
Attachment: $\chi^2 = \frac{n(ad - bc)^2}{(a+b)(c+d)(a+c)(b+d)}$,
$P(\chi^2 \geq k)$0.0050.0100.001
$k$3.8416.63510.828
gaokao 2025 Q15 13 marks
(13 points) To study the relationship between a certain disease and ultrasound examination results, 1000 people who had undergone ultrasound examination were randomly surveyed, yielding the following contingency table:
NormalAbnormalTotal
Has disease20180200
Does not have disease78020800
Total8002001000

(1) Let $p$ denote the probability that a person with abnormal ultrasound examination results has the disease. Find the estimated value of $p$.
(2) Based on the significance level $\alpha = 0.001$ for the independence test, analyze whether the ultrasound examination result is related to having the disease. Attachment: $\chi^2 = \frac{n(ad - bc)^2}{(a+b)(c+d)(a+c)(b+d)}$,
$P(\chi^2 \geq k)$0.0500.0100.001
$k$3.8416.63510.828
.