Statistics I Midterm Practice Exam 7
20 Questions in Total
Question 1
What is the middle value of an ordered dataset called?
Median |
Ratio |
Tendency |
Mode |
Mean |
The median is the middle value of an ordered dataset. The correct answer is A.
Question 2
Which of the following is not a variable?
exam grade |
words in sentence |
your Anadolu University student number |
radio waves |
class |
Your Anadolu University student number is data, not a variable. pg. 14. Correct answer is C.
Question 3
Is your answer to this question a variable?
Maybe |
Sometimes |
It depends on time |
No |
Yes |
No, it is data. pg. 14. Correct answer is D.
Question 4
What is the median of the following data set, with frequencies in parentheses?
12 (2), 32 (4), 35 (3), 43 (3), 92 (2)
12 |
17 |
32 |
35 |
52 |
35: the median is the (14 + 1) / 2 = 7.5th observation, i.e. the average of the 7th and 8th observations: (35 + 35) / 2 = 35.
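The positional rule used above can be checked with a minimal sketch in Python. The variable names are illustrative only; the data are the values and frequencies given in the question.

```python
# Value -> frequency table taken from the question.
freq = {12: 2, 32: 4, 35: 3, 43: 3, 92: 2}

# Expand the table into an ordered list of individual observations.
observations = [value for value in sorted(freq) for _ in range(freq[value])]
n = len(observations)

# Odd n: take the middle observation.
# Even n: average the two observations around position (n + 1) / 2.
if n % 2 == 1:
    median = observations[n // 2]
else:
    median = (observations[n // 2 - 1] + observations[n // 2]) / 2

print(n, median)
```

With 14 observations, the 7th and 8th values are averaged, which matches the reasoning in the solution.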
Question 5
Which one below is NOT correct about histograms?
Histograms will help us to identify the center, shape and symmetry of the data. |
A histogram can tell us about the peaks and extreme values. |
A histogram can be used to check normality. |
You can think of histograms as bar plots of grouped frequency distributions. |
Histograms are drawn for qualitative data but not for continuous data. |
A histogram is a graph that is very similar to a bar chart, except that bar charts are drawn for qualitative data whereas histograms are drawn for continuous data. Histograms help us to identify the center, shape and symmetry of the data. A histogram can tell us about the peaks and extreme values, and whether the distribution of the data is skewed to the left, skewed to the right, bell-shaped, uniform or bimodal. A histogram can also be used to check normality.
Question 6
What is the mode of the following data set: 1,2,2,2,3,3,3,4,4,4,4,4,4,4,4,5,5,6,6,7,9
1 |
2 |
3 |
4 |
5 |
The most repeated value is 4, therefore the mode is 4.
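As a quick check, the mode can be found by counting how often each value occurs. The sketch below uses `collections.Counter` from the Python standard library; the variable names are illustrative.

```python
from collections import Counter

# Data set from the question.
data = [1, 2, 2, 2, 3, 3, 3, 4, 4, 4, 4, 4, 4, 4, 4, 5, 5, 6, 6, 7, 9]

# Count occurrences of each value and take the most frequent one.
counts = Counter(data)
mode, frequency = counts.most_common(1)[0]

print(mode, frequency)
```

Note that `most_common(1)` returns a single value even when several values tie for the highest count, so a data set with more than one mode would need the full counts to be inspected.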
Question 7
Which of the following is more useful to discover the overall shape of the data?
Pie chart |
Histogram |
Dot plot |
Line chart |
Frequency polygon |
Frequency polygons are useful for discovering the overall shape of the data (is it symmetric, or is there any asymmetry?). To create a frequency polygon, we plot the midpoint of each bin (class) of the histogram against the frequency of that bin. Each midpoint is marked by a dot within its class interval, and the dots are then connected by straight lines.
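The construction described above can be sketched in a few lines of Python. The class boundaries and frequencies below are hypothetical values chosen only for illustration; they do not come from the exam.

```python
# Hypothetical class boundaries and class frequencies (illustration only).
edges = [20, 25, 30, 35, 40]
frequencies = [3, 8, 6, 2]

# The midpoint of each class is the average of its lower and upper boundary.
midpoints = [(lower + upper) / 2 for lower, upper in zip(edges, edges[1:])]

# Each (midpoint, frequency) pair is one dot of the frequency polygon;
# joining consecutive dots with straight lines gives the polygon itself.
points = list(zip(midpoints, frequencies))

print(points)
```

Passing the midpoints and frequencies to any line-plotting routine would then draw the polygon.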
Question 8
Which of the following is not an ordinal variable?
Type of housing |
Income group |
Exam grade |
Educational level |
An attitude question in a survey where possible responses are agree / disagree. |
Categorical variables and data can be either nominal or ordinal. Examples of nominal categorical variables are gender, region of residence, field of study, type of transport, type of housing, etc. There is no ordering in the categories of these variables. Exam grade is an ordinal categorical variable, since its categories are ordered: an A is better than a B, a B is better than a C, and so on. Other examples of ordinal categorical variables are income group (if incomes have been categorized) and an attitude question in a survey where the possible responses are strongly agree/agree/disagree/strongly disagree (these categories have an order).
Question 9
A data set of a clinic’s patients is given above. Which of the following statements about this table is false?
The weight in the table is a qualitative variable |
In the table, an element of the data set is a particular patient, for example Ahmet |
Age is a variable and takes on different values for different patients |
90 kg is the observation on the variable weight for patient Gökhan |
In the table, the outcomes on the four variables for patient Elif constitute a case |
A data set is a collection of facts aggregated for a specific purpose. Elements are the entities on which the data are collected. In the table, an element of the data set is a particular patient. A variable is a characteristic of interest about an element. This characteristic takes on different values for different elements. Gender and Name in the table are qualitative variables because their outcomes are nonnumeric. Of the four variables in the data set in the table, two are qualitative (Name and Gender) and two are quantitative (Age and Weight). The set of outcomes obtained on all variables for one element in the data set is called a case.
Question 10
85 people were interviewed about their weekly salary. If the frequency for Class 2 is 21, what is the relative frequency as a percentage?
0.247 |
24.7 |
21.3 |
2.13 |
0.85 |
Interpreting a frequency distribution table can be easier and clearer when we use the percentage of the frequency, which can also be displayed in the table itself. The percentage of frequency is called the relative frequency, and the resulting table is called a relative frequency distribution table. Relative frequency can be used for both quantitative and qualitative variables. The relative frequency for a class is calculated as follows: Relative Frequency = fi / n, where fi is the frequency for class i and n is the sample size. In this case, fi is 21 and n is 85, so the relative frequency is 21 / 85 ≈ 0.247, which is 24.7 as a percentage.
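The calculation can be reproduced with a short Python sketch. The names `f_i` and `n` mirror the symbols fi and n in the formula above; the rest is illustrative.

```python
# Values from the question: class frequency and sample size.
f_i = 21
n = 85

# Relative frequency as a proportion, then as a percentage.
relative_frequency = f_i / n
percentage = relative_frequency * 100

print(round(relative_frequency, 3), round(percentage, 1))
```

The proportion (about 0.247) and the percentage (about 24.7) both appear among the options, so it matters that the question asks for the percentage form.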
Question 11
Your professor asked you to write an essay and stated that you would be assessed on the number of words in your essay. What type of numerical data would s/he be working with?
nominal |
ordinal |
interval |
ratio |
qualitative |
Because the difference in the counts of words could be stated in percentages, it is a ratio type of measurement.
Question 12
What is the median of the following data set, with frequencies in parentheses?
16 (3), 33 (1), 36 (2), 48 (2), 66 (4), 92 (2)
26 |
38 |
42 |
54 |
57 |
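57: the solution takes n = 16, so the median is the (16 + 1) / 2 = 8.5th observation, i.e. the average of the 8th and 9th observations: (48 + 66) / 2 = 57. (The frequencies as listed sum to 14, which would instead place the median at the 7.5th observation, 48; that value is not among the options.) pg. 81. Correct answer is E.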
Question 13
A researcher collects data about the weight of pupils in a school. There are 500 students in the school, and their weights range from 20 kg to 40 kg. What is the class width in this case?
0.04 kg |
0.5 kg |
0.89 kg |
1 kg |
2 kg |
Class Width = Range / Number of Classes
where Number of Classes = √n and n = number of observations.
Thus Number of Classes = √500 ≈ 22.36
But when the number of classes is larger than 20, we take it as 20. So in this case:
Class Width = (40 - 20) / 20 = 20 / 20 = 1 kg
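The same steps can be followed in a minimal Python sketch. The square-root rule and the cap of 20 classes are taken from the solution above; the variable names are illustrative.

```python
import math

# Values from the question.
n = 500             # number of observations
low, high = 20, 40  # smallest and largest weight in kg

# Square-root rule for the number of classes, capped at 20 as in the solution.
max_classes = 20
number_of_classes = min(math.sqrt(n), max_classes)

# Class width = range / number of classes.
class_width = (high - low) / number_of_classes

print(round(math.sqrt(n), 2), number_of_classes, class_width)
```

Without the cap, dividing the range by roughly 22.36 classes would give a width a little under 0.9 kg, which is why the cap matters for reaching the 1 kg answer.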
Question 14
- Usually used to represent categorical data
- Divides categories as sectors
- Each sector's size shows the proportion of each category to the total
Which of the following do these features belong to?
Simple bar chart |
Stacked bar chart |
Grouped bar chart |
Pie chart |
Line chart |
A pie chart is usually used for categorical data. In a pie chart, the components or outcomes of a total frequency are shown as sectors of a circle. The shape resembles a pie, hence the name of the chart. In a pie chart, the categories are divided into slices (sectors), and the size of each slice is proportional to that category's share of the total.
Question 15
- Used to represent continuous data
- Usually used when the sample is large
- Columns are adjacent to each other
Which of the following visualization types do these features represent?
Stem-and-leaf |
Bar chart |
Histogram |
Pie chart |
Line chart |
A histogram is a graph that is very similar to a bar chart, except that bar charts are drawn for qualitative data whereas histograms are drawn for continuous data. To draw the histogram of a data set, we usually need a large sample. Recall from previous chapters that the data were classified into grouped frequency distributions; you can think of histograms as bar plots of grouped frequency distributions. If you create a grouped frequency distribution of the data, you can easily create the histogram of the same data. Similarly, by looking at a histogram one can easily create the grouped frequency distribution of the data. In bar charts the columns/bars are separated from each other by a convenient distance, whereas in a histogram the columns/bars are adjacent to each other.
Question 16
Which of the following matches is wrong?
Nominal-Country |
Ordinal-Olympic Gold Medalist |
Interval-Temperature |
Continuous-Age |
Interval-Time to run 100 meters |
Age is an interval-scale variable: to compare two children of ages 10 and 12, we would compute the interval difference, i.e. 2 years. We would not say the 12-year-old is 20% older than the 10-year-old.
Question 17
Which is NOT TRUE about the table below?
There are three categorical variables. |
It is a contingency table of the variables gender and excel knowledge. |
We may conclude that the number of males who have technical knowledge is greater than the number of females who do. |
It is used to determine if one categorical variable is related to another categorical variable. |
It excludes relative frequencies or percentages. |
In this table, there are two categorical variables. One is the variable gender which has two categories, female and male. The other categorical variable is the variable excel knowledge which has two categories, yes and no. Thus, there are two categorical variables.
If a data set includes two different categorical variables, we use a two-way table (contingency table) to demonstrate the relationship and interaction between the two categorical variables. A two-way table of counts organizes data about two categorical variables measured from the same set of individuals. A contingency table is a special type of frequency distribution table in which two variables are shown simultaneously, and it is used to determine if one categorical variable is related to another categorical variable.
Question 18
Which of the following is not data?
Eskişehir |
Eskişehir train station |
Eskişehir's pirate ship |
Eskişehir's beach's users |
Anadolu University airport |
Eskişehir's beach's users is a variable, not data. pg. 14. Correct answer is D.
Question 19
According to the contingency table below, what are the percentages of males and females, respectively, among the people who do not have technical knowledge (No)?
75%, 25% |
35%, 65% |
50%, 50% |
40%, 60% |
20%, 80% |
When we create the contingency table with row and column percentages, the percentages of males and females among the people who do not have technical knowledge (No) are 50% and 50%, respectively.
Question 20
What is the mode of the following data set?
10, 19, 11, 26, 26, 26, 18, 20, 35, 99, 11, 14, 18, 18, 26, 20, 48
10 |
26 |
35 |
48 |
99 |
26: it occurs most often, appearing 4 times.