Statistics I Midterm Practice Exam 3
Total: 20 Questions
Question 1
As a general rule, most data on monetary values and those coming from physical measurements (e.g., lira, gold price, centimeters, kilograms) are _____ variables.
Which of the following options fills the blank in the above sentence in the most correct way?
Interval-scale |
Nominal-scale |
Ratio-scale |
Ordinal-scale |
Categorical-scale |
As a general rule, most data on monetary values and those coming from physical measurements (e.g., lira, gold price, centimeters, and kilograms) are ratio-scale variables.
Question 2
The frequency distribution table of the performance scores of a school’s students was constructed as follows. What is the proportion of students whose score is under 80?
0.88 |
0.84 |
0.80 |
0.73 |
0.70 |
The proportion of students whose score is under 80 is 0.88.
Question 3
Retrieved from https://ourworldindata.org/quality-of-education
The following plot represents the relation between the PISA reading scores and the United Nations' Human Development Index (HDI) for a select group of countries.
According to the scatterplot above, and considering the straight line that indicates the relationship, which one of the following is true?
The straight line tends to have a positive slope |
The straight line tends to have a negative slope |
No relationship can be claimed between the two sets of data |
The straight line indicates a negative correlation between the two sets of data. |
There is no outlier in the data for statistical analysis. |
A scatter plot can also be used with a straight line to indicate the correlation between two sets of data. A regression line added to a scatter plot gives a very good indication of the direction of the relationship between two variables. Here the values of both variables increase together, so there is a positive relationship between them. Thus, the straight line tends to have a positive slope.
Question 4
Which of the following is an internal data source for the firm?
Reference books |
Newspapers |
Sectoral magazines |
Statistics published by governments |
Firm’s accounting records |
We can obtain some data from an internal data source, such as an organization’s operating and accounting records. These routine data are usually saved in computer data files or databases for efficient entry, storage, and retrieval of information. Internal data are obtained from inside the company to support successful operations, and the information they provide is important for determining company strategy. We usually obtain data from external data sources. An external data source may be a reference book or statistical periodical published by a government agency, a trade association, or a private service company.
Question 5
I. The main criteria for selecting a sample will be that the sample is representative of the population and that there is no or very little subjectivity in the choice of the sampling units.
II. Sampling is not only conducted by survey researchers on human populations, but also by auditors on a company’s accounts, by agricultural researchers on different pieces of land, and by quality control inspectors on products in a factory, to name only a few examples.
III. Data come in the form of numbers as well as text.
IV. All observations can be reduced to some numerical quantity.
V. The way the sample is collected is crucial to obtaining a valid estimate.
Which of the above are correct?
I and II |
I, II and III |
III, IV and V |
I, II, IV and V |
I, II, III, IV and V |
Data come in the form of numbers as well as text, which is something we are discovering more and more. We live in an information world and information is the virtual gold of our society. All observations can be reduced to some numerical quantity, and there are even fields of digital philosophy and digital sociology. (Page 6)
…The key to all of the above is the phrase: “a sample of about 1000 carefully selected people”. The way the sample is collected is crucial to obtaining a valid estimate, and this is an important subject which will be dealt with in this course. The main criteria for selecting a sample will be that the sample is representative of the population and that there is no or very little subjectivity in the choice of the sampling units. Sampling is not only conducted by survey researchers on human populations, but also by auditors on a company’s accounts, by agricultural researchers on different pieces of land, and by quality control inspectors on products in a factory, to name only a few examples. (Page 7)
As the passages above show, all five statements are correct, so the answer is E.
Question 6
I. It is the best measure of central tendency for nominal data.
II. It is the middle value of an ordered dataset.
III. It can be preferred when the distribution is skewed.
Which of the above are true about the median?
Only II |
I & II |
I & III |
II & III |
I, II & III |
The median cannot be used for nominal data sets.
It is the middle value of an ordered data set and is preferred when there are outliers or the distribution is skewed.
The correct answer is D.
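To see why the median is preferred for skewed data, a minimal Python sketch can compare it with the mean on a small data set containing one extreme value. The figures are hypothetical and chosen only to make the effect visible.

```python
from statistics import mean, median

# Hypothetical weekly incomes with one extreme value (right-skewed).
incomes = [20, 22, 25, 27, 30, 35, 400]

# The median is the middle value of the ordered data set.
middle = median(incomes)

# The mean is pulled toward the extreme value.
average = mean(incomes)

print(f"median = {middle}, mean = {average:.2f}")
```

In this made-up data set the single large value drags the mean far above most observations, while the median stays at the middle of the ordered values.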
Question 7
I. A market researcher’s survey of customers
II. A teacher’s qualitative inquiry into motivation
III. A psychologist’s measurement of addiction level
Which of the above is/are a basic interest of statistics?
Only II |
I and II |
I and III |
II and III |
I, II and III |
Statistics mainly works with numerical data. Qualitative inquiry is not an interest of statistics.
Question 8
Which of the following can be used to display the trends in continuous data over a period of time?
Pie chart |
Line chart |
Frequency polygon |
Bar chart |
Stem and leaf display |
A line chart is often used to display trends in continuous data over a period of time. A line chart also works well with discrete (ordered) or categorical types of data.
Question 9
"It is almost the easiest of the graphs. It can be drawn by hand easily while collecting data. It will be very useful when the number of objects in our study is rather small such as up to 50 observations. It is generally used to investigate univariate (quantitative) data, but sometimes it is used to compare two variables. Essentially it is a one-dimensional scatterplot of observed values of a variable." Which type of graphic is described in the paragraph above?
Pie chart |
Line chart |
Dot plot |
Histogram |
Bar chart |
The dot plot is almost the easiest of the graphs. It can be drawn by hand easily while collecting data. A dot plot is very useful when the number of objects in the study is rather small, such as up to 50 observations. It is generally used to investigate univariate (quantitative) data, but sometimes it is used to compare two variables. Essentially, a dot plot is a one-dimensional scatterplot of the observed values of a variable. The correct answer is C.
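The idea of a dot plot as a one-dimensional display of observed values can be sketched in a few lines of Python. The function name `dot_plot` and the sample scores are illustrative assumptions, and the plot is drawn sideways (one row per value) to keep the text output simple.

```python
from collections import Counter

def dot_plot(values):
    """Return a text dot plot: one row per integer value, one '*' per observation."""
    counts = Counter(values)
    rows = []
    for v in range(min(counts), max(counts) + 1):
        # Values that never occur still get a row, so gaps stay visible.
        rows.append(f"{v:>3} | {'*' * counts[v]}".rstrip())
    return "\n".join(rows)

# Hypothetical quiz scores, invented for illustration.
scores = [2, 3, 3, 5, 5, 5]
plot = dot_plot(scores)
print(plot)
```

Each observation contributes one mark, so clusters, gaps, and possible outliers can be read directly from the rows.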
Question 10
Which statement below is correct about the table above?
"Gender" is quantitative data. |
"Stephen" is a data set. |
"Ronnie" is qualitative data. |
"Age" is qualitative data. |
"Susan" is an element. |
A data set is a collection of facts aggregated for a specific purpose. Elements are the entities on which the data are collected. In the table above, an element of the data set is a particular worker. For instance, worker "Ronnie" is an element of the data set. Age is a quantitative variable because it takes on numerical measurements. However, gender is a qualitative variable because its outcomes are nonnumeric. Of the four variables in the data set in the table above, two are qualitative (Name and Gender) and two are quantitative (Age and Weekly Wage).
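The distinction between elements and variables can be sketched with a small Python structure. The table itself is not reproduced here, so every value below is a hypothetical placeholder; only the worker names and the four variable names come from the text. Classifying a variable by its Python type is a simplifying assumption and would misclassify categories stored as numeric codes.

```python
# Each dictionary is one element (a worker); each key is a variable.
# All values are hypothetical placeholders, not the original table.
workers = [
    {"Name": "Ronnie", "Gender": "M", "Age": 34, "Weekly Wage": 520.0},
    {"Name": "Stephen", "Gender": "M", "Age": 41, "Weekly Wage": 610.0},
    {"Name": "Susan", "Gender": "F", "Age": 29, "Weekly Wage": 580.0},
]

def variable_type(values):
    """Label a variable quantitative if every observation is numeric."""
    numeric = all(isinstance(v, (int, float)) for v in values)
    return "quantitative" if numeric else "qualitative"

# Collect the observations for each variable and classify them.
types = {var: variable_type([w[var] for w in workers]) for var in workers[0]}

for var, kind in types.items():
    print(f"{var}: {kind}")
```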
Question 11
Which one of the following is an example of a measure of central tendency?
Variance |
Mode |
Range |
Standard deviation |
Correlation coefficient |
Central tendency is defined as “the tendency of data to cluster around some random variable value”. The position of the central value is measured by using central tendency measures such as arithmetic mean, median and mode. There are several names used to refer to central tendency in statistics such as “center of the distribution”, “central location”, “representative values”, “central position”, or “measures of location”.
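A short Python sketch can make the contrast between the answer options concrete: the mean, median, and mode describe where the data are centered, while the range, variance, and standard deviation describe how spread out they are. The data set is hypothetical.

```python
from statistics import mean, median, mode, pstdev, pvariance

# Hypothetical data set, invented for illustration.
data = [2, 3, 3, 5, 7]

# Measures of central tendency: where the data cluster.
center = {"mean": mean(data), "median": median(data), "mode": mode(data)}

# Measures of dispersion: how far the data spread around the center.
spread = {
    "range": max(data) - min(data),
    "variance": pvariance(data),
    "standard deviation": pstdev(data),
}

print(center)
print(spread)
```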
Question 12
I. The observations made on the variables constitute the data.
II. The subjects or individuals or companies on which these observations are made are called cases.
III. Categorical variables and data can be either nominal or ordinal.
IV. Continuous variables and data can be either interval-scale or ratio-scale.
Which of the above are correct?
I and II |
I and III |
I, II and III |
II, III and IV |
I, II, III and IV |
The easiest form of data is called categorical, or qualitative, for example data on variables “country” (e.g., the data observation might be Germany) or “question response” (e.g., believe that climate change is manmade) or “exam grade” (e.g., B). Categorical variables and data can be either nominal or ordinal. The question about climate change, with possible responses “natural”, “manmade” or “don’t know/can’t answer” is a nominal categorical variable, as is the variable “country” - there is no ordering in the categories of these variables. By contrast, exam grade is an ordinal categorical variable, since its categories are ordered: A is better than a B, B is better than a C, and so on.
Other examples of nominal categorical variables are gender, region of residence, field of study, type of transport, type of housing, etc.
Other examples of ordinal categorical variables are income group (if incomes have been categorized), an attitude question in a survey where possible responses are strongly agree/agree/disagree/strongly disagree (these categories have an order), social class (with classes usually in an inherent order), terrorist threat levels (in the UK these are low/moderate/substantial/severe/critical), etc.
The other main type of data (see Fig. 1.4) is called continuous, or quantitative, for example data on variables “blood pressure”, “age” and “income”. These are observations of variables on continuous scales, usually rounded in some convenient way. For example, although age is a continuous time variable, and we are getting older all the time by seconds, minutes and hours, someone’s age is almost always rounded to the number of years completed. There is a subtle difference between interval-scale and ratio-scale continuous data, which is worth mentioning here. Age is an interval-scale variable: to compare two children of ages 10 and 12, we would compute the interval difference, i.e. 2 years. We would not say the 12-year old is 20% older than the 10-year old. But comparing prices or incomes, for example, we would tend to compute percentage differences, making them ratio-scale variables. A good example is the inflation rate, comparing the prices of a basket of products over time, not as a difference but as a percentage. As a general rule, most data on monetary values and those coming from physical measurements (e.g., lira, gold price, centimeters, kilograms) are ratio-scale variables.
As the passages above show, all four statements are correct, so the answer is E.
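The two kinds of comparison described in the explanation above, an interval difference for ages and a percentage difference for prices, can be written out as a brief Python sketch. The ages come from the text; the prices are hypothetical.

```python
# Ages from the text: compared by their interval difference.
age_a, age_b = 10, 12
age_difference = age_b - age_a  # in years

# Hypothetical basket prices: compared by their percentage difference.
price_old, price_new = 200.0, 250.0
pct_change = (price_new - price_old) / price_old * 100

print(f"age difference = {age_difference} years")
print(f"price change = {pct_change:.1f} %")
```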
Question 13
Which of the following is not true about the scatter plot?
Scatter plot is used to investigate the change of a variable over time. |
To construct a scatter plot, two data sets or variables are needed, usually, these two data sets or variables are named as X and Y. |
The pair of the data point for a specific observation, (X, Y), is represented by a dot or a symbol of convenience. |
Scatter plot gives a good indication of the correlation between two variables. |
Scatter plot is a good indicator of the value of the correlation coefficient. |
A scatter plot is used to investigate the relationship between two variables. It is also very helpful for indicating the minimum, maximum, or outliers of the variables.
Question 14
Which one below is a term that has the same meaning as the term Statistics?
Data Science |
Database Management |
Data Visualization |
Computer Science |
Analytics |
At the start of this Introduction we asked “What is Statistics? What is Data Science? What is Analytics?” To deal with Analytics first, this is a term now used in business circles as a substitute for the word Statistics, but it really means the same thing. The word Statistics is considered by some people, especially businessmen, as a bit old-fashioned, and sometimes even difficult to pronounce! But don’t be fooled: Analytics is a fancy word for Statistics.
When it comes to Data Science, however, the term does have some different meaning. Data Science is a field that includes Statistics as well as areas such as Computer Science, Database Management and Data Visualization, for example, and has come into being mainly as a result of the spectacular growth in the amount of available data in this new information world that we live in.
Question 15
Which type of response does the question above require?
Open-ended response |
Multiple response |
Ranked response |
Rated response |
Clarity response |
Rated responses generally include three-point, five-point, and seven-point scales. A rating scale should provide more than two options. The most commonly used rating scale is the five-point Likert (1932) type scale. A Likert-type scale can be designed in the following forms.
- Strongly Agree - Agree - Undecided / Neutral - Disagree - Strongly Disagree
- Always - Often - Sometimes - Seldom - Never
- Extremely - Very - Moderately - Slightly - Not at all
- Excellent - Above Average - Average - Below Average - Very Poor
Question 16
What is the arithmetic mean of the following data set with weights in parentheses?
60 (10 %), 50 (15 %), 70 (25 %), 90 (30 %), 80 (40 %)
50 |
30 |
25 |
20 |
18 |
(60 x 10 % + 50 x 15 % + 70 x 25 % + 90 x 30 % + 80 x 40 %) / 5 = (6 + 7.5 + 17.5 + 27 + 32) / 5 = 90 / 5 = 18. pg. 85. Correct answer is E.
Question 17
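The arithmetic in the solution above can be checked with a short Python sketch. It follows the answer key's calculation (the sum of value times weight, divided by the number of values); the variable names are illustrative. As an observation rather than part of the key, the stated weights add up to 120 %, so the sketch also prints what dividing by the total weight would give, for comparison only.

```python
# (value, weight in percent) pairs from the question
data = [(60, 10), (50, 15), (70, 25), (90, 30), (80, 40)]

# Sum of value x weight, kept in whole percentage points to avoid rounding
total_points = sum(value * pct for value, pct in data)

# Weighted sum with the weights expressed as fractions
weighted_sum = total_points / 100

# The answer key divides the weighted sum by the number of values
key_answer = weighted_sum / len(data)

# For comparison only: dividing by the total weight instead
total_pct = sum(pct for _, pct in data)
normalized = total_points / total_pct

print(weighted_sum, key_answer, total_pct, normalized)
```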
We ask the students to number the most important language skill for them in their academic classes as 1 and the least important one as 2. Which option below best describes this measurement?
Ordinal scale |
Nominal scale |
Interval Scale |
Ratio Scale |
None of the above |
Ordinal scales of measurement have the property of both classifying and magnitude. Subjects are categorized into different rank ordered groups. Each value on the ordinal scale has a unique meaning, and it has an ordered relationship to every other value on the scale. Suppose we want to measure customers’ preferences for five brands of chocolates, brands A, B, C, D, and E. We could ask each customer to rank order the five brands by assigning number 1 to the most preferred brand, number 2 to the next most preferred brand, and so on.
Question 18
What is the geometric mean of the following data set?
64, 125, 216
96 |
105 |
120 |
128 |
144 |
(64 x 125 x 216)^(1/3) = 4 x 5 x 6 = 120. pg. 92. Correct answer is C.
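The cube-root calculation above can be verified with Python's standard library. This is a minimal check using `statistics.geometric_mean` (available from Python 3.8); because the result is a floating-point number, it is compared with 120 up to a small tolerance.

```python
from statistics import geometric_mean

# Data set from the question
values = [64, 125, 216]

# Geometric mean: the n-th root of the product of n values
gm = geometric_mean(values)

# Each value is a perfect cube (4^3, 5^3, 6^3), so the result is 4 x 5 x 6.
print(round(gm, 6))
```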
Question 19
I. Any person has free access to data in the world
II. Data may be in the form of text or numbers
III. All observations can be reduced to some numerical quantity
Which of the above are true of data?
Only I |
I and II |
I and III |
II and III |
I, II and III |
Data come in the form of numbers as well as text, which is something we are discovering more and more. We live in an information world and information is the virtual gold of our society. All observations can be reduced to some numerical quantity, and there are even fields of digital philosophy and digital sociology. It may seem that everything may be recorded and stored somewhere. But in reality - unless we somehow centralize and link all the databases in the world, and have free access to them - we can get access to only a small part of whatever data we are interested in.
Question 20
Which of the following refers to 'the average of a set of numbers in the data'?
Median |
Mode |
Mean |
Central tendency |
Ratio |
'Mean' refers to the average of a set of numbers in the data. The correct answer is C.