In our day to day life, we can collect the following data.
1. Number of females per 1000 males in various states of our country
2. Weights of students of our class
3. Production of wheat in the last 10 years in our country
4. Number of plants in our locality
5. Rainfall in our city in the last 10 years
By: Admin
The information which is collected by the investigator himself with a definite objective in his mind is called as primary data
when the information is gathered from a source which already had the information stored, it is called as secondary data.
It can be observed that the data in 1, 3, and 5 is secondary data and the data in 2 and 4 is primary data.
By: Admin
The frequency means the number of students having same blood group.
It can be observed that 9 students have their blood group as A,
6 as B,
3 as AB, and
12 as O.
Therefore, the blood group of 30 students of the class can be represented as follows.
Blood group
Number of students
A
9
B
6
AB
3
O
12
Total
30
It can be observed clearly that
Most common Blood Group (Highest frequency): O -(maximum number of students)
Rarest Blood Group (Lowest frequency): AB - (minimum number of students)
By: Admin
It is given that a grouped frequency distribution table of class size 5 has to be constructed. Therefore, the class intervals will be 0 - 5, 5 - 10, 10 - 15, 15 - 20…
By observing the data given as above, a grouped frequency distribution table can be constructed as follows.
Distance (in km)
Tally mark
Number of engineers
0 - 5
5
5 - 10
11
10 - 15
11
15 - 20
9
20 - 25
1
25 - 30
1
30 - 35
2
Total
40
It can be observed that there are very few engineers whose homes are at more than or equal to 20 km distance from their work place. Most of the engineers have their workplace up to 15 km distance from their homes.
By: Admin
(i) A grouped frequency distribution table of class size 2 has to be constructed. The class intervals will be 84 - 86, 86 - 88, and 88 - 90…
By observing the data given above, the required table can be constructed as follows.
Relative humidity (in %)
Number of days (frequency )
84 - 86
1
86 - 88
1
88 - 90
2
90 - 92
2
92 - 94
7
94 - 96
6
96 - 98
7
98 - 100
4
Total
30
(ii) It can be observed that the relative humidity is high. Therefore, the data is about a month of rainy season.
(iii) Range of data = Maximum value - Minimum value
= 99.2 - 84.9 = 14.3
By: Admin
(i) A grouped frequency distribution table has to be constructed taking class intervals 160 - 165, 165 - 170, etc. By observing the data given above, the required table can be constructed as follows.
Height (in cm)
Number of students (frequency )
150 - 155
12
155 - 160
9
160 - 165
14
165 - 170
10
170 - 175
5
Total
50
(ii) It can be concluded that more than 50% of the students are shorter than 165 cm.
By: Admin
Taking class intervals as 0.00, - 0.04, 0.04, - 0.08, and so on, a grouped frequency table can be constructed as follows.
Concentration of SO2(in ppm)
Number of days (frequency )
0.00 - 0.04
4
0.04 - 0.08
9
0.08 - 0.12
9
0.12 - 0.16
2
0.16 - 0.20
4
0.20 - 0.24
2
Total
30
The number of days for which the concentration of SO2 is more than 0.11 is the number of days for which the concentration is in between 0.12 - 0.16, 0.16 - 0.20, 0.20 - 0.24.
Required number of days = 2 + 4 + 2 = 8
Therefore, for 8 days, the concentration of SO2 is more than 0.11 ppm.
By: Admin
By observing the data given above, the required frequency distribution table can be constructed as follows.
Number of heads
Number of times (frequency)
0
6
1
10
2
9
3
5
Total
30
By: Admin
(i) By observation of the digits after decimal point, the required table can be constructed as follows.
Digit
Frequency
0
2
1
5
2
5
3
8
4
4
5
5
6
4
7
4
8
5
9
8
Total
50
(ii) It can be observed from the above table that the least frequency is 2 of digit 0, and the maximum frequency is 8 of digit 3 and 9. Therefore, the most frequently occurring digits are 3 and 9 and the least frequently occurring digit is 0.
By: Admin
(i) Our class intervals will be 0 - 5, 5 - 10, 10 - 15…..
The grouped frequency distribution table can be constructed as follows.
Hours
Number of children
0 - 5
10
5 - 10
13
10 - 15
5
15 - 20
2
Total
30
(ii) The number of children who watched TV for 15 or more hours a week is 2 (i.e., the number of children in class interval 15 - 20).
By: Admin
A grouped frequency table of class size 0.5 has to be constructed, starting from class interval 2 - 2.5.
Therefore, the class intervals will be 2 - 2.5, 2.5 - 3, 3 - 3.5…
By observing the data given above, the required grouped frequency distribution table can be constructed as follows.
Lives of batteries (in years)
Number of batteries
2 - 2.5
2
2.5 - 3.0
6
3.0 - 3.5
14
3.5 - 4.0
11
4.0 - 4.5
4
4.5 - 5.0
3
Total
40
By: Admin
(i) By representing causes on x-axis and family fatality rate on y-axis and choosing an appropriate scale (1 unit = 5% for y axis), the graph of the information given above can be constructed as follows.
All the rectangle bars are of the same width and have equal spacing between them.
(ii) Reproductive health condition is the major cause of women's ill health and death worldwide as 31.8% of women are affected by it.
(iii) The factors are as follows.
1. Lack of medical facilities
2. Lack of correct knowledge of treatment
By: Admin
(i) By representing section (variable) on x-axis and number of girls per thousand boys on y-axis, the graph of the information given above can be constructed by choosing an appropriate scale (1 unit = 100 girls for y-axis)
Here, all the rectangle bars are of the same length and have equal spacing in between them.
(ii) It can be observed that maximum number of girls per thousand boys (i.e., 970) is for ST and minimum number of girls per thousand boys (i.e., 910) is for urban.
Also, the number of girls per thousand boys is greater in rural areas than that in urban areas, backward districts than that in non-backward districts, SC and ST than that in non-SC/ST.
By: Admin
(i) By taking polling results on x-axis and seats won as y-axis and choosing an appropriate scale (1 unit = 10 seats for y-axis), the required graph of the above information can be constructed as follows.
Here, the rectangle bars are of the same length and have equal spacing in between them.
(ii) Political party 'A' won maximum number of seats.
By: Admin
(i) It can be observed that the length of leaves is represented in a discontinuous class interval having a difference of 1 in between them. Therefore,
1/2 =0.5 has to be added to each upper class limit and also have to subtract 0.5 from the lower class limits so as to make the class intervals continuous.
Length (in mm)
Number of leaves
117.5 - 126.5
3
126.5 - 135.5
5
135.5 - 144.5
9
144.5 - 153.5
12
153.5 - 162.5
5
162.5 - 171.5
4
171.5 - 180.5
2
Taking the length of leaves on x-axis and the number of leaves on y-axis, the histogram of this information can be drawn as above.
Here, 1 unit on y-axis represents 2 leaves.
(ii) Other suitable graphical representation of this data is frequency polygon.
(iii) No, as maximum number of leaves (i.e., 12) has their length in between 144.5 mm and 153.5 mm. It is not necessary that all have their lengths as 153 mm.
By: Admin
(i) By taking life time (in hours) of neon lamps on x-axis and the number of lamps on y-axis, the histogram of the given information can be drawn as follows.
Here, 1 unit on y-axis represents 10 lamps.
(ii) It can be concluded that the number of neon lamps having their lifetime more than 700 is the sum of the number of neon lamps having their lifetime as 700 - 800, 800 - 900, and 900 - 1000.
Therefore, the number of neon lamps having their lifetime more than 700 hours is 184. (74 + 62 + 48 = 184)
By: Admin
It can be observed that the class intervals of the given data are not continuous. There is a gap of 1 in between them. Therefore, 1/2 =0.5 has to be added to the upper class limits and 0.5 has to be subtracted from the lower class limits.
Also, class mark of each interval can be found by using the following formula.
Class mark
Continuous data with class mark of each class interval can be represented as follows.
Number of balls
Class mark
Team A
Team B
0.5 - 6.5
3.5
2
5
6.5 - 12.5
9.5
1
6
12.5 - 18.5
15.5
8
2
18.5 - 24.5
21.5
9
10
24.5 - 30.5
27.5
4
5
30.5 - 36.5
33.5
5
6
36.5 - 42.5
39.5
6
3
42.5 - 48.5
45.5
10
4
48.5 - 54.5
51.5
6
8
54.5 - 60.5
57.5
2
10
By taking class marks on x-axis and runs scored on y-axis, a frequency polygon can be constructed as follows.
By: Admin
The class intervals in the data is having varying width. We know that the area of rectangle is proportional to the frequencies in the histogram. The class interval with minimum class size 1 is selected and the length of the rectangle is proportionate to it.
Age (in years)
Number of children (frequency)
Width of class
Length of rectangle
1-2
5
1
(5/1)×1 = 5
2-3
3
1
(3/1)×1 = 3
3-5
6
2
(6/2)×1 = 3
5-7
12
2
(12/2)×1 = 6
7-10
9
3
(9/3)×1 = 3
10-15
10
5
(10/5)×1 = 2
15-17
4
2
(4/2)×1 = 2
Taking the age of children on x-axis and proportion of children per 1 year interval on y-axis, the histogram can be drawn
By: Admin
(i) The class intervals in the data is having varying width. We know that the area of rectangle is proportional to the frequencies in the histogram. The class interval with minimum class size 2 is selected and the length of the rectangle is proportionate to it.
The proportion of the surnames per 2 letters interval can be calculated as:
Number of letters
Number of surnames
Width of class
Length of rectangle
1-4
6
3
(6/3)×2 = 4
4-6
30
2
(30/2)×2 = 30
6-8
44
2
(44/2)×2 = 44
8-12
16
4
(16/4)×2 = 8
12-20
4
8
(4/8)×2 = 1
By taking the number of letters on x-axis and the proportion of the number of surnames per 2 letters interval on y-axis and choosing an appropriate scale (1 unit = 4 students for y axis), the histogram can be constructed as follows.
(ii) The class interval in which the maximum number of surnames lies is 6 - 8 as it has 44 surnames in it i.e., the maximum for this data.
By: Admin
We can find the class marks of the given class intervals by using the following formula.
Class mark
Section A
Section B
Marks
Class marks
Frequency
Marks
Class marks
Frequency
0 - 10
5
3
0 - 10
5
5
10 - 20
15
9
10 - 20
15
19
20 - 30
25
17
20 - 30
25
15
30 - 40
35
12
30 - 40
35
10
40 - 50
45
9
40 - 50
45
1
Taking class marks on x-axis and frequency on y-axis and choosing an appropriate scale (1 unit = 3 for y-axis), the frequency polygon can be drawn as follows.
It can be observed that the performance of students of section 'A' is better than the students of section 'B' in terms of good marks.
By: Admin
The number of goals scored by the team is
2, 3, 4, 5, 0, 1, 3, 3, 4, 3
Arranging the number of goals in ascending order,
0, 1, 2, 3, 3, 3, 3, 4, 4, 5
The number of observations is 10, which is an even number. Therefore, median score will be the mean of 10/2 i.e., 5th and 10/2 +1
i.e., 6th observation while arranged in ascending or descending order.
Mode of data is the observation with the maximum frequency in data.
Therefore, the mode score of data is 3 as it has the maximum frequency as 4 in the data.
As the number of observations is 15 which is odd, therefore, the median of data will be 15+1 / 2 = 8th observation whether the data is arranged in an ascending or descending order.
Therefore, median score of data = 52
Mode of data is the observation with the maximum frequency in data. Therefore, mode of this data is 52 having the highest frequency in data as 3.
By: Admin
It can be observed that the total number of observations in the given data is 10 (even number). Therefore, the median of this data will be the mean of 10 / 2 i.e., 5th and 10/2 +1 i.e., 6th observation.
By: Admin
Arranging the data in an ascending order,
14, 14, 14, 14, 17, 18, 18, 18, 22, 23, 25, 28
It can be observed that 14 has the highest frequency, i.e. 4, in the given data. Therefore, mode of the given data is 14.
By: Admin
Salary (xi)
Number of workers (fi)
fixi
3000
16
48000
4000
12
48000
5000
10
50000
6000
8
48000
7000
6
42000
8000
4
32000
9000
3
27000
10000
1
10000
Total
Σfi = 60
Σfixi = 305000
= 5083.33
Therefore, mean salary of 60 workers is Rs 5083.33.
By: Admin
When any data has a few observations such that these are very far from the other observations in it, it is better to calculate the median than the mean of the data as median gives a better estimate of average in this case.
(i) Consider the following example - the following data represents the heights of the members of a family.
154.9 cm, 162.8 cm, 170.6 cm, 158.8 cm, 163.3 cm, 166.8 cm, 160.2 cm
In this case, it can be observed that the observations in the given data are close to each other. Therefore, mean will be calculated as an appropriate measure of central tendency.
(ii) The following data represents the marks obtained by 12 students in a test.
48, 59, 46, 52, 54, 46, 97, 42, 49, 58, 60, 99
In this case, it can be observed that there are some observations which are very far from other observations. Therefore, here, median will be calculated as an appropriate measure of central tendency.
Need more help?
To start up with Doubt classes and coaching with EDUINFY Tutors Feel free to contact us.