What is Statistics? - Primary and Secondary Data

Here we will learn the following concepts in detail:

A brief introduction to statistics
Definition of statistics
What is data
How many types of data
Methods of collection of data
Difference between primary data and secondary data
Representation of data

Introduction to Statistics

In ancient times also statistics were used at that time in India here are some incidents that are proof that at that time statistics also have its importance like

1. In Mahabharatkal during Damayanti’s swaybar Going with king Nal, King Rituparna calculated the exact number of fruits and leaves on the tree based on the sample of a tree.

2. In Chandragupta Maurya’s reign (324-300 BC), the description of registration of birth-death and the arrangement of governance on its basis is found in Kautilya’s Arthashastra.

Many examples of the use of statistics are found in wartime, in the arrangement of government, and even in daily life, when we see in newspapers, electronic media, many channels, magazines, and other communication tools, information about the temperature of the cities, the situation of rainfall from place to place, etc. Factual and comparative information is obtained through data.

We use this data in some way or another throughout our daily lives. so it is necessary for us that get the necessary information from these data.

Definition of Statistics

It is the process of collecting the data, organizing that data, analysis of the data, interpreting of data, and at last, extracting meaningful information from that data.

The statistics word has taken from the Latin word status, the meaning of status is ‘ a (political) state ‘.

What is Data

The collection of facts and information in numerical form in statistics is called data.

On the behalf of collection data, there are two types of data.

Primary Data

When the information (data) is collected by the investigator himself or herself with a definite objective in his or her mind, the data obtained is called primary data

example for in your school the collection of heights of 20 students in your class.

primary data are reliable and relevant because they are original in character and are collected by the investigator himself

Method of Collection of Primary Data

We can collect the primary data in the following manner :

by Interview: here telephonic or face to face
by questionnaire
by observation

Secondary Data

the data (information )which is not originally collected rather than gathered from a source that already had the information stored in this manner the collection of data is called secondary data.

example in your school collect the number of absentees each day in your class for a month, how you will collect this data I am sure by your class attendance register which is already a source of some information

this is most important that the source is reliable by which we are collecting the data, here is the great care required

Method of Collection of Secondary

As we know that secondary data is not collected by the original investigator, here are two types of sources by which we will collect the secondary data

1. Published Sources

U.N.O.
Government Publications
Public Records
Historical and Statical Documents
Business Documents
News Papers
Semi-Government Publications etc.

2. Unpublished Sources

Unpublished Biographies
Diaries
Unpublished Letters etc.

Difference between Primary and Secondary Data

1. Primary data is collected by the researcher but secondary data is not collected by the researcher himself.

2. Primary data is the most accurate but secondary data is less accurate as compared to primary data.

3. Primary data is organized data in a proper manner but secondary data as you know is gathered by other sources that’s why it is not in a properly organized form.

4. For A collection of primary data required large human resources, more time, and is also expensive but for collection of secondary data is very easy because it is collected by already available sources hence secondary data collection is less expensive.

Representation of Data

As soon as work related to the collection of primary and secondary data is over, the investigator has to find ways to present both types of data in a form that contains the following key points

The first one is meaningful,
The second point is easy to understand.
And the last point fulfills its main features with a brief look.

Now take some examples to represent the data

Example 1: Following data are the marks obtained by 10 students on the English test as given below :

55 25 36 95 73 60 42 63 78 76

The data in this form is called raw data.

by looking at this form if we are going to find the lowest and highest marks then we take some time to search for that one

but if this score is arranged in ascending or descending order

25 36 42 55 60 63 73 76 78 95

now we can clearly see the lowest marks are 25 and the highest marks are 95.

Range

The difference between the highest and the lowest value of given data is called the range.

now the range, in this case, is 95 -25 =70

Presentation of data in ascending and descending order can be quite a time taking if the number of observations in an experiment is large as in the case of the following example:

Example 2: Consider the marks obtained (out of 100 marks )by 30 students of Class IX of your school :

10 20 36 92 95 40 50 56 60 70

92 88 80 70 72 70 36 40 36 40

92 40 50 50 56 60 70 60 60 88

here we will make a frequency distribution table:

Marks

Tally marks

number of students

(i.e., the frequency )

III

IIII

III

IIII

III

Total

This table is called an ungrouped frequency distribution table.

If the data is very large then we will use grouped frequency distribution table.

Class interval

Frequency

10 -20

20 -30

30 -40

40 -50

50 -60

60 -70

70 -80

80 -90

90 -100

total

What is Statistics? – Primary and Secondary Data