Question:

With reference to the steps of Data Science Methodology, define the process of data collection. Also differentiate between primary and secondary data sources of data collection with suitable examples.

Show Hint

Primary = First-hand data (you collect)
Secondary = Already available data (others collected)
Hide Solution
collegedunia
Verified By Collegedunia

Solution and Explanation

Step 1: Define Data Collection.
Data collection is the process of gathering relevant data from various sources to answer a problem or support decision-making in Data Science.

Step 2:
Explain its role in methodology.
It is an important step in Data Science Methodology as the quality and accuracy of collected data directly affect the results and insights.

Step 3:
Define Primary Data.
Primary data is the data collected directly by the researcher for a specific purpose.
Example: Surveys, interviews, experiments.

Step 4:
Define Secondary Data.
Secondary data is the data that has already been collected and published by others.
Example: Government reports, websites, research papers.

Step 5:
Key difference.
Primary data is original and specific but time-consuming, whereas secondary data is easily available but may not be fully relevant.
Was this answer helpful?
0
0