Stat 200 Assignment 1 Overview
The 'maximum temperature of Cebu City in a year' is a continuous variable because temperature can take any value within a given range, including decimals. The 'number of board examination passers in Engineering' is a discrete variable because it represents a countable number of people, which can only be whole numbers.
The class mark of the 5th class interval is the average of its lower and upper limits: (lower limit + upper limit) / 2. The limits are read from the fifth row of the Frequency Distribution Table.
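The class-mark rule can be sketched in a few lines of Python. The class limits below are hypothetical and chosen only for illustration; they are not the assignment's actual Frequency Distribution Table.

```python
def class_mark(lower_limit, upper_limit):
    """Return the midpoint (class mark) of a class interval."""
    return (lower_limit + upper_limit) / 2


# Hypothetical class limits (assumed for illustration, not from the assignment).
class_limits = [(1, 5), (6, 10), (11, 15), (16, 20), (21, 25)]

# The 5th class interval is at index 4 because Python lists start at 0.
fifth_lower, fifth_upper = class_limits[4]
fifth_class_mark = class_mark(fifth_lower, fifth_upper)
print(fifth_class_mark)
```

With the real table, only the values in class_limits need to change.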
To determine the sample size for each stratum using proportional allocation, first calculate the total sample size n using Slovin’s formula, n = N / (1 + N * e^2), where N is the overall population size and e is the margin of error. Then allocate samples to each stratum according to its share of the entire population using the formula: (stratum size / total population size) * total sample size.
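The two steps can be sketched as follows. The stratum names and sizes and the 5% margin of error are hypothetical values, not figures from the assignment. Rounding the total up and rounding each stratum to the nearest whole number are also assumptions; an instructor may prescribe a different rounding rule.

```python
import math


def slovin_sample_size(population_size, margin_of_error):
    """Slovin's formula n = N / (1 + N * e^2), rounded up to a whole person."""
    return math.ceil(population_size / (1 + population_size * margin_of_error ** 2))


def proportional_allocation(stratum_sizes, total_sample_size):
    """Give each stratum (stratum size / total population) * total sample size."""
    total_population = sum(stratum_sizes.values())
    return {
        name: round(size / total_population * total_sample_size)
        for name, size in stratum_sizes.items()
    }


# Hypothetical strata (assumed for illustration only).
strata = {"first_year": 400, "second_year": 300, "third_year": 200, "fourth_year": 100}

n = slovin_sample_size(sum(strata.values()), 0.05)
allocation = proportional_allocation(strata, n)
print(n, allocation)
```

Because each stratum is rounded separately, the allocations can differ from the total sample size by one or two in other data sets; in that case the largest stratum is usually adjusted to absorb the difference.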
To find the percentage of students with scores less than a specific value, sum the frequencies of all classes that lie entirely below that value, divide by the total number of students, and then multiply by 100 to convert the result into a percentage.
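A minimal sketch of this calculation, assuming a hypothetical frequency table stored as (lower limit, upper limit, frequency) rows. The frequencies are invented for illustration. The sketch also assumes the cutoff value coincides with the start of a class, so that every class is either entirely below it or not.

```python
def percent_below(table, value):
    """Percentage of observations in classes whose upper limit is below value."""
    total = sum(freq for _, _, freq in table)
    below = sum(freq for _, upper, freq in table if upper < value)
    return below / total * 100


# Hypothetical table rows: (lower limit, upper limit, frequency).
table = [(1, 5, 4), (6, 10, 6), (11, 15, 10), (16, 20, 12), (21, 25, 8)]

pct_below_16 = percent_below(table, 16)
print(pct_below_16)
```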
Socio-economic status is measured on an ordinal scale as it represents categories with a meaningful order but no standard difference between levels, such as low, middle, and high. In contrast, employee ID numbers are nominal as they are identifiers without any order or quantitative significance.
Two types of non-probability sampling are convenience sampling and judgmental (or purposive) sampling. Convenience sampling involves selecting individuals who are easiest to reach, which may introduce bias but is useful for exploratory research. Judgmental sampling involves selecting individuals based on the researcher's judgment about who will provide the best examples and information for a study.
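To determine the number of students with scores greater than 15, refer to the constructed Frequency Distribution Table, identify the classes with lower limits greater than 15, and sum the frequencies of these classes. This works when 15 is the upper limit of a class; if 15 falls inside a class, count from the raw scores instead.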
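The counting step can be sketched as below. The table is hypothetical and invented for illustration; it is not the assignment's constructed Frequency Distribution Table. It assumes 15 is the upper limit of one class, so no class straddles the cutoff.

```python
def count_above(table, cutoff):
    """Sum the frequencies of classes whose lower limit is greater than cutoff."""
    return sum(freq for lower, _, freq in table if lower > cutoff)


# Hypothetical table rows: (lower limit, upper limit, frequency).
table = [(1, 5, 4), (6, 10, 6), (11, 15, 10), (16, 20, 12), (21, 25, 8)]

students_above_15 = count_above(table, 15)
print(students_above_15)
```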
Stratified sampling is significant because it ensures representation from all sub-groups in a population, leading to more accurate and reliable results. It improves precision by reducing sampling error compared to simple random sampling, especially in heterogeneous populations, and allows researchers to make specific inferences about sub-group differences.
Three common methods of data collection are survey, observation, and experiment. Surveys involve asking individuals specific questions face-to-face, by telephone, or through a questionnaire. Observations entail watching subjects in their natural environment, often without their knowledge, to gather authentic data. Experiments involve manipulating variables in a controlled setting to study their effects on the subject.
Three sources of data are primary data, secondary data, and tertiary data. Primary data is collected directly from the source, such as through conducting interviews. Secondary data is gathered from existing sources like books or articles. Tertiary data consolidates and summarizes primary and secondary data, as seen in encyclopedias.