Statistics
Textbooks
Boundless Statistics
Sampling
Populations and Samples
Statistics Textbooks Boundless Statistics Sampling Populations and Samples
Statistics Textbooks Boundless Statistics Sampling
Statistics Textbooks Boundless Statistics
Statistics Textbooks
Statistics
Concept Version 5
Created by Boundless

Samples

A sample is a set of data collected and/or selected from a population by a defined procedure.

Learning Objective

  • Differentiate between a sample and a population


Key Points

    • A complete sample is a set of objects from a parent population that includes all such objects that satisfy a set of well-defined selection criteria.
    • An unbiased (representative) sample is a set of objects chosen from a complete sample using a selection process that does not depend on the properties of the objects.
    • A random sample is defined as a sample where each individual member of the population has a known, non-zero chance of being selected as part of the sample.

Terms

  • census

    an official count of members of a population (not necessarily human), usually residents or citizens in a particular region, often done at regular intervals

  • population

    a group of units (persons, objects, or other items) enumerated in a census or from which a sample is drawn

  • unbiased

    impartial or without prejudice


Full Text

What is a Sample?

In statistics and quantitative research methodology, a data sample is a set of data collected and/or selected from a population by a defined procedure.

Typically, the population is very large, making a census or a complete enumeration of all the values in the population impractical or impossible. The sample represents a subset of manageable size. Samples are collected and statistics are calculated from the samples so that one can make inferences or extrapolations from the sample to the population. This process of collecting information from a sample is referred to as sampling.

Types of Samples

A complete sample is a set of objects from a parent population that includes all such objects that satisfy a set of well-defined selection criteria. For example, a complete sample of Australian men taller than 2 meters would consist of a list of every Australian male taller than 2 meters. It wouldn't include German males, or tall Australian females, or people shorter than 2 meters. To compile such a complete sample requires a complete list of the parent population, including data on height, gender, and nationality for each member of that parent population. In the case of human populations, such a complete list is unlikely to exist, but such complete samples are often available in other disciplines, such as complete magnitude-limited samples of astronomical objects.

An unbiased (representative) sample is a set of objects chosen from a complete sample using a selection process that does not depend on the properties of the objects. For example, an unbiased sample of Australian men taller than 2 meters might consist of a randomly sampled subset of 1% of Australian males taller than 2 meters. However, one chosen from the electoral register might not be unbiased since, for example, males aged under 18 will not be on the electoral register. In an astronomical context, an unbiased sample might consist of that fraction of a complete sample for which data are available, provided the data availability is not biased by individual source properties.

The best way to avoid a biased or unrepresentative sample is to select a random sample, also known as a probability sample. A random sample is defined as a sample wherein each individual member of the population has a known, non-zero chance of being selected as part of the sample. Several types of random samples are simple random samples, systematic samples, stratified random samples, and cluster random samples.

Samples

Online and phone-in polls produce biased samples because the respondents are self-selected. In self-selection bias, those individuals who are highly motivated to respond-- typically individuals who have strong opinions-- are over-represented, and individuals who are indifferent or apathetic are less likely to respond.

A sample that is not random is called a non-random sample, or a non-probability sampling. Some examples of nonrandom samples are convenience samples, judgment samples, and quota samples.

[ edit ]
Edit this content
Prev Concept
Populations
Random Sampling
Next Concept
Subjects
  • Accounting
  • Algebra
  • Art History
  • Biology
  • Business
  • Calculus
  • Chemistry
  • Communications
  • Economics
  • Finance
  • Management
  • Marketing
  • Microbiology
  • Physics
  • Physiology
  • Political Science
  • Psychology
  • Sociology
  • Statistics
  • U.S. History
  • World History
  • Writing

Except where noted, content and user contributions on this site are licensed under CC BY-SA 4.0 with attribution required.