History
The first known example of an opinion poll was a tally of voter preferences reported by the ''Raleigh Star and North Carolina State Gazette'' and the ''Wilmington American Watchman and Delaware Advertiser'' prior to the 1824 presidential election, showingThe statistics of opinion polls
If we ask a yes-no question of a sample of people selected randomly from a large population, then the proportion of the sample that respond "yes" will be close to the true proportion, '''', of the whole population who would have said "yes" had all of them been asked. The distribution of the proportion of 'yes' answers follows the binomial distribution. A binomial distribution converges to a normal distribution if the size of the sample approaches infinity according to the central limit theorem. In practice the binomial distribution is approximated by a normal distribution when and where is the sample size. The larger is the sample, the better is the approximation. Suppose that people were sampled, and a share of them responded "yes". This sample proportion can be used instead of , which is unknown, to compute the sample mean, variance and standard deviation. The sample mean is: . The sample variance is: . The sample standard deviation is: .Example:
Assume that we conduct a poll in which people are asked whether they support candidate A. We sample 1000 people of which 650 respond "yes". In this case and . Therefore, we can approximate the binomial distribution by using the normal distribution. As a rule of thumb we want that our poll result will accurate in the 5% significance level or less. Therefore, we will compute the confidence interval: The sample mean is: . The sample variance is: . The sample standard deviation is: . We shall use the formula to create a confidence interval with 95% confidence level: where is the population mean and is the z-score for 95% confidence level. or: That is, we are 95% confident that the true population mean, , is between 620.44 and 679.55. Remembering that , we can say that or is 0.65 with a margin of error equals to 3% (we rounded the numbers).How many people do we need to create a valid sample?
The answer depends on the population size and required margin of error. We shall use Cochran's formula: , where is the z-score for a confidence level of and is the required margin of error. Note that the function is maximized at , therefore, before starting sampling we will use to determine the sample size. For example, assume that we want 95% confidence level and 5% margin of error: . Note that the required sample size is affected by the confidence level and margin of error. If we want 99% confidence interval we have to sample 664 people, and, alternatively, if we want a margin of error of 2% we will have to sample 2401 people. For a finite population, when the sample is a large proportion of population, we modify the formula: where N is the size of the entire population. Note that as N approaches infinity, the two formulas coincide, meaning the consideration of population size can only reduce the required sample size needed for a valid sample. In the above example, if the entire population is 600 then we have to sample only 285 people ().Sample and polling methods
Opinion polls for many years were maintained through telecommunications or in person-to-person contact. Methods and techniques vary, though they are widely accepted in most areas. Over the years, technological innovations have also influenced survey methods such as the availability of electronic clipboards and Internet based polling. Opinion polling developed into popular applications through popular thought, although response rates for some surveys declined. Also, the following has also led to differentiating results: Some polling organizations, such as Angus Reid Public Opinion, YouGov and Zogby useBenchmark polls
A ''benchmark poll'' is generally the first poll taken in a campaign. It is often taken before a candidate announces their bid for office, but sometimes it happens immediately following that announcement after they have had some opportunity to raise funds. This is generally a short and simple survey of likely voters. Benchmark polling often relies on timing, which can be a significant problem if a poll is conducted too early for anyone to know about the potential candidate. A benchmark poll needs to be undertaken when voters are starting to learn more about the possible candidate running for office. A benchmark poll serves a number of purposes for a campaign. First, it gives the candidate a picture of where they stand with the electorate before any campaigning takes place. If the poll is done prior to announcing for office the candidate may use the poll to decide whether or not they should even run for office. Secondly, it shows them where their weaknesses and strengths are in two main areas. The first is the electorate. A benchmark poll shows them what types of voters they are sure to win, those they are sure to lose, and everyone in-between these two extremes. This lets the campaign know which voters are persuadable so they can spend their limited resources in the most effective manner. Second, it can give them an idea of what messages, ideas, or slogans are the strongest with the electorate.Kenneth F. Warren (1992). "in Defense of Public Opinion Polling." Westview Press. p. 200-1.Tracking polls
In a tracking poll responses are obtained in a number of consecutive periods, for instance daily, and then results are calculated using a moving average of the responses that were gathered over a fixed number of the most recent periods, for example the past five days. In this example, the next calculated results will use data for five days counting backwards from the next day, namely the same data as before, but with the data from the next day included, and without the data from the sixth day before that day. However, these polls are sometimes subject to dramatic fluctuations, and so political campaigns and candidates are cautious in analyzing their results. An example of a tracking poll that generated controversy over its accuracy, is one conducted during the 2000 U.S. presidential election, by the Gallup Organization. The results for one day showed Democratic candidate Al Gore with an eleven-point lead over Republican candidateDeliberative opinion polls
Deliberative Opinion Polls combine the aspects of a public opinion poll and a focus group. These polls bring a group of voters and provide information about specific issues. They are then allowed to discuss those issues with the other voters. Once they know more about the issues, they are polled afterward on their thoughts. Many scholars argue that this type of polling is much more effective than traditional public opinion polling. Unlike traditional public polling, deliberative opinion polls measure what the public believes about issues after being offered information and the ability to discuss them with other voters. Since voters generally do not actively research various issues, they often base their opinions on these issues on what the media and candidates say about them. Scholars argued that these polls can truly reflect voters' feelings about an issue once they are given the necessary information to learn more about it. Despite this, there are two issues with deliberative opinion polls. First, they are expensive and challenging to perform since they require a representative sample of voters, and the information given on specific issues must be fair and balanced. Second, the results of deliberative opinion polls generally do not reflect the opinions of most voters since most voters do not take the time to research issues the way an academic researches issues.Exit polls
Exit polls interview voters just as they are leaving polling places. Unlike general public opinion polls, these are polls of people who voted in the election. Exit polls provide a more accurate picture of which candidates the public prefers in an election because people participating in the poll did vote in the election. Second, these polls are conducted across multiple voting locations across the country, allowing for a comparative analysis between specific regions. For example, in the United States, exit polls are beneficial in accurately determining how the state voters cast their ballot instead of relying on a national survey. Third, exit polls can give journalists and social scientists a greater understanding of why voters voted the way they did and what factors contributed to their vote. Exit polling has several disadvantages that can cause controversy depending on its use. First, these polls are not always accurate and can sometimes mislead election reporting. For instance, during the 2016 U.S. primaries, CNN reported that the Democratic primary in New York was too close to call, and they made this judgment based on exit polls. However, the vote count revealed that these exit polls were misleading, and Hillary Clinton was far ahead of Bernie Sanders in the popular vote, winning the state by 58% to 42% margin. The overreliance on exit polling leads to the second point of how it undermines public trust in the media and the electoral process. In the U.S., Congress and state governments have criticized the use of exit polling because Americans tend to believe more in the accuracy of exit polls. If an exit poll shows that American voters were leaning toward a particular candidate, most would assume that the candidate would win. However, as mentioned earlier, an exit poll can sometimes be inaccurate and lead to situations like the 2016 New York primary, where a news organization reports misleading primary results. Government officials argue that since many Americans believe in exit polls more, election results are likely to make voters not think they are impacted electorally and be more doubtful about the credibility of news organizations.Potential for inaccuracy
Over time, a number of theories and mechanisms have been offered to explain erroneous polling results. Some of these reflect errors on the part of the pollsters; many of them are statistical in nature. Some blame respondents for not providing genuine answers to pollsters, a phenomenon known as social desirability-bias (also referred to as the Bradley effect or the Shy Tory Factor); these terms can be quite controversial.Margin of error due to sampling
Polls based on samples of populations are subject to sampling error which reflects the effects of chance and uncertainty in the sampling process. Sampling polls rely on the law of large numbers to measure the opinions of the whole population based only on a subset, and for this purpose the absolute size of the sample is important, but the percentage of the whole population is not important (unless it happens to be close to the sample size). The possible difference between the sample and whole population is often expressed as a margin of error – usually defined as the radius of a 95% confidence interval for a particular statistic. One example is the percent of people who prefer product A versus product B. When a single, global margin of error is reported for a survey, it refers to the maximum margin of error for all reported percentages using the full sample from the survey. If the statistic is a percentage, this maximum margin of error can be calculated as the radius of the confidence interval for a reported percentage of 50%. For a poll with a random sample of 1,000 people reporting a proportion around 50% for some question, the sampling margin of error is approximately ±3% for the estimated proportion of the whole population. A 3% margin of error means that if the same procedure is used a large number of times, 95% of the time the true population average will be within the sample estimate plus or minus 3%. The margin of error can be reduced by using a larger sample, however if a pollster wishes to reduce the margin of error to 1% they would need a sample of around 10,000 people. In practice, pollsters need to balance the cost of a large sample against the reduction in sampling error and a sample size of around 500–1,000 is a typical compromise for political polls. (To get complete responses it may be necessary to include thousands of additional participators.) Another way to reduce the margin of error is to rely on poll averages. This makes the assumption that the procedure is similar enough between many different polls and uses the sample size of each poll to create a polling average. Another source of error stems from faulty demographic models by pollsters who weigh their samples by particular variables such as party identification in an election. For example, if you assume that the breakdown of the US population by party identification has not changed since the previous presidential election, you may underestimate a victory or a defeat of a particular party candidate that saw a surge or decline in its party registration relative to the previous presidential election cycle. Sample Techniques are also used and recommended to reduce sample errors and errors of margin. In chapter four of author Herb Asher he says,"it is probability sampling and statistical theory that enable one to determine sampling error, confidence levels, and the like and to generalize from the results of the sample to the broader population from which it was selected. Other factors also come into play in making a survey scientific. One must select a sample of sufficient size. If the sampling error is too large or the level of confidence too low, it will be difficult to make reasonably precise statements about characteristics of the population of interest to the pollster. A scientific poll not only will have a sufficiently large sample, it will also be sensitive to response rates. Very low response rates will raise questions about how representative and accurate the results are. Are there systematic differences between those who participated in the survey and those who, for whatever reason, did not participate? Sampling methods, sample size, and response rates will all be discussed in this chapter" (Asher 2017). A caution is that an estimate of a trend is subject to a larger error than an estimate of a level. This is because if one estimates the change, the difference between two numbers ''X'' and ''Y,'' then one has to contend with errors in both ''X'' and ''Y''. A rough guide is that if the change in measurement falls outside the margin of error it is worth attention.Nonresponse bias
Since some people do not answer calls from strangers, or refuse to answer the poll, poll samples may not be representative samples from a population due to a non-response bias. Response rates have been declining, and are down to about 10% in recent years. Various pollsters have attributed this to an increased skepticism and lack of interest in polling. Because of this selection bias, the characteristics of those who agree to be interviewed may be markedly different from those who decline. That is, the actual sample is a biased version of the universe the pollster wants to analyze. In these cases, bias introduces new errors, one way or the other, that are in addition to errors caused by sample size. Error due to bias does not become smaller with larger sample sizes, because taking a larger sample size simply repeats the same mistake on a larger scale. If the people who refuse to answer, or are never reached, have the same characteristics as the people who do answer, then the final results should be unbiased. If the people who do not answer have different opinions then there is bias in the results. In terms of election polls, studies suggest that bias effects are small, but each polling firm has its own techniques for adjusting weights to minimize selection bias.Response bias
Survey results may be affected by response bias, where the answers given by respondents do not reflect their true beliefs. This may be deliberately engineered by unscrupulous pollsters in order to generate a certain result or please their clients, but more often is a result of the detailed wording or ordering of questions (see below). Respondents may deliberately try to manipulate the outcome of a poll by e.g. advocating a more extreme position than they actually hold in order to boost their side of the argument or give rapid and ill-considered answers in order to hasten the end of their questioning. Respondents may also feel under social pressure not to give an unpopular answer. For example, respondents might be unwilling to admit to unpopular attitudes likeWording of questions
Among the factors that impact the results of opinion polls are the wording and order of the questions being posed by the surveyor. Questions that intentionally affect a respondents answer are referred to as leading questions. Individuals and/or groups use these types of questions in surveys to elicit responses favorable to their interests. For instance, the public is more likely to indicate support for a person who is described by the surveyor as one of the "leading candidates". This description is "leading" as it indicates a subtle bias for that candidate, since it implies that the others in the race are not serious contenders. Additionally, leading questions often contain, or lack, certain facts that can sway a respondent's answer. Argumentative Questions can also impact the outcome of a survey. These types of questions, depending on their nature, either positive or negative, influence respondents' answers to reflect the tone of the question(s) and generate a certain response or reaction, rather than gauge sentiment in an unbiased manner. In opinion polling, there are also " loaded questions", otherwise known as " trick questions". This type of leading question may concern an uncomfortable or controversial issue, and/or automatically assume the subject of the question is related to the respondent(s) or that they are knowledgeable about it. Likewise, the questions are then worded in a way that limit the possible answers, typically to yes or no. Another type of question that can produce inaccurate results are " Double-Negative Questions". These are more often the result of human error, rather than intentional manipulation. One such example is a survey done in 1992 by the Roper Organization, concerning the Holocaust. The question read "Does it seem possible or impossible to you that the Nazi extermination of the Jews never happened?" The confusing wording of this question led to inaccurate results which indicated that 22 percent of respondents believed it seemed possible the Holocaust might not have ever happened. When the question was reworded, significantly fewer respondents (only 1 percent) expressed that same sentiment. Thus comparisons between polls often boil down to the wording of the question. On some issues, question wording can result in quite pronounced differences between surveys. This can also, however, be a result of legitimately conflicted feelings or evolving attitudes, rather than a poorly constructed survey. A common technique to control for this bias is to rotate the order in which questions are asked. Many pollsters also split-sample. This involves having two different versions of a question, with each version presented to half the respondents. The most effective controls, used by attitude researchers, are: * asking enough questions to allow all aspects of an issue to be covered and to control effects due to the form of the question (such as positive or negative wording), the adequacy of the number being established quantitatively with psychometric measures such as reliability coefficients, and * analyzing the results with psychometric techniques which synthesize the answers into a few reliable scores and detect ineffective questions. These controls are not widely used in the polling industry.. However, as it is important that questions to test the product have a high quality, survey methodologists work on methods to test them. Empirical tests provide insight into the quality of the questionnaire, some may be more complex than others. For instance, testing a questionnaire can be done by: * conducting cognitive interviewing. By asking a sample of potential-respondents about their interpretation of the questions and use of the questionnaire, a researcher can * carrying out a small pretest of the questionnaire, using a small subset of target respondents. Results can inform a researcher of errors such as missing questions, or logical and procedural errors. * estimating the measurement quality of the questions. This can be done for instance using test-retest, quasi-simplex, or mutlitrait-multimethod models. * predicting the measurement quality of the question. This can be done using the software Survey Quality Predictor (SQP).Involuntary facades and false correlations
One of the criticisms of opinion polls is that societal assumptions that opinions between which there is no logical link are "correlated attitudes" can push people with one opinion into a group that forces them to pretend to have a supposedly linked but actually unrelated opinion. That, in turn, may cause people who have the first opinion to claim on polls that they have the second opinion without having it, causing opinion polls to become part of self-fulfilling prophecy problems. It has been suggested that attempts to counteract unethical opinions by condemning supposedly linked opinions may favor the groups that promote the actually unethical opinions by forcing people with supposedly linked opinions into them by ostracism elsewhere in society making such efforts counterproductive, that not being sent between groups that assume ulterior motives from each other and not being allowed to express consistent critical thought anywhere may create psychological stress because humans are sapient, and that discussion spaces free from assumptions of ulterior motives behind specific opinions should be created. In this context, rejection of the assumption that opinion polls show actual links between opinions is considered important.Coverage bias
Another source of error is the use of samples that are not representative of the population as a consequence of the methodology used, as was the experience of ''The Literary Digest'' in 1936. For example, telephone sampling has a built-in error because in many times and places, those with telephones have generally been richer than those without. In some places many people have only mobile telephones. Because pollsters cannot use automated dialing machines to call mobile phones in the United States (because the phone's owner may be charged for taking a call), these individuals are typically excluded from polling samples. There is concern that, if the subset of the population without cell phones differs markedly from the rest of the population, these differences can skew the results of the poll. Polling organizations have developed many weighting techniques to help overcome these deficiencies, with varying degrees of success. Studies of mobile phone users by the Pew Research Center in the US, in 2007, concluded that "cell-only respondents are different from landline respondents in important ways, (but) they were neither numerous enough nor different enough on the questions we examined to produce a significant change in overall general population survey estimates when included with the landline samples and weighted according to US Census parameters on basic demographic characteristics." This issue was first identified in 2004, but came to prominence only during the 2008 US presidential election. In previous elections, the proportion of the general population using cell phones was small, but as this proportion has increased, there is concern that polling only landlines is no longer representative of the general population. In 2003, only 2.9% of households were wireless (cellphones only), compared to 12.8% in 2006. This results in " coverage error". Many polling organisations select their sample by dialling random telephone numbers; however, in 2008, there was a clear tendency for polls which included mobile phones in their samples to show a much larger lead for Obama, than polls that did not. The potential sources of bias are: # Some households use cellphones only and have no landline. This tends to include minorities and younger voters; and occurs more frequently in metropolitan areas. Men are more likely to be cellphone-only compared to women. # Some people may not be contactable by landline from Monday to Friday and may be contactable only by cellphone. # Some people use their landlines only to access the Internet, and answer calls only to their cellphones. Some polling companies have attempted to get around that problem by including a "cellphone supplement". There are a number of problems with including cellphones in a telephone poll: # It is difficult to get co-operation from cellphone users, because in many parts of the US, users are charged for both outgoing and incoming calls. That means that pollsters have had to offer financial compensation to gain co-operation. # US federal law prohibits the use of automated dialling devices to call cellphones ( Telephone Consumer Protection Act of 1991). Numbers therefore have to be dialled by hand, which is more time-consuming and expensive for pollsters.Failures
A widely publicized failure of opinion polling to date in theSocial media as a source of opinion on candidates
Social media today is a popular medium for the candidates to campaign and for gauging the public reaction to the campaigns. Social media can also be used as an indicator of the voter opinion regarding the poll. Some research studies have shown that predictions made using social media signals can match traditional opinion polls. Regarding the 2016 U.S. presidential election, a major concern has been that of the effect of false stories spread throughoutInfluence
Effect on voters
By providing information about voting intentions, opinion polls can sometimes influence the behavior of electors, and in his book '' The Broken Compass'', Peter Hitchens asserts that opinion polls are actually a device for influencing public opinion. The various theories about how this happens can be split into two groups: bandwagon/underdog effects, and strategic ("tactical") voting. A bandwagon effect occurs when the poll prompts voters to back the candidate shown to be winning in the poll. The idea that voters are susceptible to such effects is old, stemming at least from 1884; William Safire reported that the term was first used in a political cartoon in the magazine '' Puck'' in that year. It has also remained persistent in spite of a lack of empirical corroboration until the late 20th century.Effect on politicians
Starting in the 1980s, tracking polls and related technologies began having a notable impact on U.S. political leaders. According to Douglas Bailey, a Republican who had helped run Gerald Ford's 1976 presidential campaign, "It's no longer necessary for a political candidate to guess what an audience thinks. He can ind outwith a nightly tracking poll. So it's no longer likely that political leaders are going to lead. Instead, they're going to follow." An example of opinion polls having significant impact on politicians is Ronald Reagan's advocacy for a voluntary social security program in the 1960s and early 1970s. Because polls showed that a large proportion of the public would not support such a program, he dropped the issue when he ran for presidency.Regulation
Some jurisdictions over the world restrict the publication of the results of opinion polls, especially during the period around an election, in order to prevent the possibly erroneous results from affecting voters' decisions. For instance, in Canada, it is prohibited to publish the results of opinion surveys that would identify specific political parties or candidates in the final three days before a poll closes. However, most Western democratic nations do not support the entire prohibition of the publication of pre-election opinion polls; most of them have no regulation and some only prohibit it in the final days or hours until the relevant poll closes. A survey by Canada's Royal Commission on Electoral Reform reported that the prohibition period of publication of the survey results largely differed in different countries. Out of the 20 countries examined, 3 prohibit the publication during the entire period of campaigns, while others prohibit it for a shorter term such as the polling period or the final 48 hours before a poll closes. In India, the Election Commission has prohibited it in the 48 hours before the start of polling.Opinion poll in dictatorships
The director of the Levada Center stated in 2015 that drawing conclusions fromSee also
* Deliberative opinion poll * Entrance poll * Electoral geography * Europe Elects * Everett Carll Ladd * Exit poll * Historical polling for U.S. Presidential elections * List of polling organizations * Metallic Metals Act * Open access poll * Psephology * Political analyst * Political data scientists * Political forecasting * Push poll *Footnotes
References
* Asher, Herbert: ''Polling and the Public. What Every Citizen Should Know'' (4th ed. CQ Press, 1998) * Bourdieu, Pierre, "Public Opinion does not exist" in ''Sociology in Question'', London, Sage (1995). * Bradburn, Norman M. and Seymour Sudman. ''Polls and Surveys: Understanding What They Tell Us'' (1988). * Cantril, Hadley. ''Gauging Public Opinion'' (1944Additional sources
* Brodie, Mollyann, et al. "The Past, Present, And Possible Future Of Public Opinion On The ACA: A review of 102 nationally representative public opinion polls about the Affordable Care Act, 2010 through 2019." ''Health Affairs'' 39.3 (2020): 462–470. * Dyczok, Marta. "Information wars: hegemony, counter-hegemony, propaganda, the use of force, and resistance." ''Russian Journal of Communication'' 6#2 (2014): 173–176. * Eagly, Alice H., et al. "Gender stereotypes have changed: A cross-temporal meta-analysis of US public opinion polls from 1946 to 2018." ''American psychologist'' 75.3 (2020): 301+