Terms
This technique utilizesBasic study layout
The reverse correlation method is typically executed as an in-lab computer experiment. This method follows four broad steps. Each of the following steps are described in greater detail below. After creating a research question and determining that the reverse correlation method is the most suitable technique to answer the question, a researcher must (1) design randomly varying stimuli. After the stimuli have been prepared, a researcher should (2) collect data from participants who will see and respond to approximately 300 -1,000 trials. Each trial will either consist of one or two images (side by side) derived from the same base image with noise superimposed on top. Participant responses will depend on the chosen study design; if a researcher presents only one image at a time, participants rate the image on a 4pt scale, but when two images are shown, the participant is asked to choose which best aligns with the given category (e.g. choose the image that looks the most aggressive). Once all of the data is collected, the researcher will (3) compute classification images for each participant and using those images compute group classification images. Finally, with the classification images available, the researcher will (4) evaluate the images and draw conclusions about their results.Step 1: making stimuli
When designing the stimuli for a reverse correlation study, the two primary factors that one should consider are (1) the base image and (2) the noise that will be used. While not all bases are images per se, the majority are and for this reason the base is typically referred to as a base image. The base image should represent whatever the research question is addressing. For example, if you are interested in peoples’ mental representations of Chinese people, it would not make sense to use a base image of a Spanish or Caucasian person. Again, if you are interested in the mental representations of male vocal patterns, it would make the most sense to use a base vocal pattern that has been produced by a male. Having a base is important because it provides a kind of anchor for participants to work from. When there is no base image, the number of trials that are required increases dramatically, thus making it harder to collect data. While there are studies that have excluded a base image, (e.g. the S study), for more elaborate and nuanced research questions, it is important to have a base image that is a fair representation of what participants are being asked to categorize. Photographs of faces are generally the most popular base image. Although the reverse correlation method is capable of investigating a wide variety of research questions, the most common application of the method is for evaluating faces on a single trait. Reverse correlation studies that address evaluations of the face are sometimes referred to as being a face space reverse correlation model (FSRCM). Thankfully, there are existing databases for face images of varying demographics and emotion that work well as base images. The reverse correlation method can also be used to help researchers identify what areas of an image (e.g. the areas on the face) have diagnostic value. In order to identify these areas of value, researchers start by minimizing the space a participant can pull information from. By imposing a “mask” on an image (e.g. blur an image while leaving random areas un-blurred), this reduces the information individuals might see, and forces them to focus on certain areas. Then, if/when participants are able to correctly identify an image with a trait repeatedly, we can draw conclusions about what areas have diagnostic value. While faces and visual stimuli are the most popular, this is not the only stimuli that can be used in a reverse correlation study. This method was originally designed for auditory stimuli which allows researchers to investigate how perceivers interpret auditory information and create trait based attributions to different sound patterns. For example, by segmenting a vocal recording of a single word (total sound time 426 ms) into six segments (71 ms each), and varying each segment's pitch using Gaussian distributions, researchers were able to uncover what vocal patterns people associated with certain traits. Specifically, this study investigated how listeners rated sound clips of the word “really” as sounding more interrogative (i.e. like the more common reverse correlation studies this study had participants listen to two sound clips per trial, choose which fit the category the best, and then created an average of the pitch contours). Beyond face and auditory perception, research utilizing the reverse correlation method has expanded to investigate how individuals see three-dimensional objects in images with noise (but no signal). After selecting your base image, regardless of what the image is, it is helpful to apply a Gaussian blur to smooth noise in the image. While noise will be applied later, it is helpful to reduce existing noise in the photo before applying your chosen noise. There are three primary choices when it comes to noise:Step 2: data collection
Once the stimuli for the study has been developed, the researcher must make a few decisions before actually collecting the data. The researcher must come to a conclusion on how many stimuli will be presented at a time and how many trials the participants will see. In terms of stimuli presentation, a researcher can choose from either a 2-Image Forced Choice (2IFC) or a 4-Alternative Forced Choice (4AFC). The 2IFC presents two images at once (side by side) and requires participants to choose between the two on a specified category (e.g. which image looks the most like a male). Typically the noise from the left image is the mathematical inverse of the noise from the right image. This method was developed to better answer questions that could not be fully answered by the 4AFC method. As compared to the 2IFC, the 4AFC only shows participants one image per trial and requires them to rate the image on a 4-point scale ((1) Probably X, (2) Possibly X, (3) Possibly Y, (4) Probably Y). For example, here X might represent male and Y might represent female. Typically, during data analysis, only images that are chosen as a “probably” category are included. As mentioned previously, the 2IFC was designed to address questions that could not be easily answered by the 4AFC. In the 4AFC, there is the possibility that participants may not choose a “probably” category, and if this happens, no classification image can be computed. For example, if the base image does not look like the mental representation participants are asked to report on, then participants may never make a confident choice and classify the image under a “probably” category. While this is a flaw in the 4AFC, one advantage to this method and scale structure is that researchers can see participants’ certainty judgements on their classification decisions (e.g. a probably X label would suggest greater confidence in their decision than a possibly X label). As for choosing the number of trials, generally researchers conducting a reverse correlation study present participants with 300 - 1,000 trials.Step 3: computing a classification image (CI)
Again, a classification image is the calculated average noise of all selected images (stimuli). Classification images can be generated for individuals or the group. Computing a classification image for individuals and groups are slightly different. To compute a classification image for an individual, the researcher will start by creating an average of the all selected images’ noise and then overlay that pattern onto the base image. Before the noise is superimposed, it is scaled to fit the base image (i.e. the smallest and largest pixel intensities are matched to the base image pixels). To generate a classification image for a group, the researcher will either handle each individual classification image separately (making sure to scale the pixels independently) or apply a dependent scaling. A dependent scaling is called such because the scaling that is applied to all classification images depends on the image with the greatest range of pixels. Using this single image and its pixel range, the researcher will match the pixels of the classification image to the pixels of the base image. The scaling factor used for this image is then applied to the remaining classification images. When choosing between these two approaches, keep in mind that in classification images with little signal, independent scaling amplifies signal and noise more than dependent scaling. If the researcher is interested in the strength of signal, it is suggested that they use dependent scaling. When calculating a classification image, it is critical to consider how your external noise will impact your signal to noise ratio (SNR). The SNR is the ratio of desired input (e.g. signal) to undesired information (e.g. noise). One way to produce a high SNR (when observers are unbiased) is to use this formula C=(+)-(+). These researchers have found the optimal experimental parameters for different study designs that will result in high SNR.Step 4: evaluating classification images and drawing conclusions
After computing classification images for individual participants and/or for the group, the researcher will use these images to draw conclusions about their research questions. However, while not always the case, occasionally after the first set of classification images have been generated, researchers will then take these images and present them to a new sample of participants and ask them to rate the images on a subsequent factor of interest. This process is referred to as a two-phase reverse correlation. For example, if a classification image was computed after participants were asked to choose the image that looked the most like a police officer, the generated classification images could then be presented to a new sample who would evaluate the images on how aggressive the faces look. This process makes it easier to draw conclusions on the data. While this step can ease in drawing conclusions, one must use caution to not collect too many participants in the second phase, because high numbers of participants will make the tiniest of differences appear significant, therefore resulting in a Type 1 ErroSee also
*References