HOME

TheInfoList



OR:

Catpac is a computer program that analyzes text samples to identify key concepts contained within the sample. It was conceived chiefly by Richard Holmes, a Michigan State computer programmer and Dr. Joseph Woelfel, a University at Albany and
University at Buffalo The State University of New York at Buffalo (commonly referred to as UB, University at Buffalo, and sometimes SUNY Buffalo) is a public university, public research university in Buffalo, New York, Buffalo and Amherst, New York, United States. ...
sociologist for the analysis of attitude formation and change in the sociological context. Contributions by Rob Zimmelman, an undergraduate and graduate student at the University of Albany, from 1981 to 1984 on the Univac 1100 mainframe, included the inclusion of the CATPAC software in the Galileo*Telegal system, text-labeling and porting of CATPAC output for the Galileo system of paired-comparison conceptual visualization. CATPAC and the Galileo system are still in commercial use today, and with recent data capture and visualization contributions, continues to grow. Contributions by other students at the university resulted in the software that is still in commercial use today. It uses text files as input and produces output such as word and alphabetical frequencies as well as various types of cluster analysis.


Design

Catpac is a
self-organizing Self-organization, also called spontaneous order in the social sciences, is a process where some form of overall order and disorder, order arises from local interactions between parts of an initially disordered system. The process can be spont ...
, i.e. unsupervised, interactive activation and competition (IAC)
artificial neural network In machine learning, a neural network (also artificial neural network or neural net, abbreviated ANN or NN) is a computational model inspired by the structure and functions of biological neural networks. A neural network consists of connected ...
used for text analysis. The program generates a multidimensional
scalar Scalar may refer to: *Scalar (mathematics), an element of a field, which is used to define a vector space, usually the field of real numbers *Scalar (physics), a physical quantity that can be described by a single element of a number field such a ...
output organizing words throughout the text by creating a weighted word-by-word
matrix Matrix (: matrices or matrixes) or MATRIX may refer to: Science and mathematics * Matrix (mathematics), a rectangular array of numbers, symbols or expressions * Matrix (logic), part of a formula in prenex normal form * Matrix (biology), the m ...
that establishes the
eigenvector In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by ...
centralities of concepts. The word-by-word matrix represents the relationship between one word and the occurrence of another. Catpac identifies important words and patterns based on the organization of the text. This process mimics the connections between neurons in a human brain, strengthening connections through conditioning to generate a pattern of similarities among all words within a body of text.


Use

Catpac has been used in commercial studies, in academic scholarship to investigate massive textual data sets, as a strong semantic network analysis tool, for longitudinal analyses, for multilingual analyses, as a predictor of media usage and as a powerful content analysis tool.


Availability

Catpac, conceived as an improvement to simple word-count software more than 30 years ago, is currently available in windows 32 bit format.


References

{{reflist


External links


Bibliography of research using Catpac
Data analysis software