Data Exhaust

Data exhaust or exhaust data is the trail of data left by the activities of Internet or other computer system users during their online activity, behavior, and transactions. It is part of a broader category of unconventional data that includes geospatial, network, and time-series data and may be useful for predictive analytics. Every visited website, clicked link, and even mouse hover is collected, leaving behind a trail of data. An enormous amount of often raw data is created, which can take the form of cookies, temporary files, logfiles, storable choices, and more.

This information can help to improve the online experience, for example through customized content. Studying data exhaust can also improve trend tracking, user interface design, and page layout. On the other hand, it can compromise privacy, as it offers valuable insight into the user's habits. For example, Google, the world's most popular website, uses this data exhaust to refine the predictive value of its products. The data that companies collect is often information that does not seem immediately useful. Although the company may not use the information right away, it can be stored for future use or sold to someone else who can use it. The data can help with quality control, performance, and revenue. Unlike primary content, these data are not purposefully created by the user, who is often unaware of their very existence. A bank, for example, would treat information about the sums and parties of a transaction as primary data, whilst secondary data might include the percentage of transactions carried out at a cash machine instead of at a bank branch.
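
The bank example can be made concrete with a short sketch. The Python below assumes a hypothetical list of transaction records in which `amount` and `payee` stand for the primary data and `channel` is a by-product of how the transaction was made. The field names and sample values are illustrative assumptions, not taken from any real banking system.

```python
from collections import Counter

# Hypothetical transaction records. "amount" and "payee" are primary data;
# "channel" is exhaust: a by-product of how the transaction was made.
transactions = [
    {"amount": 40.0, "payee": "cash", "channel": "atm"},
    {"amount": 1200.0, "payee": "landlord", "channel": "branch"},
    {"amount": 60.0, "payee": "cash", "channel": "atm"},
    {"amount": 25.5, "payee": "grocer", "channel": "online"},
]


def channel_shares(records):
    """Return the percentage of transactions made through each channel."""
    counts = Counter(r["channel"] for r in records)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {channel: 100.0 * n / total for channel, n in counts.items()}


shares = channel_shares(transactions)
print(shares["atm"])  # 50.0 (2 of the 4 sample transactions)
```

The secondary statistic is derived entirely from a field the customer never set out to create, which is what distinguishes it from the primary record of who paid whom and how much.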


Medical exhaust data

Most medical devices, including many pacemakers, dialysis machines, and cameras used during surgery, emit some form of exhaust data. The majority of this data is never captured and is typically discarded after the surgery is completed or the device makes its next routine check. Issues have arisen regarding the use of data captured by devices like pacemakers, and these can lead to larger questions about how exhaust data is used. Using electronic health records (EHRs) for research poses a large number of challenges, the most prevalent being the sheer volume of data. There is too much data for people to sort through and analyze by hand, which creates a need for algorithms.


Solutions

Although data exhaust is not a new concept, the ubiquity of Internet-enabled gadgetry has widened the scope and impact of our passive digital trail. The collection and distribution of data generated in this way is not illegal, but steps must be taken to ensure that its use is ethical. To protect users' privacy, the information can be anonymized before it is sold. Users can also be given the opportunity to opt out of the selling of their information. Lastly, to build trust, websites can update their privacy policies so that they list all the data they will collect about the user.
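
A minimal sketch of the first two steps, assuming hypothetical record fields (`user_id`, `opted_out`, `page`): records from users who opted out are dropped, and the remaining user identifiers are replaced with a keyed hash. Keyed hashing is strictly pseudonymization rather than full anonymization, since the remaining fields may still identify a person. The secret key and field names are illustrative assumptions.

```python
import hashlib
import hmac

# Hypothetical secret held by the data owner; in practice it would be
# generated randomly and never shared with the buyer of the data.
SECRET_KEY = b"example-key-not-for-production"


def pseudonymize(user_id: str, key: bytes = SECRET_KEY) -> str:
    """Replace a user identifier with a keyed SHA-256 digest (HMAC)."""
    return hmac.new(key, user_id.encode("utf-8"), hashlib.sha256).hexdigest()


def prepare_for_sharing(records):
    """Drop opted-out users and replace identifiers before data is shared."""
    shared = []
    for record in records:
        if record.get("opted_out", False):
            continue  # respect the user's opt-out choice
        shared.append({
            "user": pseudonymize(record["user_id"]),
            "page": record["page"],
        })
    return shared


records = [
    {"user_id": "alice", "opted_out": False, "page": "/home"},
    {"user_id": "bob", "opted_out": True, "page": "/pricing"},
    {"user_id": "alice", "opted_out": False, "page": "/checkout"},
]
shared = prepare_for_sharing(records)
```

Because the same key yields the same digest for the same user, a buyer can still follow one visitor's trail across pages without learning the original identifier. That linkability is also why this approach alone falls short of true anonymization.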


See also

* Alternative data

