
DeepDream is a computer vision program created by Google engineer Alexander Mordvintsev that uses a convolutional neural network to find and enhance patterns in images via algorithmic pareidolia, thus creating a dream-like appearance reminiscent of a psychedelic experience in the deliberately overprocessed images. Google's program popularized the term (deep) "dreaming" to refer to the generation of images that produce desired activations in a trained deep network, and the term now refers to a collection of related approaches.


History

The DeepDream software originated in a deep convolutional network codenamed "Inception" after the film of the same name. The network was developed for the ImageNet Large-Scale Visual Recognition Challenge (ILSVRC) in 2014, and the DeepDream software was released in July 2015. The dreaming idea and name became popular on the internet in 2015 thanks to Google's DeepDream program. The idea dates from early in the history of neural networks, and similar methods have been used to synthesize visual textures. Related visualization ideas were developed (prior to Google's work) by several research groups. After Google published their techniques and made their code open-source, a number of tools in the form of web services, mobile applications, and desktop software appeared on the market to enable users to transform their own photos.


Process

The software is designed to detect faces and other patterns in images, with the aim of automatically classifying images. However, once trained, the network can also be run in reverse, being asked to adjust the original image slightly so that a given output neuron (e.g. the one for faces or certain animals) yields a higher confidence score. This can be used for visualizations to better understand the emergent structure of the neural network, and is the basis for the DeepDream concept. This reversal procedure is never perfectly clear and unambiguous because it utilizes a one-to-many mapping process. However, after enough iterations, even imagery initially devoid of the sought features will be adjusted enough that a form of pareidolia results, by which psychedelic and surreal images are generated algorithmically.

The optimization resembles backpropagation; however, instead of adjusting the network weights, the weights are held fixed and the input is adjusted. For example, an existing image can be altered so that it is "more cat-like", and the resulting enhanced image can be fed back into the procedure. This usage resembles the activity of looking for animals or other patterns in clouds.

Applying gradient descent independently to each pixel of the input produces images in which adjacent pixels have little relation, so the image contains too much high-frequency information. The generated images can be greatly improved by including a prior or regularizer that prefers inputs that have natural image statistics (without a preference for any particular image), or that are simply smooth. For example, Mahendran et al. used the total variation regularizer, which prefers images that are piecewise constant. Various regularizers are discussed further in Yosinski et al. An in-depth, visual exploration of feature visualization and regularization techniques was published more recently.

The cited resemblance of the imagery to LSD- and psilocybin-induced hallucinations is suggestive of a functional resemblance between artificial neural networks and particular layers of the visual cortex. Neural networks such as DeepDream have biological analogies that provide insight into brain processing and the formation of consciousness. Hallucinogens such as DMT alter the function of the serotonergic system, which is present within the layers of the visual cortex. Neural networks are trained on input vectors and are altered by internal variations during the training process. The input and the internal modifications represent the processing of exogenous and endogenous signals, respectively, in the visual cortex. As internal variations are modified in deep neural networks, the output image reflects these changes. This specific manipulation demonstrates how inner brain mechanisms are analogous to the internal layers of neural networks. Modifications of the internal noise level represent how hallucinogens cause external sensory information to be omitted, leading internal preconceptions to strongly influence visual perception.
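
The input-optimization procedure described in this section (weights held fixed, input adjusted, optionally with a smoothness regularizer) can be illustrated with a minimal Python sketch. It is a toy under stated assumptions, not DeepDream's actual implementation: the "image" is a short 1-D signal, the "network" is a single fixed filter followed by a ReLU, and every name (`FILTER`, `dream`, `roughness`, and so on) is hypothetical. The squared-difference penalty is a simple stand-in for a smoothness prior and is not the total variation regularizer used by Mahendran et al.; normalizing each step by the mean absolute gradient is likewise an assumed stabilization choice.

```python
# Toy sketch: a 1-D "image" and a one-filter "network". All names are hypothetical.

FILTER = (-1.0, 2.0, -1.0)  # fixed weights standing in for a trained network


def activations(signal):
    """Forward pass: valid 1-D correlation with FILTER, then ReLU."""
    k = len(FILTER)
    return [
        max(0.0, sum(FILTER[j] * signal[i + j] for j in range(k)))
        for i in range(len(signal) - k + 1)
    ]


def objective(signal):
    """Quantity to increase: half the sum of squared activations."""
    return 0.5 * sum(a * a for a in activations(signal))


def objective_gradient(signal):
    """Gradient of objective() with respect to the input (chain rule)."""
    grad = [0.0] * len(signal)
    for i, a in enumerate(activations(signal)):
        for j, w in enumerate(FILTER):
            # Units with a == 0 (inactive ReLU) contribute nothing.
            grad[i + j] += a * w
    return grad


def roughness(signal):
    """Smoothness penalty: sum of squared differences between neighbors."""
    return sum((b - a) ** 2 for a, b in zip(signal, signal[1:]))


def roughness_gradient(signal):
    """Gradient of roughness() with respect to the input."""
    grad = [0.0] * len(signal)
    for i in range(len(signal) - 1):
        d = signal[i + 1] - signal[i]
        grad[i] -= 2.0 * d
        grad[i + 1] += 2.0 * d
    return grad


def dream(signal, steps=20, step_size=0.05, smooth_weight=0.0):
    """Gradient ascent on the input; FILTER is never modified."""
    x = list(signal)  # work on a copy so the caller's input is untouched
    for _ in range(steps):
        g_obj = objective_gradient(x)
        g_rough = roughness_gradient(x)
        direction = [go - smooth_weight * gr for go, gr in zip(g_obj, g_rough)]
        # Normalize by the mean absolute gradient (an assumed stabilization choice).
        scale = sum(abs(d) for d in direction) / len(direction) + 1e-8
        x = [xi + step_size * d / scale for xi, d in zip(x, direction)]
    return x
```

Calling `dream(signal)` returns a modified copy of the signal whose activations under the fixed filter are stronger than before. Passing a positive `smooth_weight` subtracts the roughness gradient from each update, which is one simple way to discourage the high-frequency noise mentioned above.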


Usage

The dreaming idea can be applied to hidden (internal) neurons other than those in the output, which allows exploration of the roles and representations of various parts of the network. It is also possible to optimize the input to satisfy either a single neuron (this usage is sometimes called Activity Maximization) or an entire layer of neurons.

While dreaming is most often used for visualizing networks or producing computer art, it has been proposed that adding "dreamed" inputs to the training set can improve training times for abstractions in computer science.

The DeepDream model has also been demonstrated to have application in the field of art history. DeepDream was used for Foster the People's music video for the song "Doing It for the Money".

In 2017, a research group at the University of Sussex created a Hallucination Machine, applying the DeepDream algorithm to a pre-recorded panoramic video so that users could explore virtual reality environments that mimic the experience of psychoactive substances and/or psychopathological conditions. The group demonstrated that the subjective experiences induced by the Hallucination Machine differed significantly from control (non-"hallucinogenic") videos, while bearing phenomenological similarities to the psychedelic state (following administration of psilocybin).

In 2021, a study published in the journal Entropy demonstrated the similarity between DeepDream and actual psychedelic experience with neuroscientific evidence. The authors recorded the electroencephalography (EEG) of human participants during passive viewing of a movie clip and of its DeepDream-generated counterpart. They found that the DeepDream video triggered a higher entropy in the EEG signal and a higher level of functional connectivity between brain areas, both well-known biomarkers of actual psychedelic experience.

In 2022, a research group coordinated by the University of Trento set out to "measure participants’ cognitive flexibility and creativity after the exposure to virtual reality panoramic videos and their hallucinatory-like counterparts generated by the DeepDream algorithm ... following the simulated psychedelic exposure, individuals exhibited ... an attenuated contribution of the automatic process and chaotic dynamics underlying their decision processes, presumably due to a reorganization in the cognitive dynamics that facilitates the exploration of uncommon decision strategies and inhibits automated choices."
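
The distinction drawn in this section between optimizing the input for a single neuron and for an entire layer comes down to which objective's gradient is followed. The Python sketch below is a hedged toy example: the two-neuron hidden layer, its `WEIGHTS`, and all function names are hypothetical and chosen only for illustration, and the layer objective (half the sum of squared activations) is one assumed choice among several possible ones.

```python
# Toy sketch: one hidden ReLU layer with fixed weights. All names are hypothetical.

WEIGHTS = (
    (1.0, -1.0, 0.0),   # neuron 0
    (0.0, 1.0, -1.0),   # neuron 1
)


def hidden(x):
    """Activations of the hidden layer for input x."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in WEIGHTS]


def layer_energy(x):
    """Whole-layer objective: half the sum of squared activations."""
    return 0.5 * sum(h * h for h in hidden(x))


def grad_single_neuron(x, j):
    """Gradient of hidden(x)[j] with respect to x."""
    pre = sum(w * xi for w, xi in zip(WEIGHTS[j], x))
    return list(WEIGHTS[j]) if pre > 0 else [0.0] * len(x)


def grad_whole_layer(x):
    """Gradient of layer_energy(x) with respect to x."""
    h = hidden(x)
    return [
        sum(h[j] * WEIGHTS[j][i] for j in range(len(WEIGHTS)))
        for i in range(len(x))
    ]


def ascend(x, grad_fn, steps=10, step_size=0.1):
    """Plain gradient ascent on the input; WEIGHTS stay fixed."""
    x = list(x)  # copy so the caller's input is not modified
    for _ in range(steps):
        g = grad_fn(x)
        x = [xi + step_size * gi for xi, gi in zip(x, g)]
    return x


x0 = [0.5, 0.2, 0.1]
x_single = ascend(x0, lambda x: grad_single_neuron(x, 0))  # favor neuron 0 only
x_layer = ascend(x0, grad_whole_layer)                     # favor the whole layer
```

Comparing `hidden(x_single)` with `hidden(x_layer)` shows how the choice of objective steers the input toward different patterns, even though the network itself is unchanged.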


See also

* Artificial imagination
* DALL-E
* Feature detection (computer vision)
* Hallucination (artificial intelligence)
* Neural style transfer
* Procedural textures
* Psychedelic art
* Texture synthesis


External links

* Mordvintsev, Alexander; Olah, Christopher; Tyka, Mike (June 17, 2015). "Inceptionism: Going Deeper into Neural Networks". https://research.googleblog.com/2015/06/inceptionism-going-deeper-into-neural.html (archived 2015-07-03 at https://web.archive.org/web/20150703064823/http://googleresearch.blogspot.co.uk/2015/06/inceptionism-going-deeper-into-neural.html)