Yandex Search
   HOME

TheInfoList



OR:

Yandex Search () is a
search engine A search engine is a software system that provides hyperlinks to web pages, and other relevant information on World Wide Web, the Web in response to a user's web query, query. The user enters a query in a web browser or a mobile app, and the sea ...
owned by the company Yandex, based in
Russia Russia, or the Russian Federation, is a country spanning Eastern Europe and North Asia. It is the list of countries and dependencies by area, largest country in the world, and extends across Time in Russia, eleven time zones, sharing Borders ...
. In January 2015, Yandex Search generated 51.2% of all of the search traffic in Russia according to . In February 2024, Yandex N.V. announced the sale of the majority of its Russia-based assets to a consortium of Russia-based investors. In July 2024, the sale was completed, giving the Kremlin more control over the business.


About

The search technology provides local search results in more than 1,400 cities. Yandex Search also features “parallel” search that presents results from both main web index and specialized information resources, including news, shopping, blogs, images and videos on a single page. Yandex Search is responsive to real-time queries, recognizing when a query requires the most current information, such as breaking news or the most recent post on
Twitter Twitter, officially known as X since 2023, is an American microblogging and social networking service. It is one of the world's largest social media platforms and one of the most-visited websites. Users can share short text messages, image ...
on a particular topic. It also contains some additional features: Wizard Answer, which provides additional information (for example, sports results),
spell checker In software, a spell checker (or spelling checker or spell check) is a software feature that checks for misspellings in a text. Spell-checking features are often embedded in software or services, such as a word processor, email client, electronic ...
,
autocomplete Autocomplete, or word completion, is a feature in which an application software, application predicts the rest of a word a user is typing. In Android (operating system), Android and iOS smartphones, this is called predictive text. In graphical us ...
which suggests queries as-you-type, antivirus that detects
malware Malware (a portmanteau of ''malicious software'')Tahir, R. (2018)A study on malware and malware detection techniques . ''International Journal of Education and Management Engineering'', ''8''(2), 20. is any software intentionally designed to caus ...
on webpages and so on. In May 2010, Yandex launched Yandex.com, a platform for
beta testing Software testing is the act of checking whether software satisfies expectations. Software testing can provide objective, independent information about the quality of software and the risk of its failure to a user or sponsor. Software test ...
and improving non-Russian language search.Matt McGee, Search Engine Land
Russia’s Yandex Search Engine Goes Global
. Retrieved 2011-04-30.
The search product can be accessed from personal computers, mobile phones, tablets and other digital devices. In addition to web search, Yandex provides a wide range of specialized search services. In 2009, Yandex launched MatrixNet, a new method of machine learning that significantly improves the relevance of search results. It allows Yandex’s search engine to take into account a very large number of factors when it makes the decision about relevancy of search results.MatrixNet: New Level of Search Quality
. Retrieved 2011-04-30.
Another technology, Spectrum, was launched in 2010. It allows inferring implicit queries and returning matching search results. The system automatically analyses users' searches and identifies objects like personal names, films or cars. Proportions of the search results responding to different user intents are based on the user demand for these results. With the first release on 21 July 2017, Brave web browser features Yandex as one of its default search engines. In March 2022, during the
Russian invasion of Ukraine On 24 February 2022, , starting the largest and deadliest war in Europe since World War II, in a major escalation of the Russo-Ukrainian War, conflict between the two countries which began in 2014. The fighting has caused hundreds of thou ...
, Yandex and Mail.ru were removed as optional search providers from the Mozilla Firefox browser.


Functionality


Overview

The search engine consists of three main components: # An agent is a search robot. It bypasses the network, downloads and analyzes documents. If a new link is found during site analysis, it falls into the list of web addresses of the robot. Search robots are of the following types: ''spiders'' - download sites like the user's browsers; Crawler - discover new, still unknown links based on the analysis of already known documents; ''indexers'' - analyze the detected web pages and add data to the index''.'' Many deflated documents are divided into disjoint parts and are cleared from the markup. #Index is a database compiled by search engine indexing robots. Documents are searched in the index. #Search engine. The search request from the user is sent to the least loaded server after analyzing the load of the search system. To provide such an opportunity, Yandex servers are clustered. Then, the user request is processed by a program called "Metapoisk". Metapoisk analyzes the request in real time: it determines the geographic location of the user, conducts linguistic analysis, etc. The program also determines whether the request belongs to the category of the most popular or recently defined.The issuance of such requests for some time is stored in the memory (cache) of the metasearch, and in case of a match, previously saved results are displayed. If the request is rare and there are no matches in the cache, the system redirects it to the Basic Search program. It analyzes the system index, which is also divided into different duplicate servers (this speeds up the procedure). Then the received information again falls into meta-search, the data is ranked and shown to the user in a final form.


Indexing

In general, Yandex indexes the following file types:
html Hypertext Markup Language (HTML) is the standard markup language for documents designed to be displayed in a web browser. It defines the content and structure of web content. It is often assisted by technologies such as Cascading Style Sheets ( ...
,
pdf Portable document format (PDF), standardized as ISO 32000, is a file format developed by Adobe Inc., Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, computer hardware, ...
, rtf, doc, xls, ppt, docx, odt, odp, ods, odg, xlsx, pptx. The search engine is also able to index text inside Shockwave Flash objects (if the text is not placed on the image itself), if these elements are transferred as a separate page, which has the MIME type application/x-shockwave-flash , and files with the extension .swf Yandex has 2 scanning robots - the “main” and the “fast”. The first is responsible for the whole Internet, the second indexes sites with frequently changing and updating information (news sites and news agencies). In 2010, the “fast” robot received a new technology called “Orange”, developed jointly by the California and Moscow divisions of Yandex. Since 2009, Yandex has supported Sitemaps technology.


Server logs

In the server logs, Yandex robots are represented as follows: * Mozilla/5.0 (compatible; YandexBot/3.0) is the main indexing bot. * Mozilla/5.0 (compatible; YandexBot/3.0; MirrorDetector) - a bot that detects site mirrors. If there are several sites with the same content, only one will be shown in the search results. *Mozilla/5.0 (compatible; YandexImages/3.0) - Yandex image indexer *Mozilla/5.0 (compatible; YandexVideo/3.0) - Yandex video indexer *Mozilla/5.0 (compatible; YandexMedia/3.0) - multimedia data indexer *Mozilla/5.0 (compatible; YandexBlogs/0.99; robot) is a search bot that indexes post comments. *Mozilla/5.0 (compatible; YandexAddurl/2.0) - is a search bot that indexes pages through the "Add URL" form. *Mozilla/5.0 (compatible; YandexDirect/2.0; Dyatel) - checks Yandex Direct *Mozilla/5.0 (compatible; YandexMetrika/2.0) - Yandex Metrics indexer *Mozilla/5.0 (compatible; YandexCatalog/3.0; Dyatel) - checks Yandex Catalog * Mozilla/5.0 (compatible; YandexNews/3.0) - Yandex News indexer * Mozilla/5.0 (compatible; YandexAntivirus/2.0) - Yandex anti-virus bot



Query language

The following operators are used for setting: * "" - exact quote * , - enter between words, if you need to find one of them * * - enter between words, if some word is missing * site: - search on a specific site * date: - search for documents by date, for example, date: 2007 * + - enter before the word, that should be in the document


Search results

Yandex, automatically, along with the original “exact form” of the query, searches for its various variations and formulations. The Yandex search takes into account the morphology of the Russian language, therefore, regardless of the form of the word in the search query, the search will be performed for all word forms. If morphological analysis is undesirable, you can put an exclamation mark (!) Before the word — the search in this case will show only the specific form of the word. In addition, the search query practically does not take into account the so-called stop-words, that is, prepositions,
punctuation Punctuation marks are marks indicating how a piece of writing, written text should be read (silently or aloud) and, consequently, understood. The oldest known examples of punctuation marks were found in the Mesha Stele from the 9th century BC, c ...
, pronouns, etc., due to their wide distribution As a rule, abbreviations are automatically disclosing, spelling is correcting. It also searches for synonyms (mobile - cellular). The extension of the original user request depends on the context. Expansion does not occur when a set of highly specialized terms, names of proper names of companies (for example, OJSC “Hippo” - OJSC “Hippopotamus”), adding the word “price”, in exact quotes (these are queries highlighted with typewriter quotes). Search results for each user are formed individually based on their location, language of a query, interests and preferences based on the results of previous and current search sessions. However, the key factor in ranking search results is their relevance to the search query. Relevance is determined based on a ranking formula, which is constantly updated based on machine learning algorithms. The search is performed in Russian, English, French, German, Ukrainian, Belarusian, Tatar, Kazakh. Search results can be sorted by relevance and by date (buttons below the search results). The page with the search results consists of 10 links with short annotations - “snippets”. The snippets includes a text comment, link, address, popular sections of the site, pages on social networks, etc. As an alternative to snippets, Yandex introduced in 2014 a new interface called “Islands”. Yandex implements the “parallel searches” mechanism, when together with a web search, a search is performed on Yandex services, such as Catalog, News, Market, Encyclopedias, Images, etc. As a result, in response to a user's request, the system shows not only textual information, but also links to video files, pictures, dictionary entries, etc. A distinctive feature of the search engine is also the technology of "intent search" that mean a search for solving a problem. Intent search elements are - dialog prompts in case of ambiguous request, automatic text translation, information about the characteristics of the requested car, etc. For example, when you request “
Boris Grebenshchikov Boris Borisovich Grebenshchikov (; born ) is a prominent member of the generation which is widely considered to be the "founding fathers" of Russian rock music. He is the founder and lead singer of the band Aquarium which has been active since ...
- Golden City”, the system will show a form for online listening to music from the Yandex Music service, at the request of "st. Koroleva 12 " will be shown a fragment of the map with the marked object on it.


Promotion of misinformation and propaganda

Search results from the Yandex search engine tend to favor Russian media sources, including state media, and Yandex-delivered ads tend to promote misinformation and propaganda produced by more than half a dozen Russian-language news sites. One study found that Yandex-delivered adverts ran alongside false stories about US bioweapons labs in Ukraine, claims that Ukrainian President Volodymyr Zelenskiy is a drug user, and reports repeating Kremlin claims that the war against Ukraine is going entirely to plan. Other fake news promoted by Yandex ads referred to the Russian invasion by using Kremlin talking points, calling the war an “operation to denazify and demilitarise Ukraine”. Another analysis found that Yandex directs Russian speakers worldwide to manipulated information and often to outright disinformation.


Spam and virus protection

In 2013, Yandex was considered by some to be the safest search engine at the time and the third most secure among all web resources. By 2016, Yandex had slipped down to third with Google being first. Checking web pages and warning users appeared on Yandex in 2009: since then, on the search results page, next to a dangerous site there is a note “This site may threaten the security of your computer”. Two technologies at once are used to detect threats. The first was purchased from the American antivirus Sophos and based on a signature approach: that means, when accessing a web page, the
antivirus Antivirus software (abbreviated to AV software), also known as anti-malware, is a computer program used to prevent, detect, and remove malware. Antivirus software was originally developed to detect and remove computer viruses, hence the name ...
system also accesses a
database In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...
of already known viruses and
malware Malware (a portmanteau of ''malicious software'')Tahir, R. (2018)A study on malware and malware detection techniques . ''International Journal of Education and Management Engineering'', ''8''(2), 20. is any software intentionally designed to caus ...
. This approach is fast, but practically powerless against new viruses that have not yet entered the database. Therefore, Yandex along with the signature also uses its own antivirus complex, based on an analysis of the behavioral factor. The Yandex program, when accessing the site, checks whether the latter requested additional files from the browser, redirected it to an extraneous resource, etc. Thus, if information is received that the site begins to perform certain actions (cascading style sheets,
JavaScript JavaScript (), often abbreviated as JS, is a programming language and core technology of the World Wide Web, alongside HTML and CSS. Ninety-nine percent of websites use JavaScript on the client side for webpage behavior. Web browsers have ...
modules are launched and complete programs) without user permission, it is placed in the “black list” and in the database of virus signatures. Information about the infection of the site appears in the search results, and through the Yandex.Webmaster service the owner of the site receives a notification. After the first check, Yandex does the second, and if the infection information is confirmed a second time, the checks will be more frequent until the threat is eliminated. The total number of infected sites in the Yandex database does not exceed 1%. Every day in 2013, Yandex checks 23 million web pages (while detecting 4,300 dangerous sites) and shows users 8 million warnings. Approximately one billion sites are checked monthly.


Search ranking

For a long time, the key ranking factor for Yandex was the number of third-party links to a particular site. Each page on the Internet was assigned a unique citation index, similar to the index for authors of scientific articles: the more links, the better. A similar mechanism was implemented in the Yandex and in the Google's
PageRank PageRank (PR) is an algorithm used by Google Search to rank web pages in their search engine results. It is named after both the term "web page" and co-founder Larry Page. PageRank is a way of measuring the importance of website pages. Accordin ...
. In order to prevent
cheating Cheating generally describes various actions designed to subvert or disobey rules in order to obtain unfair advantages without being noticed. This includes acts of bribery, cronyism and nepotism in any situation where individuals are given pr ...
, Yandex uses multivariate analysis, in which only 70 of the 800 factors are affected by the number of third-party links. Today, the content of the site and the presence or absence of keywords there, the ease of reading the text, the name of the domain, its history and the presence of multimedia content play a much greater role. On 5 December 2013 Yandex announced a complete refusal of accounting the link factor in the future.


Search hints

As the user types the query in the search bar, the search engine offers hints in the form of a drop-down list. Hints appear even before the search results appears and allow you to refine the query, correct the layout or typo, or go directly to the site you are looking for. For each user, hints are generated based on the history of their search queries using the My Finds service. In 2012, the so-called “Smart Search Hints” appeared, which instantly give out information about the main constants (equator length, speed of light, and so on), traffic jams, and have a built-in calculator. In addition, a translator was integrated in the “Hints” (the query “love in French” instantly gives out ''amour, affection'' ), the schedule and results of football matches, exchange rates, weather forecasts and more. You can find out the exact time by asking "what time is it." In 2011, Hints in the search for Yandex became completely local to 83 regions of Russia. In addition to the actual search, Hints are built into Yandex search engines. Dictionaries ”,“ Yandex. Market ”,“ Yandex. Maps "and other Yandex services. The hint function is a consequence of the development of the technology of intent search and first appeared on Yandex.Bar in August 2007, and in October 2008 it was introduced on the main page of the search engine. Available both in the desktop and mobile versions of the site, Yandex shows its users more than a billion search hints per day


History

Changes in the search engine for a long time were not widely represented and remained nameless. And only from the beginning of 2008, when the launch of algorithm ''8 SP1'' was announced, Yandex announced that henceforth the new ranking algorithms will bear the names of cities.


1990s

The name of the system - Yandex, - was invented together by Arkady Volozh and Ilya Segalovich. The word stands for yet another indexer (or as “ I am (''"ya"'' ''in Russian'' language) and index )”. According to the interpretation of Artemy Lebedev, the name of the search engine is consonant with Yandeks, where yang means the masculine beginning, The yandex.ru search engine was announced by CompTek on 23 September 1997 at the Softool exhibition, although some developments in the field of search (
Bible The Bible is a collection of religious texts that are central to Christianity and Judaism, and esteemed in other Abrahamic religions such as Islam. The Bible is an anthology (a compilation of texts of a variety of forms) originally writt ...
indexing, searching for documents on
CD-ROM A CD-ROM (, compact disc read-only memory) is a type of read-only memory consisting of a pre-pressed optical compact disc that contains computer data storage, data computers can read, but not write or erase. Some CDs, called enhanced CDs, hold b ...
, site search) were carried out by the company even earlier. The first index contained information on 5 thousand servers and occupied 4.5 GB. In the same 1997, the search for Yandex began to be used in the Russian version of
Internet Explorer Internet Explorer (formerly Microsoft Internet Explorer and Windows Internet Explorer, commonly abbreviated as IE or MSIE) is a deprecation, retired series of graphical user interface, graphical web browsers developed by Microsoft that were u ...
4.0. It became possible to query in natural language. In 1998, the function “find similar documents” appeared for each search result. “Yandex. Search ”as of 1998 worked on three machines running on
FreeBSD FreeBSD is a free-software Unix-like operating system descended from the Berkeley Software Distribution (BSD). The first version was released in 1993 developed from 386BSD, one of the first fully functional and free Unix clones on affordable ...
under Apache: one machine crawled the Internet and indexed documents, one search engine, and one machine duplicated the search engine. In 1999, a search appeared in the categories - search, a combination of a search engine and a catalog. The version of the search engine was updated.


2000

On 6 June 2000 the second version of the
search engine A search engine is a software system that provides hyperlinks to web pages, and other relevant information on World Wide Web, the Web in response to a user's web query, query. The user enters a query in a web browser or a mobile app, and the sea ...
was presented. A parallel search mechanism was introduced, and along with the issuance,
information Information is an Abstraction, abstract concept that refers to something which has the power Communication, to inform. At the most fundamental level, it pertains to the Interpretation (philosophy), interpretation (perhaps Interpretation (log ...
was offered from large sources. Users were able to limit the search results to the selected topic. The heading “Popular finds” appeared - words that refine the search. In December 2000, the volume of indexed information reached 355.22 GB.


2001

In 2001, Yandex overtook another Russian search engine, Rambler, in terms of attendance, and became the leading search engine of Runet. Yandex began to understand requests in a natural language that were asked in interrogative form. The system has learned to recognize typos and suggest correcting them. The design has changed.


2002

The number of daily queries to the Yandex search engine exceeded 2 million


2003

Indexing . rtf and .
pdf Portable document format (PDF), standardized as ISO 32000, is a file format developed by Adobe Inc., Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, computer hardware, ...
documents was launched. Search results began to be issued including in XML format.


2004

The ranking algorithm has changed. Yandex began indexing documents in . swf ( Flash). xls and . ppt formats. At the end of the year, the study
Some Aspects of Full-Text Search and Ranking in Yandex
” was published (authors Ilya Segalovich, Mikhail Maslov ), which revealed certain ranking details in a search engine.


2005

In summer, the so-called “fast” search robot was launched, working in parallel with the actual pages intended for indexing. The base of the "fast robot" is updated every 1.5–2 hours. The ranking algorithm has been improved to increase search accuracy. Search capabilities have been expanded with the help of Yandex. Dictionaries ”and“ Yandex. Lingvo ". The search engine has learned to understand queries like “''What is omethingin Spanish''” and automatically translate them. It became possible to limit search results by region.


2006

Since May 2006, site icons have been displayed in the search results. In early December, next to each link in the results of search appeared the item “Saved copy”, clicking on which, the user goes to a full copy of the page in a special archive
database In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...
(“Yandex cache”).


2007

Ranking algorithm changed again.


2008

In 2008, Yandex for the first time began to openly announce changes in the
search algorithm In computer science, a search algorithm is an algorithm designed to solve a search problem. Search algorithms work to retrieve information stored within particular data structure, or calculated in the Feasible region, search space of a problem do ...
and started to name the changes with the names of Russian cities. The name of the “city” of each subsequent algorithm begins with the letter that the name of the previous one ended with.


2020

In April 2020, the search engine started artificially placing negative commentary about
Alexei Navalny Alexei Anatolyevich Navalny (, ; 4 June 197616 February 2024) was a Russian Opposition to Vladimir Putin in Russia, opposition leader, anti-corruption in Russia, corruption activist and political prisoner. He founded the Anti-Corruption Found ...
on the top positions in its search results for his name. Yandex declared this was part of an "experiment" and returned to presenting organic search results.


Achievements

According to media expert Mikhail Gurevich, Yandex is a “national treasure”, a “strategic product”. This fact was also recognized in the
State Duma The State Duma is the lower house of the Federal Assembly (Russia), Federal Assembly of Russia, with the upper house being the Federation Council (Russia), Federation Council. It was established by the Constitution of Russia, Constitution of t ...
of the
Russian Federation Russia, or the Russian Federation, is a country spanning Eastern Europe and North Asia. It is the list of countries and dependencies by area, largest country in the world, and extends across Time in Russia, eleven time zones, sharing Borders ...
, where in May 2012 a bill appeared in which Yandex and VKontakte are recognized by strategic enterprises as national information translators. In 2009, President of Russia Dmitry Medvedev initiated the purchase of a “ golden share” of Yandex by Sberbank in order to avoid an important nationwide company falling into foreign hands. In November 2019 Sberbank announced that it would give up its golden share, and the following month Yandex shareholders voted to approve a corporate restructuring backed by the Russian government which would invest control of the golden share in a new public interest foundation, to be implemented by the end of the first quarter of 2020, after Sberbank had previously agreed to sell the golden share for one euro. In 2012, Yandex overtook Channel One in terms of daily audience, which made the Yandex a leader in the domestic media market. In 2013, Yandex confirmed this status, overtaking First in terms of revenue. In 2008, Yandex was the ninth search engine in the world, in 2009 the seventh, and in 2013 the fourth. One of the components of this situation is the presence in Russia of a sufficient number of mathematically savvy specialists with a scientific instinct. By 2002, the word Yandex became so common that when Arkady Volozh`s company demanded to return the yandex.com domain, bought by third parties, the defendant stated that the word "Yandex" was already synonymous with the search and became a household word in Russia. Since late 2012, the Yandex
search engine A search engine is a software system that provides hyperlinks to web pages, and other relevant information on World Wide Web, the Web in response to a user's web query, query. The user enters a query in a web browser or a mobile app, and the sea ...
has outperformed the number of
Google Google LLC (, ) is an American multinational corporation and technology company focusing on online advertising, search engine technology, cloud computing, computer software, quantum computing, e-commerce, consumer electronics, and artificial ...
users on the
Google Chrome Google Chrome is a web browser developed by Google. It was first released in 2008 for Microsoft Windows, built with free software components from Apple WebKit and Mozilla Firefox. Versions were later released for Linux, macOS, iOS, iPadOS, an ...
browser in
Russia Russia, or the Russian Federation, is a country spanning Eastern Europe and North Asia. It is the list of countries and dependencies by area, largest country in the world, and extends across Time in Russia, eleven time zones, sharing Borders ...
.


Logo

The Yandex logo appears in numerous settings to identify the search engine company. Yandex has relied on several logos since its renaming, with the first logo created by Arkady Volozh and debuted in 1997 on Яndex.Site and Яndex.CD products, even before the announcement of the Yandex search engine. The logo was designed analog to the CompTek logo. Since 1997 the logos are designed by Art. Lebedev Studios, — which designed four versions. The current logo uses Cyrillic words.


See also

* List of search engines * Comparison of search engines


References


External links

{{Web search engines Internet search engines Yandex Search engine optimization