Jan Leike
Jan Leike (born ) is an AI alignment researcher who has worked at DeepMind and OpenAI. He joined Anthropic in May 2024. Education Jan Leike obtained his undergraduate degree from the University of Freiburg in Germany. After earning a master's degree in computer science, he pursued a PhD in machine learning at the Australian National University under the supervision of Marcus Hutter. Career Leike made a six-month postdoctoral fellowship at the Future of Humanity Institute before joining DeepMind to focus on empirical AI safety research, where he collaborated with Shane Legg. OpenAI In 2021, Leike joined OpenAI. In June 2023, he and Ilya Sutskever became the co-leaders of the newly introduced "superalignment" project, which aimed to determine how to align future artificial superintelligences within four years to ensure their safety. This project involved automating AI alignment research using relatively advanced AI systems. At the time, Sutskever was OpenAI's Chief Scient ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
AI Alignment
In the field of artificial intelligence (AI), alignment aims to steer AI systems toward a person's or group's intended goals, preferences, or ethical principles. An AI system is considered ''aligned'' if it advances the intended objectives. A ''misaligned'' AI system pursues unintended objectives. It is often challenging for AI designers to align an AI system because it is difficult for them to specify the full range of desired and undesired behaviors. Therefore, AI designers often use simpler ''proxy goals'', such as Reinforcement learning from human feedback, gaining human approval. But proxy goals can overlook necessary constraints or reward the AI system for merely ''appearing'' aligned. AI systems may also find loopholes that allow them to accomplish their proxy goals efficiently but in unintended, sometimes harmful, ways (reward hacking). Advanced AI systems may develop unwanted Instrumental convergence, instrumental strategies, such as seeking power or survival because s ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Shane Legg
Shane Legg (born ) is a machine learning researcher and entrepreneur. With Demis Hassabis and Mustafa Suleyman, he cofounded DeepMind Technologies (later bought by Google and now called Google DeepMind), and works there as the chief AGI scientist. He is also known for his academic work on artificial general intelligence, including his thesis supervised by Marcus Hutter. Early life and education Legg attended Rotorua Lakes High School in Rotorua, on New Zealand's North Island. He completed his undergraduate studies at Waikato University in 1996. Also in 1996, he obtained his MSc degree with a thesis entitled " Solomonoff Induction", with Cristian S. Calude at the University of Auckland. Research interests In the early 2000s, Legg re-introduced and popularized with Ben Goertzel the term "artificial general intelligence" (AGI), to describe an AI that can do practically any cognitive task a human can do. At that time, talking about AGI "would put you on the lunatic fringe". Leg ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
21st-century German Scientists
File:1st century collage.png, From top left, clockwise: Jesus is crucified by Roman authorities in Judaea (17th century painting). Four different men (Galba, Otho, Vitellius, and Vespasian) claim the title of Emperor within the span of a year; The Great Fire of Rome (18th-century painting) sees the destruction of two-thirds of the city, precipitating the empire's first persecution against Christians, who are blamed for the disaster; The Roman Colosseum is built and holds its inaugural games; Roman forces besiege Jerusalem during the First Jewish–Roman War (19th-century painting); The Trưng sisters lead a rebellion against the Chinese Han dynasty (anachronistic depiction); Boudica, queen of the British Iceni leads a rebellion against Rome (19th-century statue); Knife-shaped coin of the Xin dynasty., 335px rect 30 30 737 1077 Crucifixion of Jesus rect 767 30 1815 1077 Year of the Four Emperors rect 1846 30 3223 1077 Great Fire of Rome rect 30 1108 1106 2155 Boudican revolt ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
1980s Births
__NOTOC__ Year 198 (CXCVIII) was a common year starting on Sunday of the Julian calendar. At the time, it was known as the Year of the Consulship of Sergius and Gallus (or, less frequently, year 951 ''Ab urbe condita''). The denomination 198 for this year has been used since the early medieval period, when the Anno Domini calendar era became the prevalent method in Europe for naming years. Events By place Roman Empire *January 28 ** Publius Septimius Geta, son of Septimius Severus, receives the title of Caesar. **Caracalla, son of Septimius Severus, is given the title of Augustus. China *Winter – Battle of Xiapi: The allied armies led by Cao Cao and Liu Bei defeat Lü Bu; afterward Cao Cao has him executed. By topic Religion * Marcus I succeeds Olympianus as Patriarch of Constantinople (until 211). Births * Lu Kai, Chinese official and general (d. 269) * Quan Cong, Chinese general and advisor (d. 249) Deaths * Li Jue, Chinese warlord and re ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Living People
Purpose: Because living persons may suffer personal harm from inappropriate information, we should watch their articles carefully. By adding an article to this category, it marks them with a notice about sources whenever someone tries to edit them, to remind them of WP:BLP (biographies of living persons) policy that these articles must maintain a neutral point of view, maintain factual accuracy, and be properly sourced. Recent changes to these articles are listed on Special:RecentChangesLinked/Living people. Organization: This category should not be sub-categorized. Entries are generally sorted by family name In many societies, a surname, family name, or last name is the mostly hereditary portion of one's personal name that indicates one's family. It is typically combined with a given name to form the full name of a person, although several give .... Maintenance: Individuals of advanced age (over 90), for whom there has been no new documentation in the last ten ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Australian National University Alumni
Australian(s) may refer to: Australia * Australia, a country * Australians, citizens of the Commonwealth of Australia ** European Australians ** Anglo-Celtic Australians, Australians descended principally from British colonists ** Aboriginal Australians, indigenous peoples of Australia as identified and defined within Australian law * Australia (continent) ** Indigenous Australians * Australian English, the dialect of the English language spoken in Australia * Australian Aboriginal languages * ''The Australian'', a newspaper * Australiana, things of Australian origins Other uses * Australian (horse), a racehorse * Australian, British Columbia, an unincorporated community in Canada See also * The Australian (other) * Australia (other) * * * Austrian (other) Austrian may refer to: * Austrians, someone from Austria or of Austrian descent ** Someone who is considered an Austrian citizen * Austrian German dialect * Something associated with the coun ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Machine Learning Researchers
A machine is a physical system that uses power to apply forces and control movement to perform an action. The term is commonly applied to artificial devices, such as those employing engines or motors, but also to natural biological macromolecules, such as molecular machines. Machines can be driven by animals and people, by natural forces such as wind and water, and by chemical, thermal, or electrical power, and include a system of mechanisms that shape the actuator input to achieve a specific application of output forces and movement. They can also include computers and sensors that monitor performance and plan movement, often called mechanical systems. Renaissance natural philosophers identified six simple machines which were the elementary devices that put a load into motion, and calculated the ratio of output force to input force, known today as mechanical advantage. Modern machines are complex systems that consist of structural elements, mechanisms and control compon ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Daniel Kokotajlo (researcher)
Daniel Kokotajlo is an artificial intelligence (AI) researcher. He was a researcher in the governance division of OpenAI from 2022 to 2024, and currently leads the AI Futures Project. Biography Kokotajlo is a former philosophy PhD candidate at the University of North Carolina at Chapel Hill where he was a recipient of the 2018–2019 Maynard Adams Fellowship for the Public Humanities. In 2022, he became a researcher in the governance division of OpenAI. Kokotajlo is one of the organizers of a group of OpenAI employees that claimed the company has a secretive and reckless culture that is taking grave risks in the rush to achieve artificial general intelligence (AGI). When he resigned in 2024, he refused to sign OpenAI's non-disparagement clause, which could have cost him approximately $2 million in equity. As of May 2024, Kokotajlo confirmed he retained the vested equity. In June 2024, he, with other former OpenAI employees, signed a letter arguing that top frontier AI companies h ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Artificial Superintelligence
Artificiality (the state of being artificial, anthropogenic, or man-made) is the state of being the product of intentional human manufacture, rather than occurring naturally through processes not involving or requiring human activity. Connotations Artificiality often carries with it the implication of being false, counterfeit, or deceptive. The philosopher Aristotle wrote in his ''Rhetoric'': However, artificiality does not necessarily have a negative connotation, as it may also reflect the ability of humans to replicate forms or functions arising in nature, as with an artificial heart or artificial intelligence. Political scientist and artificial intelligence expert Herbert A. Simon observes that "some artificial things are imitations of things in nature, and the imitation may use either the same basic materials as those in the natural object or quite different materials. Herbert A. Simon, ''The Sciences of the Artificial'' (1996), p. 4. Simon distinguishes between the artif ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Ilya Sutskever
Ilya Sutskever (; born 8 December 1986) is an Israeli-Canadian computer scientist who specializes in machine learning. He has made several major contributions to the field of deep learning. With Alex Krizhevsky and Geoffrey Hinton, he co-invented AlexNet, a convolutional neural network. Sutskever co-founded and was a former chief scientist at OpenAI. In 2023, he was one of the members of OpenAI's board that ousted Sam Altman from his position as the organization's CEO; Altman was reinstated a week later, and Sutskever stepped down from the board. In June 2024, Sutskever co-founded the company Safe Superintelligence alongside Daniel Gross and Daniel Levy. Early life and education Sutskever was born into a Jewish family in Nizhny Novgorod, Russia (then Gorky, Soviet Union). At the age of 5, he made aliyah with his family and lived in Jerusalem, until he was 16, when his family moved to Canada. Sutskever attended the Open University of Israel from 2000 to 2002. After moving ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
AI Safety
AI safety is an interdisciplinary field focused on preventing accidents, misuse, or other harmful consequences arising from artificial intelligence (AI) systems. It encompasses machine ethics and AI alignment, which aim to ensure AI systems are moral and beneficial, as well as monitoring AI systems for risks and enhancing their reliability . The field is particularly concerned with existential risks posed by advanced AI models. Beyond technical research, AI safety involves developing norms and policies that promote safety. It gained significant popularity in 2023, with rapid progress in generative AI and public concerns voiced by researchers and CEOs about potential dangers. During the 2023 AI Safety Summit, the United States and the United Kingdom both established their own AI Safety Institute. However, researchers have expressed concern that AI safety measures are not keeping pace with the rapid development of AI capabilities. Motivations Scholars discuss current risks from ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
DeepMind
DeepMind Technologies Limited, trading as Google DeepMind or simply DeepMind, is a British–American artificial intelligence research laboratory which serves as a subsidiary of Alphabet Inc. Founded in the UK in 2010, it was acquired by Google in 2014 and merged with Google AI's Google Brain division to become Google DeepMind in April 2023. The company is headquartered in London, with research centres in the United States, Canada, France, Germany, and Switzerland. DeepMind introduced neural Turing machines (neural networks that can access external memory like a conventional Turing machine), resulting in a computer that loosely resembles short-term memory in the human brain. DeepMind has created neural network models to play video games and board games. It made headlines in 2016 after its AlphaGo program beat a human professional Go player Lee Sedol, a world champion, in a five-game match, which was the subject of a documentary film. A more general program, AlphaZer ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |