Gregory Piatetsky-Shapiro

Gregory I. Piatetsky-Shapiro (born 7 April 1958) is a data scientist and the co-founder of the KDD conferences, and co-founder and past chair of the Association for Computing Machinery SIGKDD group for Knowledge Discovery, Data Mining and Data Science. He is the founder and president of KDnuggets, a discussion and learning website for Business Analytics, Data Mining and Data Science.


Early life

A Jewish refugee from the Soviet Union, Gregory Piatetsky was born in Moscow, Russia to Inna Mogilevskaya and mathematician Ilya Piatetski-Shapiro. He was admitted in 1970 to Physics-Mathematics School no. 2, a leading math school in Moscow. In March 1974, Piatetsky emigrated to Israel with his family, studying mathematics and computer science at Tel Aviv University and for one semester at Technion. He subsequently earned MS (1979) and Ph.D. (1984) degrees from NYU's Courant Institute. In 1984, his first paper was published in SIGMOD, proving that secondary index selection is NP-complete by reduction from the set cover problem. In his dissertation, he proved that the greedy method for set cover has a lower bound of 1 - 1/e ≈ 63% of the optimal.
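
The 1 - 1/e figure is usually stated for the budgeted form of the problem: choose at most k sets so that as many elements as possible are covered, where the greedy rule is guaranteed to reach at least 1 - 1/e of the best possible coverage. The following is a minimal sketch of that greedy rule under this assumption. It is not taken from the dissertation, and the names `greedy_max_coverage`, `coverage` and `CANDIDATES` are hypothetical, chosen only for illustration.

```python
from typing import Dict, Hashable, Iterable, List, Set


def greedy_max_coverage(sets: Dict[str, Set[Hashable]], k: int) -> List[str]:
    """Pick up to k sets, each time taking the one that covers the most new elements."""
    covered: Set[Hashable] = set()
    chosen: List[str] = []
    for _ in range(k):
        best_name = None
        best_gain = 0
        # Iterate in sorted order so that ties are broken deterministically.
        for name in sorted(sets):
            if name in chosen:
                continue
            gain = len(sets[name] - covered)  # elements this set would newly cover
            if gain > best_gain:
                best_name, best_gain = name, gain
        if best_name is None:  # no remaining set adds any coverage
            break
        chosen.append(best_name)
        covered |= sets[best_name]
    return chosen


def coverage(sets: Dict[str, Set[Hashable]], names: Iterable[str]) -> int:
    """Number of distinct elements covered by the named sets."""
    covered: Set[Hashable] = set()
    for name in names:
        covered |= sets[name]
    return len(covered)


# Hypothetical toy instance: greedy takes "middle" first and ends up with
# 7 covered elements, while the best pair ("left", "right") covers 8.
CANDIDATES = {
    "left": {1, 2, 3, 4},
    "middle": {3, 4, 5, 6, 9},
    "right": {5, 6, 7, 8},
}

if __name__ == "__main__":
    picked = greedy_max_coverage(CANDIDATES, k=2)
    print(picked, coverage(CANDIDATES, picked))
```

On this toy instance the greedy choice covers 7 of the 8 elements that the best pair covers, a ratio of 0.875, which is above the 1 - 1/e ≈ 0.632 guarantee.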


Career

He joined GTE Laboratories, where he worked on intelligent interfaces relating to databases. In 1989, he proposed a new project at GTE called "Knowledge Discovery in Databases". The project created advanced prototypes, including KEFIR (Key Findings Reporter), a system for analysis and summarization of key changes in large databases, which was a forerunner of systems like Google Analytics Intelligence. A KEFIR prototype was applied to GTE health care data and received GTE's highest technical award. In 1997, he left GTE to join Knowledge Stream Partners (KSP), where he was Director and later Vice President and Chief Scientist. In April 2000, KSP was acquired by Xchange, Inc., where Piatetsky served as VP and Chief Scientist. Piatetsky left Xchange in May 2001 to become a self-employed consultant and focus on KDnuggets.


KDD and SIGKDD

In 1989, Piatetsky organized the first workshop on Knowledge Discovery in Data (KDD-89), held at IJCAI-1989 in Detroit, MI. This workshop had over 60 attendees, including researchers Ross Quinlan and Jaime Carbonell. Piatetsky organized the next two KDD workshops, in 1991 and 1993. With Usama Fayyad and Ramasamy (Sam) Uthurusamy, he expanded the workshops into an annual international conference on Data Mining and was the General Chair of the KDD-98 conference. He served as the chair of the KDD Steering committee until 1998, when the SIGKDD group was formed as part of ACM to run the annual KDD conference and help promote research in Knowledge Discovery and Data Mining. He served as Director of SIGKDD for 2001–2005 and as SIGKDD Chair for 2005–2009. In 1997, Piatetsky and Ismail Parsa initiated the KDD Cup competition, which was the world's first open data mining contest. The annual ACM SIGKDD conference is the leading research conference on Knowledge Discovery and Data Mining, according to Microsoft Academic search and Google Scholar. The 21st ACM SIGKDD conference was held in Sydney, Australia in August 2015.


KDnuggets

In 1993, Piatetsky started Knowledge Discovery Nuggets (KDnuggets) as a newsletter to connect researchers who attended the KDD-93 workshop. With the emergence of the Internet and the Mosaic web browser, he and Chris Matheus eventually created the website Knowledge Discovery Mine, hosted at GTE Labs. The newsletter served as an unofficial publication of KDD workshops. When Piatetsky left GTE Labs, he created the KDnuggets website, with the mission of covering the field with short, concise "nuggets". The resource started as a directory for the subjects of data mining and data science, including software, jobs, academic positions, CFP (calls for papers), companies, courses, datasets, education, meetings, publications and webcasts. KDnuggets' main focus is to cover the fields of Business Analytics, Data Mining, and Data Science, including interviews with key leaders. It offers a free data mining course for advanced undergraduates or first-year graduate students.
The @KDnuggets Twitter account was:
* Voted the [...] by Big Data Republic (2013).
* In the Top 10 Most Influential Brands on Big Data, Onalytica, May 2017.
* No. 1 in [...], Nov 2016.
* No. 1 in [...], Nov 2016.
* No. 3 in AI Intelligence & Machine Learning: Top 100 Influencers and Brands, Onalytica, Mar 2016.
* No. 4 in Big Data 2016: Top 100 Influencers, Onalytica, Feb 2016.
* In InformationWeek's Twitter Top 10 Data Science, Analytics, And BI Feeds, Jan 2016.

In February 2015, Piatetsky and Data ScienceTech Institute announced a partnership and he became an Honorary Member of its Scientific Advisory Board.


Research and publications

In 1991, Piatetsky and William (Bud) Frawley edited their first book, Knowledge Discovery in Databases. In 1996, Piatetsky, Usama Fayyad, Padhraic Smyth, and Ramasamy Uthurusamy edited a follow-up, Advances in Knowledge Discovery and Data Mining. Piatetsky also helped launch and co-edit the Data Mining and Knowledge Discovery journal. He edited 9 books and collections and authored over 60 technical papers, articles and book chapters, mostly focusing on data mining and knowledge discovery.


Recognition

* 1984, NYU Award for Best Dissertation in Computer Sciences, PhD Thesis: "A Self-Organizing Database System - A Different Approach to Query Optimization".
* 1985, NYU Award for Best Dissertation in all Natural Sciences.
* 1995, Leslie H. Warner award (GTE's highest for technical achievement) for the KEFIR system.
* 2000, First SIGKDD Service Award, for contributions to Data Mining and Knowledge Discovery.
* 2007, IEEE ICDM Outstanding Service Award, for major contributions to the data mining field.


References

* Journeys to Data Mining: Experiences from 15 Renowned Researchers, edited by Mohamed Medhat Gaber