NetOwl is a suite of multilingual text and identity analytics products that analyze big data in the form of text data (reports, web content, social media, etc.) as well as structured entity data about people, organizations, places, and things. NetOwl utilizes artificial intelligence (AI)-based approaches, including natural language processing (NLP), machine learning (ML), and computational linguistics, to extract entities, relationships, and events; to perform sentiment analysis; to assign latitude/longitude to geographical references in text; to translate names written in foreign languages; and to perform name matching and identity resolution.
"SRA International." Washington Post. Retrieved 2013-07-02.
Zelenko, Dmitry, and Chinatsu Aone. "Discriminative Methods for Transliteration." In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (2006). Retrieved 2013-05-20.
Maybury, Mark (2012). Multimedia Information Extraction. Hoboken, New Jersey: John Wiley & Sons, Inc., p. 18. Retrieved 2013-07-02.
NetOwl's uses include semantic search and discovery, geospatial analysis, intelligence analysis, content enrichment, compliance monitoring, cyber threat monitoring, risk management, and bioinformatics.
Smith, Susan. "Notes from the GEOINT 2007 Symposium." GISCafe (2007-10-29). Retrieved 2013-07-02.
Guess, Angela (2012-01-19). "LexisNexis Releases New Version of Lexis Advance". semanticweb.com. Retrieved 2013-07-28.
Aone, Chinatsu, et al. "Assentor®: an NLP-based Solution to E-mail Monitoring." In Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence (2000), pp. 945-950. Retrieved 2013-05-20.
History
The first NetOwl product was NetOwl Extractor, which was initially released in 1996. Since then, Extractor has added many new capabilities, including relationship and event extraction, categorization, name translation, geotagging, and sentiment analysis, as well as entity extraction in other languages. Other products were added later to the NetOwl suite, namely TextMiner, NameMatcher, and EntityMatcher.
NetOwl has participated in several third-party-sponsored text and entity analytics software benchmarking events. NetOwl Extractor was the top-scoring named entity extraction system at the DARPA-sponsored Message Understanding Conference MUC-6 and the top-scoring link and event extraction system in MUC-7. It was also the top-scoring system at several of the NIST-sponsored Automatic Content Extraction (ACE) evaluation tasks. NetOwl NameMatcher was the top-scoring system at the MITRE Challenge for Multicultural Person Name Matching.
The ACE 2005 (ACE'05) Evaluation Plan. Retrieved 2013-05-20.
Products
The NetOwl suite includes, among others, the following text and entity analytics products:
Text Analytics
NetOwl Extractor performs entity extraction from unstructured texts using natural language processing (NLP), machine learning (ML), and computational linguistics. Extractor also performs semantic relationship and event extraction as well as geotagging of text. It is used for a variety of data sources, including both traditional sources (e.g., news, reports, web pages, email) and social media (e.g., Twitter, Facebook, chats, blogs). It runs on a variety of Big Data analytics platforms, including Apache Hadoop and LexisNexis's High-Performance Computing Cluster (HPCC) technology. It has been integrated with a number of third-party analytical tools such as Esri ArcGIS and Google Earth/Maps.
Identity Analytics
NetOwl NameMatcher and EntityMatcher perform name matching and identity resolution for large multicultural and multilingual entity databases using machine learning (ML) and computational linguistics approaches. They are used for applications such as anti-money laundering (AML), watch lists, regulatory compliance, and fraud detection.
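As a rough illustration of what name matching against a watch list involves, the sketch below scores name pairs with generic string similarity from the Python standard library. It is not NetOwl's method or API: the function names (`normalize`, `name_similarity`, `best_matches`), the sample names, and the 0.8 threshold are all assumptions made for this example, and a production system would rely on far richer linguistic and machine-learned models.

```python
import unicodedata
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase, strip accents, and collapse whitespace (hypothetical helper)."""
    decomposed = unicodedata.normalize("NFKD", name)
    without_marks = "".join(
        ch for ch in decomposed if not unicodedata.combining(ch)
    )
    return " ".join(without_marks.lower().split())


def name_similarity(a: str, b: str) -> float:
    """Return a similarity score in [0, 1] for two names after normalization."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def best_matches(query, candidates, threshold=0.8):
    """Return (candidate, score) pairs at or above the threshold, best first.

    The threshold is an arbitrary value chosen for this sketch.
    """
    scored = ((c, name_similarity(query, c)) for c in candidates)
    kept = [(c, s) for c, s in scored if s >= threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Made-up watch list for illustration only.
    watch_list = ["Muhammad Ali", "José García", "John Smith"]
    for name, score in best_matches("Jose Garcia", watch_list):
        print(f"{name}: {score:.2f}")
```

Character-level similarity of this kind handles accents and small spelling variants, but it does not address transliteration between scripts, name-order conventions, or nicknames, which is where multicultural name matching becomes difficult.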
See also
* Knowledge extraction
* Text mining
* Data mining
* Computational linguistics
* Named entity recognition
* Unstructured data
* Document classification
External links
NetOwl website