Collect. Extract. Monitor.
At Parabots, we believe that our solutions will soon become one of the biggest segments in the industry. We’ve only just started, but we already know that every product we build requires hard-earned skills, dedication and a daring attitude. Continue reading and learn all there is to know about the smart tech behind our successful A.I. and data mining software.
The Internet can be searched in many different ways. Search engines like Google and Yahoo allow people to look for pages containing one or more words. Specialized collection sites like Startpagina.nl or Yahoo Directories offer lists of pages in a certain category.
But what if you want to collect a list of web pages about your own particular topic of interest? Wouldn't it be convenient if you could learn a computer the concept of what you are looking for in a few simple steps, and let the computer do the searching for you?
Parabots develops intelligent web spiders and classifiers that find information fulfilling specific user demands. These demands are translated to search concepts that are used for the selection of relevant sites and for the determination of interesting search paths.
A user can create search concepts in a wide variety of ways. He can give some examples of relevant web sites, but can also give some examples by means of structured information (about cultural objects) or encyclopaedic information like a Wikipedia page. Concepts like 'hotel' or 'restaurant' can be used to retrieve restaurant and hotel sites, or the addresses found on these sites.
Our intelligent spiders crawl over the internet and search for relevant pages and sites. This information from these sites is structured and stored in a database, or in other formats like XML or HTML.
The Xbots software suite is a powerful environment for concept-driven web searches and a good example of the capabilities of intelligent web collection.
Extract and understand
Looking for all addresses of potential business to business customers in your area? Do you want to have a list of all hotels in your city that is always up-to-date? Do you want to automatically digitalise your collection of articles or bills?
Parabots might have the software components you need.
Text needs interpretation in order to become useful. A lot of the information available on the Internet however is presented in an unstructured way, which makes it difficult to interpret it automatically. Parabots' information extraction components are designed to make sense from these unstructured texts.
WebID is an application used in many European countries to extract addresses from web pages and other documents. With minor alterations WebId can be used in new countries with other address formats.
We also create software that structures newspaper advertisements and interprets resumes. Adaptive techniques are used to learn how to extract new information from the annotation of examples. This technology is applied in various different domains like ontology filling, and the extraction of semi-structured information, like the opening times of restaurants, departure times of trains, hotel information, and resumes. This information is described in a wide variety of ways, but the annotation of some examples may suffice for the automated extraction using adaptive techniques.
The opinion mining component is one of the latest additions to our software base. It searches documents for affective utterances about persons or companies. This component is used in the Vox-pop.nl webmonitor where it deduces the general opinion of bloggers and forum participants about e.g. politicians or companies or even about products or groups of people connected by philosophy of life or ethnicity.
Always wanted to know how people think about your company or your products? Always the last to know new developments or marketing trends? Interested in the general opinions about your partners and clients? Want to know the market value of the website of a company, or e-store? Our monitoring software can serve your needs.
Parabots has created monitoring software that captures the activities of companies and communities. This software can monitor both internet and community movements. Monitoring the internet is relevant for those who want to know how the internet develops - what kind of sites are becoming more popular (auction sites, gambling sites, forums, blogs, etc.) and what are emerging (or disappearing) trends on the web? 'Community monitoring' focuses on the contents of a specific collection of (one or more) sites. It searches web spaces for emerging and popular topics and trends, for users, new products or technologies, or opinions. The WebMonitor is a web-based software suite that detects and visualises trends and hot topics on forums, auction sites, and blogs and inspects smaller niches for new hot topics. The Web Monitor can be used, for example, to check auction sites for valuable trade, or forums for dynamic discussions and hot topics.
The Vox-pop.nl web application combines the monitoring software with our opinion mining component and searches blogs and forums for affective utterances towards for example politicians, political parties, and celebrities.