Software PILP (GISELA) | EGI Applications Database

Applications Database
Supporting


Name: PILP (GISELA)

Description: Parallel Inductive Logic Programming

Abstract: This application is intended to discover hidden data from relational databases. It uses a technique called Inductive Logic Programming (ILP): given background knowledge, a set of positive examples, a set of negative examples, and a language bias, the objective is to generate first-order rules that (almost) perfectly describe all of the positive examples and none of the negative examples. We have applied ILP in several domains, including drug discovery, analysis of mammograms, and link discovery. These domains involve very large databases and sets of examples.

ILP systems have been quite successful in extracting comprehensible models of relational data. Indeed, for over a decade, ILP systems have been used to construct predictive models for data drawn from diverse domains, including the sciences, engineering, language processing, environment monitoring, and software analysis. In a nutshell, ILP systems repeatedly examine candidate clauses (the "search space") to find good rules. Ideally, the search stops when the rules cover nearly all positive examples while covering only a few negative examples. Unfortunately, the search space can grow very quickly in ILP applications, so several techniques have been proposed to improve search efficiency. These include reducing computation time at individual nodes, better representations of the search, sampling the search space, and parallelism. Parallelism can be obtained in very different ways, such as dividing the search tree, dividing the examples, or performing cross-validation in parallel. An intriguing alternative that can lead to better accuracy while still taking advantage of parallelism is the use of ensembles. Ensembles are classifiers that combine the predictions of multiple classifiers to produce a single prediction. To some extent, an induced theory is itself an ensemble of clauses.
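The ideas of coverage and ensembles described above can be illustrated with a minimal Python sketch. It is not the PILP implementation: clauses are modelled as plain Python predicates rather than first-order rules, the attribute names (`weight`, `rings`), the toy examples, and the three theories are all hypothetical, and majority voting is only one assumed way of combining predictions.

```python
from typing import Callable, Dict, List, Tuple

# Assumption: an example is a flat attribute dictionary and a clause is a
# boolean test on it. Real ILP clauses are first-order rules, not lambdas.
Example = Dict[str, float]
Clause = Callable[[Example], bool]
Theory = List[Clause]


def theory_predicts(theory: Theory, example: Example) -> bool:
    """A theory covers an example if at least one of its clauses does."""
    return any(clause(example) for clause in theory)


def coverage(theory: Theory, positives: List[Example],
             negatives: List[Example]) -> Tuple[int, int]:
    """Return (positives covered, negatives covered) for a theory."""
    pos_covered = sum(theory_predicts(theory, e) for e in positives)
    neg_covered = sum(theory_predicts(theory, e) for e in negatives)
    return pos_covered, neg_covered


def ensemble_predicts(theories: List[Theory], example: Example) -> bool:
    """Combine theories by strict majority vote (an assumed scheme)."""
    votes = sum(theory_predicts(t, example) for t in theories)
    return votes * 2 > len(theories)


# Hypothetical toy data.
positives = [{"weight": 320, "rings": 2}, {"weight": 450, "rings": 3}]
negatives = [{"weight": 700, "rings": 1}, {"weight": 480, "rings": 0}]

# Three hypothetical theories, e.g. induced independently in parallel.
theory_a: Theory = [lambda e: e["weight"] < 500]
theory_b: Theory = [lambda e: e["rings"] >= 2, lambda e: e["weight"] < 300]
theory_c: Theory = [lambda e: e["rings"] >= 3]
theories = [theory_a, theory_b, theory_c]

# theory_a alone covers both positives but also one negative.
print(coverage(theory_a, positives, negatives))  # (2, 1)

# The majority vote accepts both positives and rejects both negatives.
print([ensemble_predicts(theories, e) for e in positives + negatives])
```

In this toy setting the single theory `theory_a` wrongly covers one negative example, while the vote over three theories does not. Because each theory can be induced independently, this style of combination lends itself to parallel execution.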
We can go one step further and combine different theories to form a single ensemble. The mai