Uso de medidas de desempenho e de grau de interesse para análise de regras descobertas nos classificadores

AUTOR(ES)
DATA DE PUBLICAÇÃO

2008

RESUMO

The process of knowledge discovery in databases has become necessary because of the large amount of data currently stored in databases of companies. They operated properly can help the managers in decision-making in organizations. This process is composed of several steps, among them there is a data mining, stage where they are applied techniques for obtaining knowledge that can not be obtained through traditional methods of analysis. In addition to the technical, in step of data mining is also chosen the task of data mining that will be used. The data mining usually produces large amount of rules that often are not important, relevant or interesting to the end user. This makes it necessary to review the knowledge discovered in post-processing of data. In the stage of post-processing is used both measures of performance but also of degree of interest in order to sharpen the rules more interesting, useful and relevant. In this work, using a tool called WEKA (Waikato Environment for Knowledge Analysis), were applied techniques of mining, decision trees and rules of classification by the classification algorithms J48.J48 and J48.PART respectively. In the post-processing data was implemented a package with functions and procedures for calculation of both measures of performance but also of the degree of interest rules. At this stage consultations have also been developed (querys) to select the most important rules in accordance with measures of performance and degree of interest.

ASSUNTO(S)

rule evaluation measures mineração de dados data mining post-processing classificação pós-processamento descoberta de conhecimento em banco de dados medidas de avaliação de regras engenharia eletrica knowledge discovery in databases classification

Documentos Relacionados