Core Concepts in Data Analysis: Summarization, Correlation and Visualization [recurso electrónico] / by Boris Mirkin.

Por: Mirkin, Boris [author.]Colaborador(es): SpringerLink (Online service)Tipo de material: TextoTextoSeries Undergraduate Topics in Computer ScienceEditor: London : Springer London : Imprint: Springer, 2011Descripción: XX, 390p. 129 illus. online resourceTipo de contenido: text Tipo de medio: computer Tipo de portador: online resourceISBN: 9780857292872Tema(s): Computer science | Computational complexity | Artificial intelligence | Optical pattern recognition | Computer Science | Discrete Mathematics in Computer Science | Probability and Statistics in Computer Science | Math Applications in Computer Science | Artificial Intelligence (incl. Robotics) | Pattern RecognitionFormatos físicos adicionales: Printed edition:: Sin títuloClasificación CDD: 004.0151 Clasificación LoC:QA76.9.M35 Recursos en línea: Libro electrónicoTexto En: Springer eBooksResumen: Core Concepts in Data Analysis: Summarization, Correlation and Visualization provides in-depth descriptions of those data analysis approaches that either summarize data (principal component analysis and clustering, including hierarchical and network clustering) or correlate different aspects of data (decision trees, linear rules, neuron networks, and Bayes rule). Boris Mirkin takes an unconventional approach and introduces the concept of multivariate data summarization as a counterpart to conventional machine learning prediction schemes, utilizing techniques from statistics, data analysis, data mining, machine learning, computational intelligence, and information retrieval. Innovations following from his in-depth analysis of the models underlying summarization techniques are introduced, and applied to challenging issues such as the number of clusters, mixed scale data standardization, interpretation of the solutions, as well as relations between seemingly unrelated concepts: goodness-of-fit functions for classification trees and data standardization, spectral clustering and additive clustering, correlation and visualization of contingency data.   The mathematical detail is encapsulated in the so-called “formulation” parts, whereas most material is delivered through “presentation” parts that explain the methods by applying them to small real-world data sets; concise “computation” parts inform of the algorithmic and coding issues. Four layers of active learning and self-study exercises are provided: worked examples, case studies, projects and questions.         
Star ratings
    Valoración media: 0.0 (0 votos)
Existencias
Tipo de ítem Biblioteca actual Colección Signatura Copia número Estado Fecha de vencimiento Código de barras
Libro Electrónico Biblioteca Electrónica
Colección de Libros Electrónicos QA76.9 .M35B (Browse shelf(Abre debajo)) 1 No para préstamo 370538-2001

Core Concepts in Data Analysis: Summarization, Correlation and Visualization provides in-depth descriptions of those data analysis approaches that either summarize data (principal component analysis and clustering, including hierarchical and network clustering) or correlate different aspects of data (decision trees, linear rules, neuron networks, and Bayes rule). Boris Mirkin takes an unconventional approach and introduces the concept of multivariate data summarization as a counterpart to conventional machine learning prediction schemes, utilizing techniques from statistics, data analysis, data mining, machine learning, computational intelligence, and information retrieval. Innovations following from his in-depth analysis of the models underlying summarization techniques are introduced, and applied to challenging issues such as the number of clusters, mixed scale data standardization, interpretation of the solutions, as well as relations between seemingly unrelated concepts: goodness-of-fit functions for classification trees and data standardization, spectral clustering and additive clustering, correlation and visualization of contingency data.   The mathematical detail is encapsulated in the so-called “formulation” parts, whereas most material is delivered through “presentation” parts that explain the methods by applying them to small real-world data sets; concise “computation” parts inform of the algorithmic and coding issues. Four layers of active learning and self-study exercises are provided: worked examples, case studies, projects and questions.         

19

Con tecnología Koha