The data warehouse ETL toolkit [recurso electrónico] : practical techniques for extracting, cleaning, conforming, and delivering data / Ralph Kimball, Joe Caserta.

Por: Kimball, RalphColaborador(es): Caserta, Joe, 1965-Tipo de material: TextoTextoDetalles de publicación: Indianapolis, IN : Wiley, c2004Descripción: 1 online resource (xxxiv, 491 p.) : illTipo de contenido: text Tipo de medio: computer Tipo de portador: online resourceISBN: 0764579231 (electronic bk.); 9780764579233 (electronic bk.); 0764567578 (paper/website); 9780764567575 (paper/website)Tema(s): Data warehousing | Database design | COMPUTERS -- Desktop Applications -- Databases | COMPUTERS -- Database Management -- General | COMPUTERS -- System Administration -- Storage & Retrieval | Data warehousing | Database design | Electronic book collection | Entrepôts de données (Informatique) | Bases de données -- Conception | Data warehousing | Database design | Data-Warehouse-KonzeptGénero/Forma: Electronic books.Formatos físicos adicionales: Print version:: Data warehouse ETL toolkit.Clasificación CDD: 005.74 Clasificación LoC:QA76.9.D37 | K53 2004ebOtra clasificación: DAT 620f | ST 530 Recursos en línea: Libro electrónicoTexto
Contenidos:
Requirements, Realities, and Architecture -- Surrounding the Requirements -- The Mission of the Data Warehouse -- The Mission of the ETL Team -- ETL Data Structures -- To Stage or Not to Stage -- Designing the Staging Area -- Data Structures in the ETL System -- Planning and Design Standards -- Data Flow -- Extracting -- The Logical Data Map -- Building the Logical Data Map -- Integrating Heterogeneous Data Sources -- The Challenge of Extracting from Disparate Platforms -- Mainframe Sources -- Flat Files -- XML Sources -- Web Log Sources -- ERP System Sources -- Extracting Changed Data -- Cleaning and Conforming -- Defining Data Quality -- Assumptions -- Design Objectives -- Cleaning Deliverables -- Screens and Their Measurements -- Conforming Deliverables -- Delivering Dimension Tables -- The Basic Structure of a Dimension -- The Grain of a Dimension -- The Basic Load Plan for a Dimension -- Flat Dimensions and Snowflaked Dimensions -- Date and Time Dimensions -- Big Dimensions -- Small Dimensions -- One Dimension or Two -- Dimensional Roles -- Dimensions as Subdimensions of Another Dimension -- Degenerate Dimensions -- Slowly Changing Dimensions -- Type 1 Slowly Changing Dimension (Overwrite) -- Type 2 Slowly Changing Dimension (Partitioning History) -- Precise Time Stamping of a Type 2 Slowly Changing Dimension -- Type 3 Slowly Changing Dimension (Alternate Realities) -- Hybrid Slowly Changing Dimensions -- Late-Arriving Dimension Records and Correcting Bad Data -- Multivalued Dimensions and Bridge Tables.
Resumen: Annotation Cowritten by Ralph Kimball, the world's leading data warehousing authority, whose previous books have sold more than 150,000 copiesDelivers real-world solutions for the most time- and labor-intensive portion of data warehousing-data staging, or the extract, transform, load (ETL) processDelineates best practices for extracting data from scattered sources, removing redundant and inaccurate data, transforming the remaining data into correctly formatted data structures, and then loading the end product into the data warehouseOffers proven time-saving ETL techniques, comprehensive guidance on building dimensional structures, and crucial advice on ensuring data quality.
Star ratings
    Valoración media: 0.0 (0 votos)
Existencias
Tipo de ítem Biblioteca actual Colección Signatura Copia número Estado Fecha de vencimiento Código de barras
Libro Electrónico Biblioteca Electrónica
Colección de Libros Electrónicos QA76.9 .D37 K53 2004 EB (Browse shelf(Abre debajo)) 1 No para préstamo 369742-2001
Navegando Biblioteca Electrónica Estantes, Código de colección: Colección de Libros Electrónicos Cerrar el navegador de estanterías (Oculta el navegador de estanterías)
QA76.9 .D35 Spatial Information Theory QA76.9 .D37 B68 2009 EB Pentaho solutions QA76.9 .D37 D38 2008 EB The data warehouse lifecycle toolkit QA76.9 .D37 K53 2004 EB The data warehouse ETL toolkit QA76.9 .E94 Cloud Computing QA76.9 .E94 Stochastic Models for Fault Tolerance QA76.9 .E94 High Performance Computing and Applications

Includes index.

Description based on print version record.

Annotation Cowritten by Ralph Kimball, the world's leading data warehousing authority, whose previous books have sold more than 150,000 copiesDelivers real-world solutions for the most time- and labor-intensive portion of data warehousing-data staging, or the extract, transform, load (ETL) processDelineates best practices for extracting data from scattered sources, removing redundant and inaccurate data, transforming the remaining data into correctly formatted data structures, and then loading the end product into the data warehouseOffers proven time-saving ETL techniques, comprehensive guidance on building dimensional structures, and crucial advice on ensuring data quality.

Requirements, Realities, and Architecture -- Surrounding the Requirements -- The Mission of the Data Warehouse -- The Mission of the ETL Team -- ETL Data Structures -- To Stage or Not to Stage -- Designing the Staging Area -- Data Structures in the ETL System -- Planning and Design Standards -- Data Flow -- Extracting -- The Logical Data Map -- Building the Logical Data Map -- Integrating Heterogeneous Data Sources -- The Challenge of Extracting from Disparate Platforms -- Mainframe Sources -- Flat Files -- XML Sources -- Web Log Sources -- ERP System Sources -- Extracting Changed Data -- Cleaning and Conforming -- Defining Data Quality -- Assumptions -- Design Objectives -- Cleaning Deliverables -- Screens and Their Measurements -- Conforming Deliverables -- Delivering Dimension Tables -- The Basic Structure of a Dimension -- The Grain of a Dimension -- The Basic Load Plan for a Dimension -- Flat Dimensions and Snowflaked Dimensions -- Date and Time Dimensions -- Big Dimensions -- Small Dimensions -- One Dimension or Two -- Dimensional Roles -- Dimensions as Subdimensions of Another Dimension -- Degenerate Dimensions -- Slowly Changing Dimensions -- Type 1 Slowly Changing Dimension (Overwrite) -- Type 2 Slowly Changing Dimension (Partitioning History) -- Precise Time Stamping of a Type 2 Slowly Changing Dimension -- Type 3 Slowly Changing Dimension (Alternate Realities) -- Hybrid Slowly Changing Dimensions -- Late-Arriving Dimension Records and Correcting Bad Data -- Multivalued Dimensions and Bridge Tables.

Use copy Restrictions unspecified star MiAaHDL

Electronic reproduction. [S.l.] : HathiTrust Digital Library, 2010. MiAaHDL

Master and use copy. Digital master created according to Benchmark for Faithful Digital Reproductions of Monographs and Serials, Version 1. Digital Library Federation, December 2002. MiAaHDL

http://purl.oclc.org/DLF/benchrepro0212

digitized 2010 HathiTrust Digital Library committed to preserve pda MiAaHDL

19

Con tecnología Koha