Informational Modelling
Data Formats. Dimensional Modeling and Data Models. Structured Data and Unstructured Data. Extraction, Transformation and Loading (ETL and ELT). Streaming data. Data Warehouse and Data Lake projects. Big Data Management Systems and Best Practices.
Basic Information
Workload
60 hours
Requirements
Databases
Mandatory:
- Inmom, W. H. Data Lake Architecture: Designing the Data Lake and Avoiding the Garbage Dump.
- Hay, David. Data Model Patterns: a metadata map. Morgan Kaufmann, 2006.
- Sharma, Ben. Architecting Data Lakes: Data Management Architectures for Advanced Business Use Cases <http://www.oreilly.com/data/free/architecting-data-lakes.csp>
Complementary:
- Holden Karau. Learning Spark: Lightning-Fast Big Data Analysis
- Vermeulen, Andreas François. Practical Data Science: A Guide to Building the Technology Stack for Turning Data Lakes into Business Assets
- Silverston, Len. The Data Model Resource Book, Vol. 1: A Library of Universal Data Models for all Enterprises. Wiley, 2001.
- Ambler, Scott; Pramod J. Sadalage. Refactoring Databases: Evolutionary Database Design. Addison-Wesley Professional. 2006.
- Gamma, Erich; Richard Helm, Ralph Johnson, John Vlissides. Design Patterns: Elements of Reusable Object-Oriented Software. Addison-Wesley Professional. 2016.