
How to build a data dictionary

What a data dictionary is, and how to build and maintain one.

  1. How to build a Data Dictionary. Piotr Kononow, Founder @ Dataedo, https://dataedo.com
  2. What is a Data Dictionary? A specification of the tables (data sets) in a database; a list of columns/fields with their attributes. A Data Dictionary is (technical) metadata.
  3. Sample Data Dictionary entry
  4. Elements of a Data Dictionary: list of data sets/tables; list of columns/fields; data type; constraints; default values; primary/unique keys; relationships/joins; allowed values; ownership; classification; comments; sources; status.
  5. Why should you care about a Data Dictionary? Find the data; understand the data (purpose of tables and columns, relationships/joins, sources, status/quality); capture knowledge about the data; share knowledge about the data.
  6. Active vs Passive Data Dictionary
  7. An Active Data Dictionary has many names: "System Catalog Views" (MS SQL Server); "System Catalog Tables" (Amazon Redshift, IBM Informix); "System Catalogs" (PostgreSQL); "System Views" (SAP HANA, Oracle); "System Tables" (SAP/Sybase ASE, SAP HANA, Vertica, Firebird, …); "Catalog Views" (IBM Db2); "Information schema" (MySQL, Snowflake); "Data Dictionary Views" (Oracle, Teradata); …
  8. Active Data Dictionary views. Sample MS SQL Server System Catalog Views: sys.tables, sys.columns, sys.views, sys.foreign_keys, sys.index_columns, …
  9. Querying the (Active) Data Dictionary. More at: https://dataedo.com/kb/query/
  10. Data Dictionary in different platforms: RDBMS/data warehouse (an integral part of the database, easily accessible: the active data dictionary); document database/NoSQL, e.g. MongoDB (embedded in the data itself, requires parsing); data lake (external or none!).
  11. Data Dictionary flow
  12. Where can you find a Data Dictionary? Design documents/spreadsheets; the DBMS (active data dictionary); data modeling tools; database documentation; Data Dictionary tools/metadata repositories/data catalogs; ETL tools.
  13. Data Dictionary tools: spreadsheet or word processor; DBMS active data dictionary + GUI tool; data modeling tools (e.g. Erwin); Data Dictionary tools (e.g. Dataedo); data catalogs.
  14. Building a Data Dictionary
  15. Data Dictionary with Dataedo: (1) metadata repository; (2) import metadata from data sources; (3) comments; (4) aliases; (5) additional metadata fields; (6) primary/unique keys; (7) relationships + diagrams; (8) organization with modules; (9) data classification; (10) map Business Glossary; (11) share!
  16. Questions?
  17. Start today! Free trial: https://dataedo.com/free-trial
  18. Thank you! Piotr Kononow, Founder @ Dataedo
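
Slide 4 lists the elements a data dictionary can hold. As a rough illustration only, those elements could be modeled as two record types, one per table and one per column. The class and field names below (`TableEntry`, `ColumnEntry`, and so on) are hypothetical and are not taken from the slides or from Dataedo.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ColumnEntry:
    """One column/field of a table (hypothetical structure)."""
    name: str
    data_type: str
    nullable: bool = True                 # a simple constraint
    default: Optional[str] = None         # default value, if any
    is_primary_key: bool = False
    allowed_values: List[str] = field(default_factory=list)
    comment: str = ""


@dataclass
class TableEntry:
    """One table/data set plus its descriptive metadata (hypothetical)."""
    name: str
    owner: str = ""
    classification: str = ""
    source: str = ""
    status: str = ""
    comment: str = ""
    columns: List[ColumnEntry] = field(default_factory=list)

    def primary_key(self) -> List[str]:
        """Names of the columns flagged as part of the primary key."""
        return [c.name for c in self.columns if c.is_primary_key]


# Made-up sample entry, for illustration only.
customers = TableEntry(
    name="customers",
    owner="Sales",
    comment="One row per customer account.",
    columns=[
        ColumnEntry("customer_id", "int", nullable=False, is_primary_key=True),
        ColumnEntry("status", "varchar(10)", default="active",
                    allowed_values=["active", "closed"],
                    comment="Account status."),
    ],
)
```

Relationships/joins are left out to keep the sketch short; they would probably need a third record type linking two `ColumnEntry` objects.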
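
Slides 8 and 9 describe querying the active data dictionary, using MS SQL Server catalog views as the example. The sketch below shows the same idea on SQLite instead, because it ships with Python and needs no server: it reads SQLite's own catalog (`sqlite_master` and `PRAGMA table_info`) rather than `sys.tables` and `sys.columns`. The sample tables and the `build_data_dictionary` helper are assumptions made for this illustration.

```python
import sqlite3


def build_data_dictionary(conn):
    """Return one dict per column, read from SQLite's catalog."""
    rows = []
    tables = conn.execute(
        "SELECT name FROM sqlite_master "
        "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' "
        "ORDER BY name"
    ).fetchall()
    for (table,) in tables:
        # PRAGMA table_info yields: cid, name, type, notnull, dflt_value, pk
        for _cid, column, data_type, notnull, default, pk in conn.execute(
            f'PRAGMA table_info("{table}")'
        ):
            rows.append({
                "table": table,
                "column": column,
                "data_type": data_type,
                "nullable": not notnull,   # reflects the declared NOT NULL only
                "default": default,
                "primary_key": pk > 0,
            })
    return rows


# Made-up sample schema, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (
        customer_id INTEGER PRIMARY KEY,
        name        TEXT NOT NULL,
        status      TEXT DEFAULT 'active'
    );
    CREATE TABLE orders (
        order_id    INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customers(customer_id)
    );
""")

dictionary = build_data_dictionary(conn)
for row in dictionary:
    print(row["table"], row["column"], row["data_type"])
```

On other platforms the catalog objects differ (see the names on slide 7), but the pattern of joining a table list to a column list stays the same.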
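
Slide 10 notes that in a document database the data dictionary is embedded in the data itself and requires parsing. A minimal sketch of that parsing step, assuming the documents have already been loaded as plain Python dicts (no MongoDB driver is used here), is to walk every document and record which value types appear under each field path. The `infer_fields` helper and the sample documents are hypothetical.

```python
from collections import defaultdict


def infer_fields(documents):
    """Map each dotted field path to the sorted type names seen for it."""
    seen = defaultdict(set)

    def walk(value, prefix):
        if isinstance(value, dict):
            # Descend into nested documents, extending the dotted path.
            for key, child in value.items():
                walk(child, f"{prefix}.{key}" if prefix else key)
        else:
            seen[prefix].add(type(value).__name__)

    for doc in documents:
        walk(doc, "")
    return {path: sorted(types) for path, types in sorted(seen.items())}


# Made-up sample documents, for illustration only.
docs = [
    {"_id": 1, "name": "Ann", "address": {"city": "Gdansk"}},
    {"_id": 2, "name": "Bob", "age": 41},
    {"_id": "x3", "name": "Cy", "address": {"city": "Oslo", "zip": "0150"}},
]

fields = infer_fields(docs)
for path, types in fields.items():
    print(path, types)
```

A field that shows more than one type (here `_id`) or that is missing from some documents is exactly the kind of finding worth recording as a comment in the data dictionary. Arrays are treated as leaf values in this sketch.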
