huemul-bigdatagovernance
Huemul BigDataGovernance is a framework that runs on Spark, Hive, and HDFS and lets organizations implement a single corporate data strategy based on data governance best practices. It supports tables with Primary Key and Foreign Key checks on insert and update, along with validation of nulls, text lengths, numeric and date ranges, unique values, and default values. Fields can be flagged as subject to ARCO rights (access, rectification, cancellation, and opposition), which simplifies compliance with GDPR-style data protection laws, and each field's security level and encryption status can be recorded. The framework also lets you attach complex validation rules to tables. It simplifies the configuration and reading of input interfaces, lets parameters be adjusted when schemas change, automatically records traceability between interfaces and tables, and stores data dictionaries in a central repository. It also generates code automatically from input interface definitions, including initial business logic code. By using templates, data engineers can define input interfaces and