Atlas is a set of core foundational governance services that are scalable and extensible - enabling enterprises to effectively and efficiently meet compliance needs in Hadoop and allowing integration with the entire enterprise data ecosystem.
Apache Atlas provides organizations with open metadata management and governance capabilities to catalog their data assets, classify and manage those assets, and provide data scientists, Analysts and data governance teams provide collaborative capabilities around these data assets.
Features
Metadata types and instances
Predefined types for various Hadoop and non-Hadoop metadata
As required Managed metadata Ability to define new types
Types can have primitive properties, complex properties, object references; can inherit from other types.
Instances of types, called entities, capture metadata object details and their relationships
REST API works with types and instances for easier integration
Classification
The ability to dynamically create classifications, such as PII, ExIPRESION, DATAAL quality, sensitive
Classifications can include attributes, such as the expiry_date attribute in the EXPIRES_ON classification
Entities can be associated with Multiple classifications are associated, allowing for easier discovery and secure enforcement.
Propagate classification through lineage - automatically ensure classification follows data through various processing
Search/Discover
By type, classification, attribute Intuitive UI for searching entities by value or free text
Rich REST API to enable complex standard searches
SQL-like query language for searching entities - Domain Specific Language (DSL)
For more Apache related technical articles, please visit the Apache usage tutorial column to learn!
The above is the detailed content of what is apache atlas. For more information, please follow other related articles on the PHP Chinese website!