PyApacheAtlas supports using Excel to do bulk uploads, creating type and classification definitions, updating lineage, and more. There are some default settings that help you get started quickly but you can also customize the excel spreadsheet somewhat.
;
(semi-colon)experts
column into an array on the BulkEntities tab.owners
column into an array on the BulkEntities tab.[Relationship] meanings
into an array on the BulkEntities tab.[root]
into an array on the BulkEntities tab.[root] labels
or [root] classifications
You can customize your Excel Template (manually) and then inform the ExcelReader by feeding the ExcelConfiguration the parameters and updated values.
Here's an example where you might...
from pyapacheatlas.readers import ExcelConfiguration, ExcelReader
ec = ExcelConfiguration(
value_separator = "|",
bulkEntity_sheet = "Sheet1",
source_prefix = "Quelle",
target_prefix = "Ziel",
process_prefix = "Prozess"
)
reader = ExcelReader(ec)
entities = reader.parse_bulk_entities("./some/file/path.xlsx")
# Assuming you've authenticated to your Purview or Atlas service
client.upload_entities(entities)
Learn more about the parse_*
family of methods on the ExcelReader
on the PyApacheAtlas Excel Support page.