PyApacheAtlas Excel Configuration

PyApacheAtlas supports using Excel for bulk uploads, creating type and classification definitions, updating lineage, and more. The default settings help you get started quickly, but you can also customize parts of the Excel spreadsheet, such as sheet names, column header prefixes, and the value separator.

Default Configuration and What Can Be Customized

You can customize your Excel template manually and then tell the ExcelReader about your changes by passing the updated values as parameters to an ExcelConfiguration.

Here's an example where you might...

from pyapacheatlas.readers import ExcelConfiguration, ExcelReader

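ec = ExcelConfiguration(
    # Character that separates multiple values inside a single cell
    value_separator = "|",
    # Name of the worksheet that holds the bulk entities
    bulkEntity_sheet = "Sheet1",
    # Column header prefixes, here in German
    source_prefix = "Quelle",
    target_prefix = "Ziel",
    process_prefix = "Prozess"
)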

reader = ExcelReader(ec)

entities = reader.parse_bulk_entities("./some/file/path.xlsx")

# Assuming `client` is a PurviewClient or AtlasClient that you've already
# authenticated to your Purview or Atlas service
client.upload_entities(entities)
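
To make the effect of these settings more concrete, here is a minimal sketch of how the customized values relate to what you type into the spreadsheet. The helpers split_cell and prefixed_headers are hypothetical and are not part of PyApacheAtlas. The "prefix, space, field name" header layout and the sample field names are assumptions for illustration, so check your own template for the exact headers.

```python
# Illustrative only: these helpers are hypothetical and are NOT part of
# PyApacheAtlas. They mimic, in simplified form, how the customized
# settings relate to the contents of the spreadsheet.

def split_cell(cell_value, value_separator="|"):
    """Split a multi-valued cell into a list of trimmed, non-empty values."""
    if cell_value is None:
        return []
    parts = str(cell_value).split(value_separator)
    return [part.strip() for part in parts if part.strip()]


def prefixed_headers(prefix, fields=("typeName", "qualifiedName")):
    """Build column headers from a prefix (assumed layout: 'prefix field')."""
    return [f"{prefix} {field}" for field in fields]


# A cell holding several values separated by the customized "|" character
cells = split_cell("PII | Confidential|Finance")

# Column headers you might expect after switching to the German prefixes
headers = prefixed_headers("Quelle") + prefixed_headers("Ziel")

print(cells)
print(headers)
```

If the headers in your workbook do not match the prefixes given to ExcelConfiguration, the reader will probably not find the columns it expects, so change the template and the configuration together.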

Learn more about the parse_* family of methods on the ExcelReader on the PyApacheAtlas Excel Support page.