Copy data between workspaces

To copy data between lakehouses in different workspaces, use the Copy_Data_Lakehouses notebook. The notebook requires the following parameters:

| Parameter | Example | Description |
| --- | --- | --- |
| source_workspace | FB-Company-PRD | The workspace to get the data from |
| destination_workspace | FB-Company-TST | The workspace to copy the data into |
| source_lakehouse | Bronze | The lakehouse to get the data from |
| destination_lakehouse | Bronze | The lakehouse to copy the data into |
| source_schema | his | The schema in the source |
| destination_schema | his | The schema in the destination |
| opt_filter_table | None | Optional filter on the tables to copy. Supply the first characters of the table name; only tables whose name starts with this string are copied. This is very basic prefix filtering. |
| max_worker | 5 | The number of tables copied in parallel. The default is 5 tables at once. |
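The prefix filter and the parallel copy described above can be sketched roughly as follows. This is not the notebook's actual code: `select_tables`, `copy_tables` and the table names are hypothetical, the copy step is replaced by a stand-in function, and the sketch assumes that `None` means "no filter" and that the prefix match is case-sensitive.

```python
from concurrent.futures import ThreadPoolExecutor


def select_tables(table_names, opt_filter_table=None):
    """Keep only tables whose name starts with the given prefix.

    Assumption: None means no filtering, so every table is kept.
    """
    if opt_filter_table is None:
        return list(table_names)
    return [name for name in table_names if name.startswith(opt_filter_table)]


def copy_tables(table_names, copy_one, max_worker=5):
    """Call copy_one for every table, at most max_worker tables at a time."""
    with ThreadPoolExecutor(max_workers=max_worker) as pool:
        # pool.map returns results in the same order as table_names
        return list(pool.map(copy_one, table_names))


# Hypothetical table names, for illustration only
tables = ["customer", "customer_address", "orders"]
selected = select_tables(tables, opt_filter_table="cust")

# Stand-in for the real copy step: it only returns the table name
copied = copy_tables(selected, copy_one=lambda name: name, max_worker=5)
```

With these inputs, only the two tables starting with "cust" are selected, and "orders" is skipped.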

Limitations

Copying into a workspace that has "prd", "prod" or "production" in its name is prevented, because you normally don't want to use this method to copy data into a production workspace. The check looks only at the name: if your production workspace is not named this way, data will still be copied into it.
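A name check of this kind could look like the sketch below. It is an illustration, not the notebook's implementation: `is_blocked_destination` is a hypothetical helper, and both the case-insensitive comparison and the plain substring match are assumptions.

```python
# Name fragments that mark a workspace as production (taken from the text above)
BLOCKED_MARKERS = ("prd", "prod", "production")


def is_blocked_destination(workspace_name):
    """Return True if the workspace name contains one of the blocked markers.

    Assumption: the comparison ignores case and matches anywhere in the name.
    """
    name = workspace_name.lower()
    return any(marker in name for marker in BLOCKED_MARKERS)


blocked = is_blocked_destination("FB-Company-PRD")
allowed = not is_blocked_destination("FB-Company-TST")
```

Under these assumptions, a production workspace with an unrelated name (for example a hypothetical "FB-Company-Live") passes the check, which is why the name alone is not a reliable safeguard.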

The destination table has to exist.

The destination table is overwritten, including its schema.