Monte Carlo
What is Monte Carlo?
Monte Carlo is a leading data observability platform that helps data teams monitor, resolve, and prevent data quality issues. It provides insights into the health, freshness, and lineage of your data assets across your entire data stack.
Value of Monte Carlo with Paradime
Integrating Monte Carlo with Paradime enables teams to centralize data observability and enhance the monitoring of production jobs (Bolt schedules) and dbt™ models. Key benefits include:
Enhanced Observability: Overlay dbt™ context onto Monte Carlo's lineage graph for easier troubleshooting.
Incident Detection: Detect and centralize dbt™ model errors, test failures, and other data incidents in one place.
Run Insights: Visualize dbt™ job execution times, success/error statuses, and run histories.
Simplified Impact Analysis: Evaluate downstream and upstream impacts of dbt™ transformations on table updates.
With this integration, data teams can proactively address failures, optimize dbt™ models, and ensure reliable data pipelines.
Setting Up the Integration
Follow these steps to configure the Monte Carlo integration within Paradime.
Step 1: Generate API Key and API ID
Log in to your Monte Carlo account.
Follow the instructions in Monte Carlo Docs to generate:
API Key
API ID
The key is required to be generated with the "Editor" or "Owner" roles, for example if you create a Service Account Key you need to select "Editors" or "Account Owners" under "Authorization Groups".
If you're using a personal key, the user that generated it needs to be an "Editor" or "Owner".
Step 2: Add API Credentials to Paradime
In the Paradime UI, go to: Settings → Workspace → Environment Variables.
Add the following variables and their respective values from Step 1:
MCD_DEFAULT_API_TOKEN
MCD_DEFAULT_API_ID
Click Save to confirm the changes.
Step 3: Set Your Project Name
In the same Environment Variables section, add the following variable:
MONTECARLO_PROJECT_NAME
Set a value for the project name.
You can reuse your existing dbt project name or create any name that aligns with your dbt models.
Step 4: Obtain the Connection ID
The Connection ID identifies the warehouse or lake connection in Monte Carlo. Follow these steps to retrieve it:
Navigate to Bolt in Paradime.
Create a temporary Bolt schedule:
Go to
Schedules → + New Schedule → Create New Schedule
Provide a schedule name.
In the command settings, enter the following Monte Carlo command:
Click Deploy.
From the Bolt Home Screen, select the newly created schedule and click Run.
In the Run History, open the most recent execution logs.
Locate the Connection ID in the Run Logs.
Step 5: Add the Connection ID
Copy the Connection ID from the logs.
Go back to the Environment Variables section in Paradime.
Add the following variable:
MONTECARLO_CONNECTION_ID
Click Save to confirm.
Step 6: Enable the Integration
In the same Environment Variables section, add the following variable:
RUN_MONTECARLO_UPLOAD
Set its value to
TRUE
.
By the end of this step, your Monte Carlo environment variables should include:
Testing the Integration
To verify the integration, run the following steps in Paradime's Bolt:
Trigger a Run for one of you Bolt schedule which which contains either
dbt build
,dbt run
ordbt test
command.Verify the results in Monte Carlo:
Check the lineage graph for updated dbt™ context.
View job statuses, model run results, and test outcomes.
For more details on the logs that Montecarlo will ingest check the Montecarlo dbt integration documentation.
Last updated