Skip to main content
Feedback

Getting started with Boomi DataHub

The Boomi DataHub is a cloud-based, flexible master data synchronization service that helps you keep valuable data domains, such as customer data, consistent, reliable, and accurate.

Hub is set up, configured, and managed from a web browser in a single instance, multi-tenant environment. It synchronizes with Boomi Integration to connect to any combination of SaaS cloud, local, and hybrid environments. You can configure data sources to contribute to and/or receive quality data using integrations and the Boomi DataHub connector.

After you define the ideal record and deploy a data model, Hub identifies record elements that do not match your data quality standards. Hub maintains validated and up-to-date records, called golden records, and quarantines low-quality data for your review.

Multiple applications in your organization can coordinate and reference golden records to obtain consistent, high-quality, up-to-date information.

Boomi DataHub lifecycle

There are 4 core data management activities in Hub:

Dart in the center of a dart boardDefineDefine the characteristics and criteria for data models in a domain.
A database in the cloudDeployDeploy models to a Hub repository and identify the source systems that will interact with them.
Large box connected to smaller boxes illustrating data flowSynchronizeLeverage Integration to orchestrate data synchronization and design process flows to ensure data quality.
An eye next to reports symbolizing data stewardshipStewardSteward data as it flows into domains to resolve duplicates and fix data entry issues, as well as identify and correct inaccurate data.

Enroll in the DataHub Essentials course to learn more about data management, data stewardship, and the lifecycle.

Boomi DataHub architecture

The Boomi Hub Cloud hosts your repositories, deployed models, and golden records. Sources use integrations to connect to the deployed model to contribute master data, access master data, or both. Models can reference data from other models in the same repository. For example, the Contact model can reference data from the customer ID field in the Customer model.

Boomi recommends you create a development repository, test repository, and production repository so you can develop and test the flow of data and prevent errors. Your production repository is the single source of truth for your business data.

Diagram illustrating the data flow between Integration and Hub: sources contribute data to the model and sources accept master data from the Hub repository where the deployed model resides.

Data management workflow: A quick start guide

There are 8 steps to create a new data management project in Boomi DataHub.

Step 1: Create your repositories

Create a repository that will host your master data, models, and source configurations. Repositories are virtual runtimes for your validated, trusted data. The data in a repository is hosted in the Boomi Hub Cloud. By default, you can have up to three repositories.

Boomi recommends that you create the following three repositories to minimize the risk of errors to live master data:

  • Development repository - use this repository to establish and update deployed models and source settings with a small amount of data. It allows developers to safely experiment with new models and updates.

  • Test repository - use this repository with a larger amount of data to test connections and ensure data flows correctly between golden records and sources.

  • Production repository - use this repository to contain the actual, live master data that is accessed by data users for business decisions.

Read Repositories overview and Creating a Repository to learn more.

Step 2: Create integrations in Boomi Integration

Create integrations that will flow data to and from sources using Boomi Integration and the Boomi DataHub connector. Read the following topics to help you:

Although you can use the Boomi DataHub APIs to build integrations between your sources and repository, using Integration simplifies the process because it:

  • Does not require coding to build integrations
  • Contains built-in tools to deploy and manage processes
  • Allows you to use the Boomi DataHub connector, which handles the technical aspects of exchanging data between sources and repositories
  • Simplifies administration. Integration and Boomi DataHub are interconnected

Step 3: Create sources in Boomi DataHub

Establish source connections that will contribute data to the repository, accept record updates, or both. Source applications can be local or cloud. Read Creating a source to learn more.

Step 4: Create a model

Create a model that defines the structure of golden records. Models contain rules to identify new records, identify record updates, and quarantine low-quality data. Read Creating a model to learn more.

Step 5: Configure source settings in the model

You can specify how sources contribute and accept data. Source configurations automatically attach to any deployed model across repositories. Read Adding a source to a model to learn more.

Step 6: Publish and deploy the model to your repository

Hub uses your deployed model to load data from sources, create golden records, and maintain master data in your repository. Read Publishing a model and Deploying a model to a repository to learn more.

Step 7: Synchronize and load data from sources

Load data from sources into your repository. Read Loading data from a source to learn more.

Step 8: Steward data in golden records

View golden records and quarantined data. Read Viewing domain data and Viewing a domain’s quarantine entries to learn more.

On this Page