Amplitude walkthrough
Amplitude is a product analytics platform for web and mobile.
Prerequisites
- Amplitude Connection. If you do not have one, you can create it using Amplitude Connection page.
- Organization Admin or Manager permissions in Amplitude.
Pulling data from Amplitude
Step 1: Select the Amplitude connection
- Choose the appropriate Amplitude connection from the available list in the Data Integration Console.
- If the desired connection does not exist, create or update an existing connection.
Step 2: Choose reports
Choose a report from the Report drop-down menu.
Data Integration can fetch data using the Raw Events report only in Amplitude Data Flow.
Events report
The Events report can include data about both Event Properties and User Properties on each event.
Set the Time Period for the report, as the report runs data by dates.
Data Integration manages the increment for the Data Flows and fetches incremental data from the source.
Step 3: Selecting Time period
If you choose Custom Range, define the Start Date to run the Data Flow. If you do not set the End Date, Data Integration runs to the current date on which you executed the run. Data Integration uses the next end date as the next Start Date in the next run.
Interval chunks: Data Integration can run over the increment using chunks to make the loading efficient and precise.
The available chunk options are:
- Don't Split: Data Integration pulls the data from the chosen start date to the end date in one bulk.
- Hourly: Data Integration chunks the Data Flow hourly from the start to the end date.
- Day: Data Integration chunks the run daily from the start to the end date.
- Week: Data Integration chunks the run weekly from the start to the end date.
- Monthly: Data Integration chunks the run monthly from the start to the end date.
Interval Size: You can also define the interval chunk size. For example, if you want a 3-hour chunk, set the Interval Chunks to Hourly and the Interval Size to 3.
Step 4: Reports structure configuration
You can normalize the results. This can increase the number of rows in the returned data.
Enabling this option for large datasets is not recommended.