Mixpanel walkthrough
You can integrate data from Mixpanel, a powerful analytics platform, into your web or mobile application. Mixpanel offers advanced user behavior tracking, event analysis, and user segmentation, enabling businesses to gain valuable insights into user interactions and optimize their products or services.
Connection
To connect Mixpanel with your destination, follow the Mixpanel connection guide. Choose a Source Connection after creating a connection.
Mixpanel reports in Data Integration
Data Integration provides access to the reports within the Mixpanel reporting dashboard. Create your Mixpanel River by choosing the Mixpanel report type.
Download raw data
The Download Raw Data report extracts every event of the selected event types within the defined time period. This report uses a standard GET request without any data size restrictions, making it easy to retrieve large datasets.
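As a rough illustration of what such a GET request can look like, the sketch below builds a URL for Mixpanel's raw event export endpoint. The endpoint and the from_date, to_date, and event parameter names are taken from Mixpanel's public Export API and should be checked against the current Mixpanel documentation. The build_export_url helper is hypothetical, and this is not necessarily the request Data Integration sends.

```python
import json
from datetime import date
from typing import Optional, Sequence
from urllib.parse import urlencode

# Assumption: Mixpanel's public raw event export endpoint.
EXPORT_URL = "https://data.mixpanel.com/api/2.0/export"


def build_export_url(from_date: date, to_date: date,
                     events: Optional[Sequence[str]] = None) -> str:
    """Build (but do not send) a raw-export GET URL for an inclusive date range."""
    params = {
        "from_date": from_date.isoformat(),
        "to_date": to_date.isoformat(),
    }
    if events:
        # The event filter is passed as a JSON-encoded array of event names.
        # Omitting it requests all events.
        params["event"] = json.dumps(list(events))
    return f"{EXPORT_URL}?{urlencode(params)}"


url = build_export_url(date(2024, 1, 1), date(2024, 1, 2), ["signup"])
```

Authentication is deliberately left out of the sketch. In Data Integration it is handled by the Mixpanel connection.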
Setting the time range
Start Date: Defines the date from which events are pulled. To capture events up to the present day, leave the End Date blank.
If a River run fails, the Start Date remains unchanged. If you want to update the Start Date regardless of River run success, navigate to More Options and enable the setting to advance the start date (not recommended).
Data filtering by date
Mixpanel filters data at daily granularity only. Use the Last days back option to go back a given number of days and capture earlier data.
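The interaction between the start date, a blank end date, and the Last days back option can be sketched as follows. This is a minimal illustration of the rules described above, not Data Integration's implementation. The resolve_date_range function and its parameter names are hypothetical.

```python
from datetime import date, timedelta
from typing import Optional, Tuple


def resolve_date_range(start: date, end: Optional[date] = None,
                       days_back: int = 0,
                       today: Optional[date] = None) -> Tuple[date, date]:
    """Return the inclusive (from_date, to_date) window for one run."""
    today = today or date.today()
    # A blank end date means "pull events up to the present day".
    to_date = end or today
    # Last days back moves the effective start earlier by whole days,
    # because filtering is only possible at daily granularity.
    from_date = start - timedelta(days=days_back)
    if from_date > to_date:
        raise ValueError("start date is after end date")
    return from_date, to_date


window = resolve_date_range(date(2024, 3, 10), None, days_back=3,
                            today=date(2024, 3, 15))
```

Here window covers 2024-03-07 through 2024-03-15: the start is pushed back three days and the blank end date resolves to the supplied value of today.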
Event type selection
Event Types: Choose the specific events for data extraction or leave blank to retrieve all events. A comprehensive list is available for selection, and additional events not listed can be added manually.
Instead of pulling all events into one River, it is recommended to categorize them by business relevance.
Data interval configuration
- Segment the data retrieval into intervals to manage large datasets and prevent API overloads, with daily intervals as the default.
- For extensive data retrieval periods, you may widen these intervals to decrease the number of API requests.
- For data sets with millions of events within the specified time frame, keep the retrieval intervals between 1 and 2 days to prevent Mixpanel service errors.
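The interval logic above amounts to cutting one long date range into consecutive chunks, each fetched with its own request. A minimal sketch, assuming inclusive day-based chunks (the split_into_intervals function is hypothetical):

```python
from datetime import date, timedelta
from typing import Iterator, Tuple


def split_into_intervals(start: date, end: date,
                         interval_days: int = 1) -> Iterator[Tuple[date, date]]:
    """Yield consecutive inclusive (from, to) chunks covering start..end."""
    if interval_days < 1:
        raise ValueError("interval_days must be at least 1")
    cursor = start
    while cursor <= end:
        # The last chunk is shortened so it never runs past the end date.
        chunk_end = min(cursor + timedelta(days=interval_days - 1), end)
        yield cursor, chunk_end
        cursor = chunk_end + timedelta(days=1)


chunks = list(split_into_intervals(date(2024, 1, 1), date(2024, 1, 5), 2))
```

With a 2-day interval, the five days from 2024-01-01 to 2024-01-05 become three requests instead of five. The trade-off is that wider intervals mean fewer requests but more events per request.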
Events per API call
Data Integration partitions large data requests into manageable event counts per API call. This segmentation ensures Mixpanel can efficiently process each request.
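Conceptually, this is a batching step. The sketch below shows the general idea of splitting a stream of events into fixed-size groups. The batched function and the batch size are illustrative assumptions, not the actual mechanism or defaults used by Data Integration.

```python
from itertools import islice
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def batched(events: Iterable[T], size: int) -> Iterator[List[T]]:
    """Yield lists of at most `size` events, preserving order."""
    if size < 1:
        raise ValueError("size must be at least 1")
    iterator = iter(events)
    while True:
        batch = list(islice(iterator, size))
        if not batch:
            return
        yield batch


batches = list(batched(range(5), 2))
```

Five events with a size of 2 produce three batches, the last one holding the single remaining event.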
Handling multi-type columns
Use Convert column to String to resolve issues with columns containing multiple data types (for example, '6' and 'A' in a 'grade' column).
- Initially, leave these settings blank until the mapping phase concludes.
- After auto-mapping, Data Integration indicates columns with multi-type issues. You can exclude these columns or convert them to strings for data compatibility.
- When converting to strings, all data within those columns is considered as text, which aids in loading the data to the target table.
- If Data Integration does not detect all multi-type columns and errors occur during data loading, you can adjust the mapping accordingly.
Use the Stringify all columns option for uniformity across all column types. This action is reversible by deselecting the option or removing columns from the conversion list and re-running the auto-mapping process.
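To make the multi-type problem concrete, the sketch below detects columns whose values have more than one Python type and converts the chosen columns to strings. It only illustrates the idea. The find_multi_type_columns and stringify_columns helpers are hypothetical and do not reflect how Data Integration performs auto-mapping.

```python
from typing import Any, Dict, Iterable, List, Set

Row = Dict[str, Any]


def find_multi_type_columns(rows: List[Row]) -> Set[str]:
    """Return the names of columns that hold values of more than one type."""
    seen: Dict[str, Set[type]] = {}
    for row in rows:
        for column, value in row.items():
            if value is None:  # missing values do not count as a type
                continue
            seen.setdefault(column, set()).add(type(value))
    return {column for column, types in seen.items() if len(types) > 1}


def stringify_columns(rows: List[Row], columns: Iterable[str]) -> List[Row]:
    """Return copies of the rows with the given columns converted to text."""
    targets = set(columns)
    return [
        {k: (str(v) if k in targets and v is not None else v)
         for k, v in row.items()}
        for row in rows
    ]


rows = [{"grade": 6, "name": "x"}, {"grade": "A", "name": "y"}]
mixed = find_multi_type_columns(rows)
cleaned = stringify_columns(rows, mixed)
```

In this example only the grade column is flagged, and after conversion its values are the strings '6' and 'A', so the column can be loaded as a single text type.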
Flatten arrays
Data Integration can flatten your Download Raw Data report array values. Flattening an array means transforming a multidimensional array into a one-dimensional array by concatenating all its elements.
Procedure
- Select the Flatten Arrays checkbox.
- Each report's array value will be an individual column. Each array contains an array_name alias in the schema mapping. For example, the report's Properties array will have an alias of properties for all values.
The Flatten Arrays option applies to the Download Raw Data report.
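As an illustration of turning nested values into individual columns, the sketch below flattens one raw event record. The column naming shown here (parent name as a prefix, underscore separator, numeric suffix for list items) is an assumption made for the example. The actual aliases are the ones shown in the schema mapping.

```python
from typing import Any, Dict


def flatten_record(record: Dict[str, Any], prefix: str = "",
                   sep: str = "_") -> Dict[str, Any]:
    """Flatten nested dicts and lists into a single-level dict of columns."""
    flat: Dict[str, Any] = {}
    for key, value in record.items():
        name = f"{prefix}{sep}{key}" if prefix else key
        if isinstance(value, dict):
            flat.update(flatten_record(value, name, sep))
        elif isinstance(value, list):
            for index, item in enumerate(value):
                item_name = f"{name}{sep}{index}"
                if isinstance(item, dict):
                    flat.update(flatten_record(item, item_name, sep))
                else:
                    # Lists nested directly inside lists are kept as-is.
                    flat[item_name] = item
        else:
            flat[name] = value
    return flat


event = {"event": "signup",
         "properties": {"browser": "Chrome", "tags": ["a", "b"]}}
flat = flatten_record(event)
```

Every value under properties ends up in its own column carrying the properties prefix, for example properties_browser.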
Retention
This report provides the retention data in 3 types: Recurring, First time, and Addiction.
- Select the retention report.
- Select the start date from which to start pulling events. Leave the end date empty to pull events until today:
a. Use days back to pull data from a given number of days before the start date. Having a retention report for a given number of recent days can be helpful.
b. Hard start date: Select this checkbox to make the River always run from the given start date. Otherwise, the start date is updated after the next run of the River.
- Select the events for the Retention report. For first time Retention report, select an additional event.
Clicking on the event input shows a list of all available events.
- Click Add condition to filter the results by events or user properties.
- Select a segment for the Retention report in the On input:
a. When you click the input, a list of all the available properties of the event is displayed.
b. When pulling a retention report with a segment, the results do not contain a date column. (However, you can add a date column in the mapping that contains the current date.)
- Select an interval count for the retention report: how many days back the report will contain. Leaving it empty results in two days only.
- Select the Retention report unit: This decides the level of granularity. Days returns the Retention report in days, and weeks/months returns the Retention report in weeks/months intervals.
Segmentation
This report provides a segmentation report with zero, one, or two segments.
- Select the Start date from which to start pulling events. Leave the end date empty to pull events until today.
a. Use days back to pull data from a given number of days before the start date. This can be useful for keeping a report over a given number of recent days.
b. Hard start date: Select this checkbox to make the River always run from the given start date. Otherwise, the start date is updated after the next run of the River.
- Select the event for the Segmentation report.
Clicking on the event input shows a list of all available events.
- Click Add condition to filter the results by events or user properties.
- Select segments for the Segmentation report in the On input (this input is visible only after selecting an event).
When you click the input, a list of all the event's available properties is displayed.
- Select the level of granularity in the Segmentation report. Each level changes the time granularity of the results (for example, week returns one row per week, and month returns one row per month).
- Select the analysis type of the Segmentation report. This input determines the nature of the count in the report. General is the total count.
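The effect of the granularity level on the result rows can be sketched by rolling daily totals up into weekly or monthly rows. This is only an illustration: the aggregate and bucket_start helpers are hypothetical, weeks are assumed to start on Monday, and summing is only valid for total (General) counts, since unique counts cannot be added across days.

```python
from collections import Counter
from datetime import date, timedelta
from typing import Dict


def bucket_start(day: date, unit: str) -> date:
    """Return the first day of the day, week, or month containing `day`."""
    if unit == "day":
        return day
    if unit == "week":
        return day - timedelta(days=day.weekday())  # assumed Monday start
    if unit == "month":
        return day.replace(day=1)
    raise ValueError(f"unsupported unit: {unit}")


def aggregate(daily_counts: Dict[date, int], unit: str) -> Dict[date, int]:
    """Sum daily total counts into one row per day, week, or month."""
    totals: Counter = Counter()
    for day, count in daily_counts.items():
        totals[bucket_start(day, unit)] += count
    return dict(totals)


daily = {date(2024, 1, 1): 2, date(2024, 1, 3): 3,
         date(2024, 1, 8): 5, date(2024, 2, 1): 1}
weekly = aggregate(daily, "week")
monthly = aggregate(daily, "month")
```

The same four daily values produce three weekly rows or two monthly rows, which is why the row count of a Segmentation report depends on the selected level.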
Funnels
- Select the Start date from which to start pulling events. Leave the end date empty to pull events until today:
a. Use Last days back to pull data from a given number of days before the start date. This can be useful for keeping a report over a given number of recent days.
b. Hard start date: Select this checkbox to make the River always run from the given start date. Otherwise, the start date is updated after the next run of the River.
- Select the Funnel for which you want the results. When you click the input, a list of all the Funnels in the account is displayed.
- Click Add overview condition to filter the results by properties. These conditions apply to the funnel overview (and not to a specific event that composes the funnel).
- Select segments for the Funnel report in the On input.
When you click the input, a list of all the event's available properties is displayed.
- Select the length unit and the unit of the funnel report. The default settings are for the original funnel in Mixpanel.
- Time resolution - Determines the resolution of the results.
a. Results of the funnels report are a record for each event that composes the funnel and the time range (columns date and to_date in the results).
b. The summary results in a record for each event in the funnel for the entire time range. Any other resolution will result in a record for an event and the selected time resolution.
c. For example, for a funnel with two events, A and B, a funnel report over 14 days returns 2 rows in Summary resolution (one row per event, each covering the entire time range). In daily resolution, the results contain 28 rows (one row for each event and day). Note that it is important to pull the funnels report in Summary resolution when referring to unique metrics. The results shown in Mixpanel are only in Summary resolution.
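The row arithmetic in this example can be checked with a small sketch that generates the row skeleton for each resolution. The funnel_rows function is hypothetical, and only the date and to_date column names come from the description above. Metric columns are omitted.

```python
from datetime import date, timedelta
from typing import Dict, List, Sequence, Union

FunnelRow = Dict[str, Union[str, date]]


def funnel_rows(events: Sequence[str], start: date, end: date,
                resolution: str = "summary") -> List[FunnelRow]:
    """Return one row skeleton per event and time bucket."""
    if resolution == "summary":
        # One row per event, covering the whole time range.
        return [{"event": e, "date": start, "to_date": end} for e in events]
    if resolution == "daily":
        days = (end - start).days + 1  # inclusive range
        return [
            {"event": e,
             "date": start + timedelta(days=d),
             "to_date": start + timedelta(days=d)}
            for d in range(days)
            for e in events
        ]
    raise ValueError(f"unsupported resolution: {resolution}")


start, end = date(2024, 1, 1), date(2024, 1, 14)
summary = funnel_rows(["A", "B"], start, end, "summary")
daily_rows = funnel_rows(["A", "B"], start, end, "daily")
```

For the 14-day range, summary holds 2 rows and daily_rows holds 28, matching the counts given above.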
Profiles
This report provides all the users in the Mixpanel account. The results contain every user that exists in the account at the time the report runs.
Input Stringify lists: The results contain columns that are lists. Select this checkbox to have those lists in String types in the target table.
Events, event properties
General reports that provide a summary analysis of the selected events and properties. The results contain the count of the event/property for each day in the time range.
Events top, events properties top, event property values, user properties, user property values.
These reports provide look-up tables that contain the data in the Mixpanel account.
Export raw data
The Export Raw Data report mirrors the Download Raw Data report in features but employs JQL, which imposes a 5GB limit on data retrieval.