My fist blog post is on a topic which connects me to Talend in a special way - custom components. The open source nature and concept of custom components are a chance - especially for consultants/developers like me - to open spaces and make Talend a solution which does not end at the limits given by the vendor.
Developers can publish these components or extensions on the Talend Exchange platform for other users to leverage and benefit from.
Today, I want to talk about a few ways to enable Talend to work with Google services (Analytics, YouTube, Adwords) using these custom components found on Talend Exchange. Google is one of the largest service providers for web analytics. With its substantial knowledge of the Internet and a large pool of data, Google has carved out a leading market position in this area and they offer a very good web interface for accessing web data. They’ve also developed RESTful web service interfaces for clients that want to use this data in conjunction with their own in-house data and in a data warehouse.
Here are 9 ways you can use Google’s services with Talend right now:
1. Google Analytics: Core Reporting API
This is a classic among the Google APIs. Web analytics is a core competence of Google and the API is well tested and well engineered. Google Analytics Reports are described by a compilation of dimensions, metrics, filter conditions (etc.) and a start and end date. These reports are based on data collected daily.
Component:tGoogleAnalyticsInput
Special Features:
- Automatic repetition of requests for certain error types
- Can use service accounts
- Normalized output of the results
2. Google Analytics: Unsampled Reports API
If you are a Google Analytics user that deals with large data quantities (for example: web clicks) using the Core Reporting API, it can happen that Google does not set up the reports based on the whole database, but rather on a subset. This is called sampling. Usually the deviation is rather small, less than 5 per cent, but it can reach double digit rates if filters are applied.
However, there is a way to get to all that data! Google supplies an API in order to create such reports in an asynchronous manner from the full data quantity.
The necessary steps to do so are:
- Starting of the report
- Checking the processing status
- Download of the results (Google Drive)
- Import of the results
Component:tGoogleAnalyticsUnsampledReports
This component supports the steps 1, 2 and 4, with dedicated operational modes.
Special Features:
- Automatic repetition of requests for certain error types
- Can use service accounts
- Normalized output of the results
3. Google Drive API
Google offers a very safe cloud with hardly any limits in space.
This cloud is interesting in the setting of Google Analytics as the results of un-sampled reports that are preferably stored as CSV files in Google Drive.
We can execute the following file operations in Talend with the fitting/right component:
- Upload (including the setting of rights)
- Download
- Delete
- Move/Copy
- List (including directories and user rights)
Component:tGoogleDrive
Special Features:
- Automatic repetition of requests for certain error types
- Can use service accounts
- Normalized output of the results
Talend has developed special components for Google Drive. They differ mainly by the above mentioned special features. Thus for the time being there is a necessity to use these special Google Drive components.
4. Google Analytics: Real Time API
In order to collect web identification figures very soon after data processing, Google offers a Real-time API. It supplies less key data points than the Core Reporting API, but it supplies real-time results and results from the preceding minutes unlike the Core Reporting API. This is especially interesting for live monitoring in connection with marketing campaigns.
Component:tGoogleAnalyticsRealtimeInput
Special Features:
- Automatic repetition of requests for certain error types
- It is possible to use service accounts
- Normalized output of the results
- Dimension minutesAgo is additionally given as time stamp (real time)
5. Google Analytics: Management API
All website management data is available via this API:
- Accounts
- Web properties
- Views
- Segments
- Goals
- User rights
- Descriptions for all dimensions and metrics
This data helps to keep an overview of your website performance. A report on all users and their user rights for the accounts in the own data warehouse - their history is available as well, which might be interesting for larger companies.
Component:tGoogleAnalyticsManagement
Special Features:
- Automatic repetition of requests for certain error types
- It is possible to use service accounts
6. Google AdWords: AdWords Report API
These days, Google AdWords is THE standard for digital advertising on the web. Google offers various reports regarding advertising performance and data for campaigns. It is absolutely necessary to have a regular look at these evaluations in order to take trends into account and in order to place the advertisements effectively.
Component:tGoogleAdWordsReports
Special Features:
- Automatic repetition of requests for certain error types
- Possible to use service accounts
- AdWords Query Language can also be applied as regular report controlled by attributes
- Download of the results as flow or allocation as input flow
7. Google Analytics: Multi-Channel-Funnel Analysis Reports
Google offers functionality that can be used to survey the conversion of potential clients via so called channel funnels. The Talend component collects data from the Google Analytics Multi Channel Funnel API.
Component:tGoogleAnalyticsMCFInput
8. Google Analytics: Upload
Google offers a function for return-on-investment (ROI) calculations which can integrate external data sources into the reports. These data sources are fed out of CSV files.
The files then can be comfortably and automatically uploaded into the data sources with Talend. The data can be processed by Talend jobs and the upload is done by Talend jobs as well.
Component:tGoogleAnalyticsUpload
9. Google YouTube Analytics
There are also analysis and reports for YouTube channels reporting. Typically, the advertising departments request a longer storage of these data in the data warehouse and expect detailed reports about trends.
Component:tYoutubeAnalyticsInput
In this post we have examined how Talend can connect to several services including Google Analytics, Adwords, Drive and YouTube. Visit Talend Exchange to download and try these extensions or to explore other useful components, connectors, jobs, templates, patterns, data models and more contributed by Talend and the broader community.