Google Search Reader
Note
This adapter is in preview and is available on Striim Developer only. See Striim Developer for more information.
The Google Search Reader brings Google Search data into Striim in a structured form for reporting and analysis. You can combine this data with the data in your data warehouses to evaluate the performance of search queries, keywords, and pages, and use the results to refine content strategies and marketing campaigns.
Feature summary
Category | Feature | Supported? | Notes |
---|---|---|---|
Objects | Standard objects | ✓ | |
Objects | Custom objects | ✓ | |
Authentication | Basic authentication | | Username and password |
Authentication | OAuth authentication | | Manual configuration based |
Authentication | Custom authentication methods | ✓ | API key-based authentication |
Operations | Automated mode | ✓ | |
Operations | Initial load | ✓ | |
Operations | Pull-based incremental load | ✓ | |
Operations | Push-based incremental load | | |
Operations | Automated pipeline | | |
Governance | Connection profile | | |
Governance | Sherlock AI | | |
Governance | Sentinel AI | | |
Schema handling | Initial schema creation | ✓ | Works with supported targets |
Schema handling | Schema evolution | | |
Setup | Wizard template | | |
Setup | Flow Designer | ✓ | |
Setup | Striim TQL | ✓ | |
Runtime | Resilience/recovery | ✓ | Supported only for incremental load. Recovers the data based on the provided Incremental Load Marker. |
Runtime | Parallel execution | | |
Runtime | Metrics | ✓ | Standard metrics |
Supported authentication method
The Google Search Reader supports connecting to Google Search using API key-based authentication. To set up and fetch authorization credentials:
Configuring a custom search engine site
Sign in to the Google Custom Search Engine site.
Click New Search Engine and add a site to focus the search on.
Click Edit Search Engine, select your search engine in the menu, and click Setup.
In the Details section, click Search Engine Id to retrieve the search engine ID used as the CustomSearchId in the connection string.
In the Sites to Search section, you can select the Search the Entire Web but Emphasize Included Sites option in the menu.
Getting the API key
To authenticate requests to the Custom Search API, create an API key:
Log in to the Google API console and select a project.
In the sidebar under APIs & Services, choose Library and then select the Custom Search API. Click Enable.
In the sidebar, choose APIs & Services > Credentials.
Choose Create Credentials > API Key.
Select the API key you created and restrict it to your platform.
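Before entering the credentials in Striim, it can help to see how the two values fit together. The following is a minimal sketch, not Striim's implementation: it builds a request URL for Google's public Custom Search JSON API, where the API key is passed as key and the search engine ID as cx. The helper name build_search_url is hypothetical, and the assumption that ImageSearch corresponds to the API's searchType=image parameter is ours, not something this page states.

```python
from urllib.parse import urlencode

# Public endpoint of Google's Custom Search JSON API.
CUSTOM_SEARCH_ENDPOINT = "https://www.googleapis.com/customsearch/v1"


def build_search_url(api_key, search_engine_id, query, image_search=False):
    """Return a Custom Search JSON API request URL (hypothetical helper).

    No network call is made; the function only assembles the URL.
    """
    if not api_key or not search_engine_id:
        raise ValueError("both the API key and the search engine ID are required")
    params = {
        "key": api_key,          # API key created in the Google API console
        "cx": search_engine_id,  # search engine ID (CX) from the search engine setup
        "q": query,              # the search expression
    }
    if image_search:
        # Assumption: image results are requested with searchType=image.
        params["searchType"] = "image"
    return CUSTOM_SEARCH_ENDPOINT + "?" + urlencode(params)


if __name__ == "__main__":
    # Placeholder credentials; substitute your own values.
    print(build_search_url("YOUR_API_KEY", "YOUR_SEARCH_ENGINE_ID", "striim"))
```

Opening the printed URL in a browser, with real values substituted, is a quick way to confirm that the key is enabled for the Custom Search API and that the search engine ID is valid before you configure the reader.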
Supported objects
The Google Search Reader can read from the following supported objects in Google:
ImageSearch
WebSearch
Google Search Reader properties
Property | Type | Default value | Notes |
---|---|---|---|
API key | Password | | Google API key generated for authentication. |
Connection pool size | Integer | 20 | Specifies the maximum number of active connections. |
Custom search ID | String | | Unique ID associated with the search engine created. The Search Engine ID (CX) is visible in the settings of your Custom Search Engine. |
Exclude tables | String | | A list of tables excluded from read operations. Typically used to create a list of exceptions when the Tables property includes wildcards. Misconfiguration of the Tables and Exclude Tables properties can cause "Invalid table names" errors. |
Incremental load marker | String | | The incremental load marker is a unique incremental column in each object used for incremental load. When no marker is specified, tables are resynced at each polling interval. Specify the name of the column that contains the start position value. This column must meet the following criteria: |
Migrate schema | Boolean | False | Only available in Initial Load or Automated mode. Set to |
Mode | Select list | Automated | Automated mode applies incremental updates to objects that support incremental load and performs full resyncs for objects that do not support incremental load. |
Polling interval | Integer | 5m | Specifies an interval as an integer followed by a unit. Supported units are days ( |
Refresh token | Password | | An OAuth 2.0 refresh token. Use the value generated while creating the token. |
Search term | String | | Search expression for the data you want to extract. Format: <TableName1>=<searchTerm1>;<TableName2>=<searchTerm2>. For example, WebSearch=striim extracts data that contains the search phrase "striim". |
Start Position | String | %=-1 | Value of the incremental load marker that defines the initial reading position. |
Tables | String | | A semicolon-delimited (;) list of objects to read from the source. Supports the |
Thread pool count | Integer | 10 | The number of threads that run in parallel. A value of zero specifies single-threaded operation. When the thread pool count is higher than the connection pool size, large data ingestion operations can cause the app to halt. Best performance is achieved with one thread for each table being synced, so as a best practice, increase the connection pool size to match the number of threads in use. |
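Two of the properties above follow rules that are easy to check before deploying an application: the Search term format and the relationship between Thread pool count and Connection pool size. The sketch below illustrates both under stated assumptions. The function names parse_search_term and pool_sizes_ok are hypothetical and are not part of Striim, and the parsing rules (splitting on semicolons, then on the first equals sign) are our reading of the documented format rather than the adapter's actual parser.

```python
def parse_search_term(expression):
    """Split '<Table1>=<term1>;<Table2>=<term2>' into a {table: term} dict.

    Hypothetical helper: assumes search terms do not contain semicolons.
    """
    terms = {}
    for part in expression.split(";"):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing semicolon
        # Split on the first '=' only, so a term may itself contain '='.
        table, sep, term = part.partition("=")
        table, term = table.strip(), term.strip()
        if not sep or not table or not term:
            raise ValueError("expected <TableName>=<searchTerm>, got %r" % part)
        terms[table] = term
    return terms


def pool_sizes_ok(thread_pool_count, connection_pool_size):
    """Return True when the thread count does not exceed the connection pool size."""
    return thread_pool_count <= connection_pool_size


if __name__ == "__main__":
    print(parse_search_term("WebSearch=striim;ImageSearch=striim logo"))
    print(pool_sizes_ok(thread_pool_count=10, connection_pool_size=20))
```

With the default values in the table (Thread pool count 10, Connection pool size 20), the check passes. If you raise the thread count to match a larger number of tables, raise the connection pool size along with it.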