About Data Sources
What data sources are, why they matter, and how they power search, latest news, deep search, and analysis across the News module.
About Data Sources
Data sources are the connections that bring content into the News module. Every search result, every "latest news" feed, and every document available for analysis comes from a data source. Without data sources, there is nothing to search, read, or analyze.
Navigate toNews → Data Sourcesto view and manage the custom data sources available to your organization.
What Is a Data Source?
AData Sourceis a configured connection to an external content provider. It can point to a website, a social media channel, an RSS feed, a news portal, a technology platform, or a file store — anything that produces content you want to monitor or analyze.
Each data source has a set ofcapabilitiesthat define what it can do. Depending on the capabilities, a data source can:
- Return thelatestarticles or posts
- Besearchedby keyword or semantically
- Providecommentsattached to documents
SeeData Source Capabilitiesfor a full description of all capabilities.
Built-in vs. Custom Data Sources
The News module combines two layers of data sources:
- Built-in data sources— Over 100 pre-configured sources maintained by the platform covering news portals, social media, and specialized content feeds. These are always available and require no configuration.
- Custom data sources— Sources you configure yourself. These let you target a specific website, social channel, RSS feed, or upload your own files, giving you focused coverage tailored to your organization's needs.
Custom data sources are managed on theNews → Data Sourcespage.
Data Source Types
Custom data sources fall into several categories:
- Web— Google Site search, RSS and Atom feeds, WordPress sites, Google Custom Search, and the Telex news portal
- Social— Facebook Pages and Profiles, YouTube Channels, Reddit Subreddits and Users, TikTok Users, Instagram Users, and Polymarket Events
- AI-enhanced— An AI-powered web page reader that uses a language model to extract structured content from any page (Beta), and a Combined source that merges several existing sources into one (Beta). SeeMerged Data Sources.
- Technology— GitHub Repositories for tracking issues, pull requests, and discussions
- Corporate— Microsoft Outlook, OneDrive, Entra, Gmail, and Google Drive (coming soon)
SeeData Source Typesfor details on each type.
Enhanced Data Stores
In addition to online connections, the platform providesEnhanced Data Stores— private document stores where you can upload your own files or pull content from existing data sources and enrich it with vector search, AI transcription, OCR, and other processing capabilities.
Enhanced Data Stores are managed separately underNews → Data Sources → Enhanced Data Stores.
SeeEnhanced Data Storesfor a full explanation.
Where Data Sources Are Used
Data sources appear throughout the News module:
| Where | What the data source provides |
|---|---|
| News → Search | The sources queried when you run a keyword or semantic search. SeeAbout Search. |
| News → Deep Search | The sources used during automated multi-step research |
| Dashboards | The sources powering "Latest News" and search-based dashboard tiles |
| Entities | Sources linked to an entity to collect media coverage automatically |
When you add a custom data source, it becomes immediately available for all these purposes across your organization.