---
title: Data Source Types
description: A guide to all available custom data source types, their required configuration, and the capabilities each type provides.
---

# Data Source Types

When you add a custom data source from the [News → Data Sources]({APP_HOST}/documents/datasources/definitions) page, you choose one of the types below. Each type connects to a different kind of external service and brings its own set of capabilities.

For an explanation of what each capability means, see [Data Source Capabilities](data-source-capabilities).

---

## Web Sources

### Google Site

Searches a specific website using Google's search index. You provide the domain name (e.g. `bbc.com`) and the platform queries Google for pages on that domain.

| | |
|---|---|
| **Required** | Domain name |
| **Capabilities** | Web Search |
| **Best for** | Finding articles on a known news portal or corporate site without an RSS feed |

### RSS Feed

Connects to any standard RSS feed. Fetches the latest items published to the feed URL.

| | |
|---|---|
| **Required** | Feed URL |
| **Capabilities** | Latest |
| **Best for** | Monitoring blogs, news portals, and any service that publishes an RSS feed |

### Atom Feed

Same as RSS but for Atom-format feeds. Many platforms (including GitHub and government portals) publish Atom instead of RSS.

| | |
|---|---|
| **Required** | Feed URL |
| **Capabilities** | Latest |
| **Best for** | Sources that use Atom rather than RSS |

### WordPress

Connects to a WordPress site through its built-in JSON API (WP REST API v2). Returns recent posts from the site.

| | |
|---|---|
| **Required** | Site URL |
| **Capabilities** | Latest |
| **Best for** | Monitoring WordPress-powered news portals and blogs |

### Google Custom Search

Uses a Google Programmable Search Engine to query a curated set of websites. Requires a Search Engine ID from the Google Programmable Search Control Panel.

| | |
|---|---|
| **Required** | Search Engine ID |
| **Capabilities** | API Search |
| **Best for** | Targeted searching across a defined set of websites you control |

### Telex

Connects to the Telex Hungarian news portal. You can filter by section (rovatok) and/or tag (cimkék) to narrow coverage to a specific topic or editorial area.

| | |
|---|---|
| **Required** | At least one section or tag |
| **Capabilities** | Latest, API Search |
| **Best for** | Monitoring Hungarian news filtered to a specific topic or editorial section |

---

## AI-Enhanced Sources

### AI Web Page Reader *(Beta)*

Uses a large language model (LLM) to read and extract content from a specific web page URL. Unlike a simple RSS feed, the model understands the page structure and can extract meaningful content even from pages without a feed. You can provide additional instructions to guide what the model extracts.

| | |
|---|---|
| **Required** | Page URL; choice of language model |
| **Optional** | Additional instructions for the model |
| **Capabilities** | Latest |
| **Best for** | Monitoring a specific page that does not publish an RSS feed, or where structured extraction is needed |

### Combined *(Beta)*

Merges two or more existing data sources into a single logical source. When you search or browse the combined source, the platform queries all the member sources simultaneously.

| | |
|---|---|
| **Required** | Two or more existing data sources from your organization |
| **Capabilities** | Inherits the capabilities of all member sources |
| **Best for** | Creating a unified source from several related feeds or sites so you can search them together |

See [Merged Data Sources](merged-data-sources) for a full guide on creating and using combined sources.

---

## Social Sources

### Facebook Page

Connects to a public Facebook Page. Returns the page's posts and, for each post, the reader comments.

| | |
|---|---|
| **Required** | Page slug (the part of the URL after `facebook.com/`) |
| **Capabilities** | Latest, Comments |
| **Best for** | Monitoring the posts and audience reactions on a brand, media, or public figure page |

### Facebook Profile

Connects to a public Facebook user profile. Returns the profile's public posts and comments.

| | |
|---|---|
| **Required** | Profile slug |
| **Capabilities** | Latest, Comments |
| **Best for** | Tracking public activity and audience reactions on an individual's profile |

### YouTube Channel

Connects to a YouTube channel. Returns the channel's videos and, for each video, the viewer comments.

| | |
|---|---|
| **Required** | Channel ID |
| **Capabilities** | Latest, Comments |
| **Best for** | Monitoring video output and audience sentiment for a creator, news channel, or brand |

### Reddit Subreddit

Connects to a Reddit community (subreddit). Returns the most recent posts and their comments.

| | |
|---|---|
| **Required** | Subreddit name (without `r/`) |
| **Capabilities** | Latest, Comments |
| **Best for** | Monitoring community discussions on a specific topic |

### Reddit User

Connects to a Reddit user's profile. Returns the user's recent posts and comments.

| | |
|---|---|
| **Required** | Reddit username |
| **Capabilities** | Latest, Comments |
| **Best for** | Tracking the activity of a specific Reddit user |

### TikTok User

Connects to a TikTok user's profile. Returns the user's recent videos.

| | |
|---|---|
| **Required** | TikTok username |
| **Capabilities** | Latest |
| **Best for** | Monitoring a creator's content output on TikTok |

### Instagram User

Connects to a public Instagram profile. Returns the user's recent posts.

| | |
|---|---|
| **Required** | Instagram username (slug) |
| **Capabilities** | Latest |
| **Best for** | Monitoring an influencer's, brand's, or public figure's Instagram content |

### Polymarket Event

Connects to a Polymarket prediction market event and returns the comments posted on that event.

| | |
|---|---|
| **Required** | Event slug (the last segment of the Polymarket event URL) |
| **Capabilities** | Latest, Comments |
| **Best for** | Monitoring public opinion and discussion around a specific prediction market event |

---

## Technology Sources

### GitHub Repository

Connects to a GitHub repository. Returns issues, pull requests, and discussions.

| | |
|---|---|
| **Required** | Repository owner and repository name |
| **Capabilities** | Latest |
| **Best for** | Tracking open-source project activity, release discussions, and community feedback |

---

## Corporate Sources *(Coming Soon)*

The following corporate integration types are planned and will be available in a future release:

| Type | Description |
|---|---|
| **Microsoft Outlook** | Email inbox monitoring |
| **Microsoft OneDrive** | File and document monitoring |
| **Microsoft Entra** | Identity and directory integration |
| **Gmail** | Google email inbox monitoring |
| **Google Drive** | Google file and document monitoring |
