2 points | by renegat0x0 2 days ago
1 comments
I would like to share an open database focused on link-level metadata extraction and aggregation, which may be of interest to researchers.
The project maintains a structured dataset of links enriched with metadata such as:
- page title
- description / summary
- publication date (when available)
- thumbnail / preview image
- etc.
The goal is to provide a reusable, inspectable set of link metadata that can be used for experiments in areas such as:
- RSS and feed analysis
- news analysis
- link rot analysis?
The database is publicly available here:
https://github.com/rumca-js/RSS-Link-Database-2025
There are also databases for previous years
I would like to share an open database focused on link-level metadata extraction and aggregation, which may be of interest to researchers.
The project maintains a structured dataset of links enriched with metadata such as:
- page title
- description / summary
- publication date (when available)
- thumbnail / preview image
- etc.
The goal is to provide a reusable, inspectable set of link metadata that can be used for experiments in areas such as:
- RSS and feed analysis
- news analysis
- link rot analysis?
The database is publicly available here:
https://github.com/rumca-js/RSS-Link-Database-2025
There are also databases for previous years