ArcGIS portal discovery
How ArcGIS Hub/Open Data portal discovery works (sitemap + orgId fallback).
What discovery does
When you add an ArcGIS portal as a PORTAL source, discovery attempts to expand it into individual dataset URLs.
It uses two strategies:
-
Sitemap discovery
- tries common sitemap paths (
/sitemap.xml,/sitemap_index.xml) - extracts URLs that contain
/datasets/
- tries common sitemap paths (
-
OrgId fallback
- fetches the portal HTML and tries to extract
orgId - enumerates datasets via
https://hub.arcgis.com/api/v3/datasets?filter[orgId]=...
- fetches the portal HTML and tries to extract
When to provide a sitemap URL
Some portals block generic crawling or hide sitemap routes.
If discovery returns no datasets, add a source of type SITEMAP pointing directly at a sitemap URL.