DataShield MCP Dataset Library

ArcGIS portal discovery

How ArcGIS Hub/Open Data portal discovery works (sitemap + orgId fallback).

What discovery does

When you add an ArcGIS portal as a PORTAL source, discovery attempts to expand it into individual dataset URLs.

It uses two strategies, trying the sitemap first and falling back to the orgId lookup:

  1. Sitemap discovery

    • tries common sitemap paths (/sitemap.xml, /sitemap_index.xml)
    • extracts URLs that contain /datasets/

  2. OrgId fallback

    • fetches the portal HTML and tries to extract the portal's orgId
    • enumerates datasets via https://hub.arcgis.com/api/v3/datasets?filter[orgId]=...
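
The two strategies above can be sketched roughly as follows. This is an illustration only, not DataShield's actual implementation: the function names are hypothetical, the example.org URLs are placeholders, and the regular expression assumes the orgId appears in the portal HTML as a JSON-style "orgId":"..." pair, which may not hold for every portal. Fetching is left out so the parsing steps can be run on their own.

```python
import re
import xml.etree.ElementTree as ET
from urllib.parse import urlencode, urljoin

# Sitemap paths and API endpoint as named in this page.
SITEMAP_PATHS = ("/sitemap.xml", "/sitemap_index.xml")
HUB_DATASETS_API = "https://hub.arcgis.com/api/v3/datasets"

# Assumption: the portal HTML embeds the orgId as "orgId":"<value>".
ORG_ID_PATTERN = re.compile(r'"orgId"\s*:\s*"([A-Za-z0-9]+)"')


def candidate_sitemap_urls(portal_url):
    """Return the common sitemap URLs to try for a portal."""
    return [urljoin(portal_url, path) for path in SITEMAP_PATHS]


def dataset_urls_from_sitemap(xml_text):
    """Collect <loc> values that contain /datasets/ from sitemap XML."""
    root = ET.fromstring(xml_text)
    urls = []
    for element in root.iter():
        # Tags are namespaced, e.g. "{http://www.sitemaps.org/...}loc".
        local_name = element.tag.rsplit("}", 1)[-1]
        if local_name == "loc" and element.text and "/datasets/" in element.text:
            urls.append(element.text.strip())
    return urls


def org_id_from_html(html):
    """Return the first orgId found in the portal HTML, or None."""
    match = ORG_ID_PATTERN.search(html)
    return match.group(1) if match else None


def hub_datasets_url(org_id):
    """Build the Hub API URL that filters datasets by orgId."""
    query = urlencode({"filter[orgId]": org_id}, safe="[]")
    return f"{HUB_DATASETS_API}?{query}"
```

A caller would fetch each candidate sitemap URL, pass the response body to dataset_urls_from_sitemap, and only move on to org_id_from_html and hub_datasets_url if no dataset URLs were found.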

When to provide a sitemap URL

Some portals block generic crawling or hide sitemap routes.

If discovery returns no datasets, add a source of type SITEMAP pointing directly at a sitemap URL.
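
If you do not know where a portal keeps its sitemap, one place to look is its robots.txt file, which by convention may list sitemap locations in "Sitemap:" lines. The sketch below is a hypothetical helper for finding a URL to use in a SITEMAP source; it is not described as part of DataShield's discovery, and not every portal publishes these lines.

```python
from urllib.parse import urljoin


def robots_txt_url(portal_url):
    """Return the conventional robots.txt location for a portal."""
    return urljoin(portal_url, "/robots.txt")


def sitemap_urls_from_robots(robots_text):
    """Extract the URLs listed in 'Sitemap:' lines of a robots.txt body."""
    urls = []
    for line in robots_text.splitlines():
        # Split on the first colon only, so the URL's own "://" is preserved.
        key, separator, value = line.partition(":")
        if separator and key.strip().lower() == "sitemap" and value.strip():
            urls.append(value.strip())
    return urls
```

Any URL this returns can be opened in a browser to confirm it lists /datasets/ entries before you add it as a SITEMAP source.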