Apache Nutch REST API
REST API for managing Apache Nutch crawl jobs, configurations, seed URL lists, database queries (CrawlDB and FetchDB), and data readers. Supports full crawl lifecycle management including inject, generate, fetch, parse, updatedb, and index operations. Secured via HTTP Basic Authentication.
Documentation
Specifications
Schemas & Data
JSONSchema
Nutch Config Schema
JSONSchema
Job Config Schema
JSONSchema
Job Info Schema
JSONSchema
Server Info Schema
JSONSchema
Seed List Schema
JSONSchema
DB Query Schema
Other Resources
NaftikoCapability
https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/capabilities/apache-nutch-admin.yaml
NaftikoCapability
https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/capabilities/apache-nutch-configuration.yaml
NaftikoCapability
https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/capabilities/apache-nutch-database.yaml
NaftikoCapability
https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/capabilities/apache-nutch-job.yaml
NaftikoCapability
https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/capabilities/apache-nutch-reader.yaml
NaftikoCapability
https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/capabilities/apache-nutch-seed.yaml
NaftikoCapability
https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/capabilities/apache-nutch-services.yaml