Start a new document scraping job by providing a starting website URL and a natural-language prompt describing the documents to find.
Required request body fields:

| Field | Type | Description |
|---|---|---|
| website | string | Starting URL to scrape (must be a valid HTTP/HTTPS URL) |
| prompt | string | Description of the documents to find (10-500 characters) |
Optional request body fields:

| Field | Type | Default | Description |
|---|---|---|---|
| single_page | boolean | true | Only scrape the provided URL (no navigation) |
| timeout | integer | 1800 | Maximum time in seconds (60-3600) |
| confidence_threshold | float | 0.1 | Minimum AI confidence score (0.0-1.0) |
| file_type | string | "document" | Type of files to extract |
| max_file_size_mb | integer | 100 | Maximum file size in MB (1-500) |
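The constraints in the tables above can be checked client-side before sending a request, so that inputs that would fail with a `400 validation_error` are caught early. The sketch below is an illustration, not part of the official API; the field names, defaults, and ranges come from the tables above, while the helper name is hypothetical.

```python
from urllib.parse import urlparse

# Defaults for the optional fields, as documented above.
DEFAULTS = {
    "single_page": True,
    "timeout": 1800,
    "confidence_threshold": 0.1,
    "file_type": "document",
    "max_file_size_mb": 100,
}

def validate_job_request(website, prompt, **options):
    """Build a full request payload, raising ValueError on invalid input.

    Hypothetical helper: mirrors the documented constraints client-side.
    """
    if urlparse(website).scheme not in ("http", "https"):
        raise ValueError("website must be a valid HTTP/HTTPS URL")
    if not 10 <= len(prompt) <= 500:
        raise ValueError("prompt must be 10-500 characters")

    payload = {**DEFAULTS, **options, "website": website, "prompt": prompt}
    if not 60 <= payload["timeout"] <= 3600:
        raise ValueError("timeout must be 60-3600 seconds")
    if not 0.0 <= payload["confidence_threshold"] <= 1.0:
        raise ValueError("confidence_threshold must be 0.0-1.0")
    if not 1 <= payload["max_file_size_mb"] <= 500:
        raise ValueError("max_file_size_mb must be 1-500")
    return payload
```

Unspecified optional fields fall back to their documented defaults, so the resulting payload always contains every field the endpoint accepts.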
Response fields:

| Field | Type | Description |
|---|---|---|
| job_id | string | Unique identifier for the created job |
| status | string | Initial job status (always `pending`) |
| message | string | Success message |
| estimated_completion | string | ISO 8601 estimated completion time |
| created_at | string | ISO 8601 job creation timestamp |
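Because `created_at` and `estimated_completion` are ISO 8601 strings, a client can parse them to estimate how long to wait before checking on the job. The response body below is illustrative sample data shaped like the table above, not output from a real API call.

```python
from datetime import datetime

# Sample success body matching the documented response fields
# (values are illustrative only).
body = {
    "job_id": "job_abc123",
    "status": "pending",
    "message": "Job created successfully",
    "estimated_completion": "2024-01-01T12:30:00+00:00",
    "created_at": "2024-01-01T12:00:00+00:00",
}

# Parse the ISO 8601 timestamps and compute the expected duration.
created = datetime.fromisoformat(body["created_at"])
eta = datetime.fromisoformat(body["estimated_completion"])
wait_seconds = (eta - created).total_seconds()
```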
Error responses:

| Status | Error Code | Description |
|---|---|---|
| 400 | validation_error | Invalid request parameters |
| 402 | insufficient_credits | Not enough credits |
| 429 | concurrency_limit_exceeded | Too many concurrent jobs |
| 503 | service_unavailable | Required services not configured |
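The error table distinguishes failures that may clear on their own (too many concurrent jobs, services unavailable) from caller errors that will fail identically on retry (bad parameters, no credits). A minimal retry-policy sketch based on those codes, assuming the error code is returned in the response body:

```python
# Transient error codes from the table above: retrying after a backoff
# may succeed without any change to the request.
RETRYABLE = {"concurrency_limit_exceeded", "service_unavailable"}

def should_retry(status: int, error_code: str) -> bool:
    """True when backing off and retrying the same request may succeed."""
    return status in (429, 503) and error_code in RETRYABLE
```

A `400` or `402` should instead be surfaced to the caller, since resending the identical request cannot succeed.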
Authentication requires an API key in the format `sk-xxxxxxxxxxxxx` or `sk_xxxxxxxxxxxxx`.
On success, the API confirms that the job was created; the response body is a JSON object containing the fields listed above.