Intelligent Web Crawler API

Extract content, download files, and scrape data from any website with a simple API call. Powered by Puppeteer and built for developers.

100% open source · 8+ task types

Features

Content Extraction
Extract page content in HTML, Markdown, or plain text. Perfect for documentation, archiving, or content migration.
File Downloads
Download files from any webpage and receive them as base64-encoded data. Supports PDFs, ZIPs, documents, and more.
Link Extraction
Extract all hyperlinks from a page with their text and titles. Great for SEO analysis and site mapping.
Text Analysis
Count words, analyze headings, and extract structured text. Get detailed metrics about page content.
Screenshots
Capture full-page or viewport screenshots of any website. Returns images as base64-encoded data.
CSS Selectors
Find specific elements using CSS selectors. Extract precise data from any webpage structure.
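Selector-based extraction follows the same request shape as the other tasks. This page does not show the exact schema for CSS-selector requests, so the sketch below only builds a plausible JSON body; the idea of embedding the selector in the `instruction` string mirrors the Quick Start examples and is an assumption — check the full documentation for the real field names.

```python
import json

# Hypothetical request body for a CSS-selector crawl. The "instruction"
# wording ("select h1.title") is an assumption modeled on the other
# Quick Start instructions, not a documented value.
payload = {
    "url": "https://example.com",
    "instruction": "select h1.title",
}

# Serialize exactly as it would be sent with Content-Type: application/json.
body = json.dumps(payload)
print(body)
```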

Quick Start

cURL

```shell
# Extract page content
curl -X POST https://getobj.com/api/crawl \
  -H "Content-Type: application/json" \
  -d '{ "url": "https://example.com", "format": "markdown" }'

# Download files
curl -X POST https://getobj.com/api/crawl \
  -H "Content-Type: application/json" \
  -d '{ "url": "https://example.com", "instruction": "download files" }'
```
JavaScript (fetch)

```javascript
const response = await fetch('https://getobj.com/api/crawl', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    url: 'https://example.com',
    instruction: 'extract links'
  })
});

const data = await response.json();
console.log(data.result.links);
```
Python

```python
import requests

response = requests.post(
    'https://getobj.com/api/crawl',
    json={
        'url': 'https://example.com',
        'instruction': 'count word "example"'
    }
)

data = response.json()
print(data['result']['count'])
```
Node.js (axios)

```javascript
const axios = require('axios');

const response = await axios.post('https://getobj.com/api/crawl', {
  url: 'https://example.com',
  instruction: 'take screenshot'
});

const screenshot = response.data.result.screenshot; // Base64-encoded image data
```
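Both file downloads and screenshots come back as base64-encoded strings, so the last step is always decoding that field and writing the raw bytes to disk. A minimal sketch, using an inline PNG-header sample so it runs offline — in real use the input would be `data["result"]["screenshot"]` (or the corresponding file field) from the API response:

```python
import base64

# Stand-in for the API's base64 payload; real code would take this from
# data["result"]["screenshot"] in the JSON response.
sample_b64 = base64.b64encode(b"\x89PNG\r\n\x1a\n").decode("ascii")

# Decode the base64 string back into raw bytes and save them as a file.
with open("screenshot.png", "wb") as f:
    f.write(base64.b64decode(sample_b64))
```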
View Full Documentation