Textract

aws/ml aws/ai aws/service

💡 Definition

Amazon Textract is a machine learning service that automatically extracts text, handwriting, and data from scanned documents. It goes beyond simple optical character recognition (OCR) to identify, understand, and extract data from forms and tables.

🔑 Key Concepts

⚙️ How it Works

You upload a document (image or PDF). Textract analyzes it and returns the raw text plus structural information, such as which text is the value of which form field or which table cell it belongs to.
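To make "structural information" concrete, here is a minimal Python sketch of pairing form keys with their values. It assumes the response is a list of `Blocks` in which `KEY_VALUE_SET` blocks link to their value and to their `WORD` children through `Relationships`, as in Textract's form analysis output. The helper names (`form_fields`, `_text_of`) and the sample data are hypothetical and hand-written for illustration, not real service output.

```python
from typing import Dict, List


def _text_of(block: dict, by_id: Dict[str, dict]) -> str:
    """Join the text of the WORD blocks that a block lists as its children."""
    words: List[str] = []
    for rel in block.get("Relationships", []):
        if rel["Type"] == "CHILD":
            for child_id in rel["Ids"]:
                child = by_id[child_id]
                if child["BlockType"] == "WORD":
                    words.append(child["Text"])
    return " ".join(words)


def form_fields(response: dict) -> Dict[str, str]:
    """Pair each form KEY block with the text of its linked VALUE block."""
    by_id = {b["Id"]: b for b in response["Blocks"]}
    fields: Dict[str, str] = {}
    for block in response["Blocks"]:
        is_key = (block["BlockType"] == "KEY_VALUE_SET"
                  and "KEY" in block.get("EntityTypes", []))
        if not is_key:
            continue
        value_text = ""
        for rel in block.get("Relationships", []):
            if rel["Type"] == "VALUE":
                value_text = " ".join(_text_of(by_id[i], by_id) for i in rel["Ids"])
        fields[_text_of(block, by_id)] = value_text
    return fields


# Hand-written sample in the shape of a form analysis response (assumed, not real output).
sample_response = {
    "Blocks": [
        {"Id": "k1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["KEY"],
         "Relationships": [{"Type": "VALUE", "Ids": ["v1"]},
                           {"Type": "CHILD", "Ids": ["w1"]}]},
        {"Id": "v1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["VALUE"],
         "Relationships": [{"Type": "CHILD", "Ids": ["w2", "w3"]}]},
        {"Id": "w1", "BlockType": "WORD", "Text": "Name:"},
        {"Id": "w2", "BlockType": "WORD", "Text": "Jane"},
        {"Id": "w3", "BlockType": "WORD", "Text": "Doe"},
    ]
}

print(form_fields(sample_response))  # {'Name:': 'Jane Doe'}
```

In practice the response would come from a call such as boto3's `analyze_document` with `FeatureTypes=["FORMS"]`; the sketch leaves that call out so it runs without AWS credentials.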

🎯 Use Cases

💰 Pricing Model

📝 Exam Tips (CLF-C02)


See Also:
* Rekognition
* Comprehend