Skip to main content
Version: 2.0.0

LLMWhisperer MCP server

LLMWhisperer has support for MCP server which can be used to integrate the functionality of LLMWhisperer with a MCP host.

Features

Processes documents (PDF, DOCX, XLSX, CSV, images) using the LLMWhisperer API to extract layout-preserved text, supporting multiple processing modes and output formats.

Tools

extract_text - Submits a file to the LLMWhisperer API, polls for processing, and retrieves the extracted text. Supports advanced options for page selection, line splitting, and output formatting.

Configuration

Getting an API Key

To use the tool, one needs to set the LLMWHISPERER_API_KEY environment variable. For obtaining the key, please check the details under Getting Started.

Usage with Claude Desktop

One can integrate the LLMWhisperer tool with Claude Desktop by adding the following configuration to the claude_desktop_config.json replacing the values of LLMWHISPERER_API_KEY with the right values. Also, since the tool will need a file to be processed, the file needs to be mounted in the container. The sample below uses /tmp as the mount point. This can be changed to any other location.

Docker

{
"mcpServers": {
"extract_text": {
"command": "docker",
"args": [
"run",
"-i",
"--rm",
"-v",
"/tmp:/tmp",
"-e",
"LLMWHISPERER_API_KEY",
"unstract/mcp-server",
"llm_whisperer"
],
"env": {
"LLMWHISPERER_API_KEY": ""
}
}
}
}

Once the configuration is added, user can use extract_text as a tool to extract text from the supported documents.

Sample prompt

Extract text from the document /tmp/sample-bank_statement.pdf

This should invoke the tool and get the extracted text as the response.