Documentation
Getting started
Models

Models

Upstage is at the forefront of developing a suite of AI models tailored for diverse business needs, such as Solar LLM and Document AI with Upstage's mission to achieve AGI (Artificial General Intelligence) for work.

Solar LLM

Upstage Solar is a compact yet powerful large-language model (LLM).

ModelRelease dateContext lengthDescription
solar-pro
preview
2024-09-104096A more intelligent, instruction-following Solar LLM with IFEval 80+. The official version with expanded language support and longer context length will be released in November 2024. solar-pro supports English only at this time.
solar-pro is an alias for our latest Solar Pro model. (Currently solar-pro-preview-240910)
solar-docvision
preview
2024-09-108192A model specialized for Document Visual Question Answering (opens in a new tab). solar-docvision supports English only at this time.
solar-docvision is an alias for our latest Solar DocVision model. (Currently solar-docvision-preview-240910)
solar-1-mini-chat
beta
2024-06-1432768A compact LLM offering superior performance to GPT-3.5, with robust multilingual capabilities for both English and Korean, delivering high efficiency in a smaller package.
solar-1-mini-chat is an alias for our latest Solar Mini model. (Currently solar-1-mini-chat-240612)
solar-1-mini-chat-ja
beta
2024-06-1432768A compact LLM that extends the capabilities of Solar Mini with specialization in Japanese, while maintaining high efficiency and performance in English and Korean.
solar-1-mini-chat-ja is an alias for our latest Solar Mini ja model. (Currently solar-1-mini-chat-ja-240612)
solar-1-mini-translate-enko
beta
2024-05-2232768English-to-Korean translation specialized model based on the Solar Mini.
solar-1-mini-translate-enko is alias for our latest Solar Mini Translation enko model. (Currently solar-1-mini-translate-enko-240507)
solar-1-mini-translate-koen
beta
2024-05-2232768Korean-to-English translation specialized model based on the Solar Mini.
solar-1-mini-translate-koen is alias for our latest Solar Mini Translation koen model. (Currently solar-1-mini-translate-koen-240507)
solar-1-mini-groundedness-check
beta
2024-05-0232768Solar Mini based groundedness check model with a 32k context limit.
solar-1-mini-groundedness-check is alias for our latest Solar Mini Groundedness check model. (Currently solar-1-mini-groundedness-check-240502)

For details about the model architecture, see this paper (opens in a new tab).

Solar Embeddings

Embed any text with Solar Embeddings API.

ModelRelease dateContext LengthDescription
solar-embedding-1-large-query
beta
2024-05-104000Solar-based Query Embedding model with a 4k context limit. This model is optimized for embedding user's question in information-seeking tasks such as retrieval & reranking.
solar-embedding-1-large-passage
beta
2024-05-104000Solar-based Passage Embedding model with a 4k context limit. This model is optimized for embedding documents or texts to be searched.

Document Parse

Extract tables and figures from any document.

ModelAvailabilityRelease dateDescription
document-parseLatest2024-09-10Major update in API spec and changed the model name to document-parse. Support for Microsoft Word, Excel, and Powerpoint. Markdown output for tables and list items. Base64 encoding of extracted images for all requested layout categories.
document-parse is an alias for our latest Document Parse model. (Currently document-parse-240910)
layout-analysis-0.4.0 betaAvailable until Nov 10, 20242024-07-04Improved the accuracy for table recognition. Added new layout elements: heading1, list, index, and footnote. Changed the default value for ocr field to false
layout-analysis-0.3.1 betaDeprecated2024-06-17Fixed a bug where extracted text from table elements was truncated.
layout-analysis-0.3.0 betaDeprecated2024-06-11Improved the inference speed by 2x for digital-born PDF documents.
layout-analysis-0.2.1 betaDeprecated2024-05-02Removed unnecessary <thead> tags from table elements and fixed bugs.
layout-analyzer-0.2.0 betaDeprecated2024-04-04Improved the accuracy for table recognition and performance for layout detection.
layout-analyzer-0.1.0 betaDeprecated2024-02-28A layout analyzer model which detects elements within a document, recognizes tables, and serializes elements according to reading order.

Document OCR

Extract all text from any document.

ModelAvailabilityRelease dateDescription
ocr-2.2.1Latest2024-06-11Additional support for Japanese character set.
ocr-2.1.1Deprecated2024-04-04Improved text detection for single characters and special characters.
ocr-2.1.0Deprecated2024-02-28Additional support for Hanja, Hanzi and Kanji. Improved accuracy and performance.
ocr-1.0.0Deprecated2023-04-10An OCR model specialized for English and Korean. Resilient against real-world images, including wrinkled papers and rotated text.

Key Information Extraction

Extract key information from target documents.

ModelAvailabilityRelease dateDescription
air-waybill-extraction-4.1.6 betaLatest2024-06-11An extractor model for air waybill (AWB)
bill-of-lading-and-shipping-request-extraction-4.1.6 betaLatest2024-06-11An consolidated extractor model for Bill of lading (BL or BoL) and Shipping request (SR).
commercial-invoice-and-packing-list-extraction-4.1.6 betaLatest2024-06-11An consolidated extractor model for Commercial invoice (CI) and Packing list (PL).
kr-export-declaration-certificate-extraction-4.1.6 betaLatest2024-06-11An extractor model for Korea export declaration certificate.
receipt-extraction-3.2.0Latest2024-04-11Additional support for English. Improved accuracy and performance.
receipt-extractor-1.0.0Deprecated2023-04-11An extractor model for paper receipts, that include store descriptions and list of items. Works best for Korean receipts.