V7 Go is powered by a selection of LLMs and foundation models. Let’s breakdown each of the available tools.

📘

OCR and audio extraction

Whenever text or audio are detected in an input file, V7 Go will automatically use an OCR (Optical Character Recognition) or AST (Automatic Speech Recognition) model to extract text, regardless of which AI Tool is selected. This means that audio and text extraction are implicit and optimized for minimal token usage.

Multimodal Models

GPT 4o

Developed by: OpenAI (check out OpenAI’s GPT 4 docs here)

Inputs: Text, Images, PDF documents, Audio

Outputs: Text, JSON, single-select, multi-select

Using GPT 4 in V7 Go: GPT 4 Omni is the most advanced of OpenAI’s language models available in V7 Go. It was trained across text, video, and audio, and it offers slight improvements over GPT 4 turbo in text tasks with increased speed and efficiency. Use GPT 4 Omni to perform complex text or multi-modal tasks where accuracy is paramount.

GPT 4o mini

Developed by: OpenAI (check out OpenAI’s GPT 4 docs here)

Inputs: Text, Images, PDF documents, Audio

Outputs: Text, JSON, single-select, multi-select

Using GPT 4 in V7 Go: GPT 4o mini is a lightweight version of GPT 4o. It has higher intelligence than GPT 3.5 Turbo while being 60% cheaper. Use GPT 4o mini for simple tasks that GPT 3.5 Turbo would have been used for previously.

GPT 4 Turbo

Developed by: OpenAI (check out OpenAI’s GPT 4 docs here)

Inputs: Text, Images, PDF documents, Audio

Outputs: Text, JSON, single-select, multi-select

Using GPT 4 in V7 Go: GPT 4 is OpenAI's second most powerful model behind GPT 4o. It has a 128k context window, and improved language understanding and more accurate responses over GPT 3.5 Turbo, but is less optimized for speed and efficiency. Use GPT 4 to perform complex text analysis as well as visual tasks where accuracy is paramount.

Gemini 1.5 Pro

Developed by: Google (check out Google's Gemini landing page here)

Inputs: Text, Images, PDF documents, Audio

Outputs: Text, JSON, single-select, multi-select

Using Gemini 1.5 Pro in V7 Go: Gemini 1.5 Pro is Google DeepMind’s Most advanced LLM. It is capable of advanced language understanding and multi-modal capability. Use Gemini pro to perform complex text analysis as well as visual tasks where accuracy is paramount.

Gemini 1.5 Flash

Developed by: Google (check out Google's Gemini landing page here)

Inputs: Text, Images, PDF documents, Audio

Outputs: Text, JSON, single-select, multi-select

Using Gemini 1.5 Pro in V7 Go: Gemini 1.5 Flash is the fastest and cheapest version of Gemini 1.5. Use Gemini 1.5 Flash perform complex text analysis as well as visual tasks where efficiency is paramount.

Claude 3 Opus

Developed by: Anthropic (Check out Anthropic's Claude langing page here)

Inputs: Text, Images, PDF documents

Outputs: Text, JSON, single-select, multi-select

Using Claude 3 Opus in V7 Go: Opus is Anthropic's highest performing model, and competes with or outperforms GPT 4 and Gemini Ultra on most text-based tasks. Opus can be used with text inputs as well as image data. Use Claude 3 Opus to perform complex text analysis as well as visual tasks where accuracy is paramount.

Claude 3 Sonnet

Developed by: Anthropic (Check out Anthropic's Claude langing page here)

Inputs: Text, Images, PDF documents

Outputs: Text, JSON, single-select, multi-select

Using Claude 3 Sonnet in V7 Go: Sonnet is the mid tier of Anthropic's models available on V7 Go and offers a balance between price and speed, compared to Claude 3 Opus and Haiku. It can be used with text inputs as well as image data. Use Claude 3 Sonnet to perform text analysis and visual tasks to optimise for both performance and Go Token cost.

Claude 3 Haiku

Developed by: Anthropic (Check out Anthropic's Claude langing page here)

Inputs: Text, Images, PDF documents

Outputs: Text, JSON, single-select, multi-select

Using Claude 3 Haiku in V7 Go: Haiku is the fastest and cheapest model of the Claude 3 family. Like Opus and Sonnet, Haiku is multimodal and can reason across text as well as images. Use Claude 3 Haiku where speed and Token cost are a priority.

Claude 3.5 Sonnet

Developed by: Anthropic (Check out Anthropic's Claude langing page here)

Inputs: Text, Images, PDF documents

Outputs: Text, JSON, single-select, multi-select

Using Claude 3 Haiku in V7 Go: Claude 3.5 Sonnet is the latest and most powerful model released by Anthropic with the speed and cost of their mid-tier model.

Text-only Models

Plug any of the models below into V7 Go properties to process, understand, and generate human-like text. Check out our prompting guide here to get the most out of each model.

GPT 3.5 Turbo

Developed by: OpenAI (check out OpenAI’s GPT 3.5 Turbo docs here)

Inputs: Text

Outputs: Text, JSON, single-select, multi-select

Using GPT 3.5 Turbo in V7 Go: Another of OpenAI’s language models, GPT 3.5 Turbo is a variant on GPT 3.5 optimized for speed and efficiency. While GPT 3.5 is still available on V7 Go, and can be used for lightweight tasks, it has since been made largely redundant by GPT 4o mini, which offers improved intelligence with greater efficiency.

Are we missing a tool? Let us know!