Bootstrapping Audio Embeddings from Multimodal LLMs
Turn any multimodal LLM into a small audio embedding model that beats CLAP with 25x less data.
A tiny transformer that fingerprints embedding models by reading raw numerical digits. No feature engineering.
Two sub-1B multilingual embeddings with best-in-class performance, available on Elastic Inference Service, llama.cpp, and MLX.
New 2B vision language model achieves SOTA on multilingual VQA with no catastrophic forgetting on text-only tasks.
New 0.6B-parameter listwise reranker that considers the query and all candidate documents in a single context window.
Embedding models aren't the most glamorous aspect of the AI industry, but image generators and chatbots couldn't exist without them.
We brought multimodal embeddings to llama.cpp and GGUF, and uncovered a few surprising issues along the way.
Code generation LLMs → code embeddings: 0.5B/1.5B models achieve SOTA performance across 25 code retrieval benchmarks.
Jina MCP streamlines agent development by connecting our APIs to any LLM, reducing custom code and improving workflow reliability.
4000 tokens/sec for a 3B-parameter embedding model on L4 GPU is probably as fast as you'll get with llama.cpp. Or is it?
Sharing what we saw and learned at SIGIR 2025, feat. CLIP-AdaM, RE-AdaptIR and evaluations for LLM-based retrieval systems.
Image resolution is crucial for embedding visually rich documents. Too small and models miss key details; too large and they can't connect the parts.
Press
JinaVDR is a new benchmark spanning 95 tasks across 20 languages for visual document retrieval, soon on MTEB.
Tech Blog
While others rely on prompt tuning and hope for the best, submodular optimization gives you a principled framework with theoretical guarantees for better context engineering.
Tech Blog
Many know the importance of query diversity in DeepResearch, but few know how to solve it rigorously via submodular optimization.
Tech Blog
Quantization gives smaller embeddings. We show that fine-tuned quantization gives you near-lossless embeddings, too.
Press
Jina Embeddings v4 is a 3.8 billion parameter universal embedding model for multimodal and multilingual retrieval that supports both single-vector and multi-vector embedding outputs.
Tech Blog
As serious as we are about MTEB, we also love vibe-testing. Correlations is a simple GUI we use for validating citations in DeepSearch, debugging late chunking, and vibe-testing embeddings. Now it's open-source.
Events
We collect some of the most interesting papers from ICLR 2025, featuring TIPS, FlexPrefill, Zero-Shot Rerankers, SVD-LLM, Hymba, and more.
Tech Blog
Text similarity: 0.7. Image similarity: 0.5. Which document is more relevant? You literally cannot tell—and that's the core problem breaking multimodal search. We solve it with unified reranking.
Tech Blog
Boost robustness and performance with model soups: averaging weights. No extra cost, better results.
Tech Blog
Size bias refers to how the length of text inputs affects similarity, regardless of semantic relevance. It explains why search systems sometimes return long, barely-relevant documents instead of shorter, more precise matches to your query.
Press
Introducing jina-reranker-m0, our new multilingual multimodal reranker for retrieving visual documents, with SOTA performance on multilingual long-document and code search tasks.
Tech Blog
Standard LLM or reasoning model: which is better for DeepSearch? In this post, we explore using DeepSeek-R1 in our DeepSearch implementation to choose the next action.