Skip to content

Overview

Embedding Studio

version Python 3.9 CUDA 11.7.1 Docker Compose Version

WebsiteDocumentationChallenges & SolutionsUse Cases

Embedding Studio is an innovative open-source framework designed to seamlessly convert a combined embedding model and vector database into a comprehensive search engine. With built-in functionalities for clickstream collection, continuous improvement of search experiences, and automatic adaptation of the embedding model, it offers an out-of-the-box solution for a full-cycle search engine.

Our framework enables you to continuously fine-tune your model based on user experience, allowing you to form search results for user queries faster and more accurately.

RED: On the graph, typical search solutions without enhancements, such as Full Text Searching (FTS), Nearest Neighbor Search (NNS), and others, are marked in red. Without the use of additional tools, the search quality remains unchanged over time.

ORANGE: In yellow, solutions are depicted that accumulate some feedback (clicks, reviews, votes, discussions, etc.) and then initiate a full model retraining. The primary issue with these solutions is that full model retraining is a time-consuming and expensive procedure, thus lacking reactive adjustments (for example, when a product suddenly experiences increased demand, and the search system has not yet adapted to it).

INDIGO: We propose a solution that allows collecting user feedback and rapidly retraining the model on the difference between the old and new versions. This enables a smoother and more relevant search quality curve for your system.

Features

  • Turn your vector database into a full-cycle search engine
  • Collect users feedback like clickstream
  • (*) Improve search experience on-the-fly without frustrating wait times
  • (*) Monitor your search quality
  • Improve your embedding model through an iterative metric fine-tuning procedure
  • (*) Use the new version of the embedding model for inference

(*) - features in development

Embedding Studio is highly customizable, so you can bring your own:

  • Data source
  • Vector database
  • Clickstream database
  • Embedding model

How it works

Follow our tutorial to get acquainted with the main functions.