About ImageSearcher

This repository aims to implement an Image Search engine powered by the CLIP model.

Published by

manuelfay

ImageSearcher

Leveraging CLIP to perform image search on personal pictures

This repository implements an image search engine for local photos, powered by the CLIP model. It handles complex natural-language queries with surprising precision. For more information, refer to the Medium blogpost here.

Pictures can additionally be classified by the people they portray, using the face_recognition library. Several filters are also available, letting you find group pictures, screenshots, etc.

Setup

In a Python 3.8+ virtual environment, install either from PIP or from source:

Installation from the PIP package:

pip install image-searcher

pip install face_recognition  # Optional to enable face features
pip install flask flask_cors  # Optional to enable a flask api

Installation from source

pip install -r dev_requirements.txt

pip install face_recognition  # Optional to enable face features
pip install flask flask_cors  # Optional to enable a flask api

Troubleshooting: If building wheels for dlib fails during the face_recognition installation, install the python3.8-dev package (or the python3.x-dev package matching your Python version), then recreate the virtual environment from scratch and rerun the installation commands above.

Usage

The library first computes the embeddings of all images one by one and stores them in a pickled dictionary for later reference. To also compute and store information about the people in each picture, enable the include_faces flag (note that this makes the indexing process up to 10x slower).

from image_searcher import Search

searcher = Search(image_dir_path="/home/manu/perso/ImageSearcher/data/", 
                  traverse=True, 
                  include_faces=False)
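
As a rough illustration of the caching idea described above (and not the library's actual storage code), the sketch below pickles a dictionary that maps image paths to embedding vectors and reloads it on a later run. The names `save_index`, `load_index`, and the file name `index.pkl` are hypothetical, and the embeddings are placeholder lists rather than real CLIP outputs.

```python
import pickle
import tempfile
from pathlib import Path


def save_index(index: dict, path: Path) -> None:
    """Serialize the path -> embedding dictionary to disk."""
    with path.open("wb") as f:
        pickle.dump(index, f)


def load_index(path: Path) -> dict:
    """Return the stored dictionary, or an empty one if nothing was saved yet."""
    if not path.exists():
        return {}
    with path.open("rb") as f:
        return pickle.load(f)


# Placeholder embeddings; a real index would hold CLIP feature vectors.
index = {"bird.jpg": [0.1, 0.9], "cat.jpg": [0.8, 0.2]}

with tempfile.TemporaryDirectory() as tmp:
    index_file = Path(tmp) / "index.pkl"  # hypothetical file name
    save_index(index, index_file)
    reloaded = load_index(index_file)
    # Only images missing from the stored index need to be embedded again.
    to_embed = [p for p in ["bird.jpg", "dog.jpg"] if p not in reloaded]
```

Keeping the index keyed by path is what would make later runs cheap under this scheme: only paths absent from the reloaded dictionary require a new embedding pass.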

Once the images have been indexed, the library can be used from Python as follows:

from image_searcher import Search

searcher = Search(image_dir_path="/home/manu/perso/ImageSearcher/data/", 
                  traverse=True, 
                  include_faces=False)

# Option 1: Pythonic API
from PIL import Image

ranked_images = searcher.rank_images("A photo of a bird.", n=5)
for image in ranked_images:
    Image.open(image.image_path).convert('RGB').show()

# Option 2: Launch Flask api from code
from image_searcher.api import run
run(searcher=searcher)
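
To give an intuition for what a call such as `rank_images` has to do, here is a minimal sketch of the general technique used with CLIP-style models: embed the query and the images in the same vector space, then sort images by cosine similarity to the query. This is an assumption about the approach, not the library's internal code; `cosine_similarity` and `rank` are hypothetical helpers, and the 2-D vectors stand in for real embeddings.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank(query_embedding, image_embeddings, n=5):
    """Return the n image paths most similar to the query, best first."""
    scored = [
        (cosine_similarity(query_embedding, emb), path)
        for path, emb in image_embeddings.items()
    ]
    scored.sort(reverse=True)
    return [path for _, path in scored[:n]]


# Toy 2-D vectors standing in for real CLIP embeddings.
image_embeddings = {
    "bird.jpg": [0.9, 0.1],
    "cat.jpg": [0.1, 0.9],
    "sky.jpg": [0.7, 0.3],
}
top = rank([1.0, 0.0], image_embeddings, n=2)
```

Because the image embeddings are computed once at indexing time, a query only costs one text embedding plus these similarity computations, which is consistent with queries being much faster than indexing.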

Using tags in the query

Adding tags at the end of the query (example: A bird singing #photo) will filter the search based on the tag list. Supported tags for the moment are:

#{category}: Amongst "screenshot", "drawing", "photo", "schema", "selfie"
#groups: Group pictures (more than 5 people)

To come is support for:

#dates: Filtering based on the time period
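
The tag syntax above can be illustrated with a small parsing sketch. The tag names come from the list above, but the helper `split_query_tags` and its behavior (ignoring unknown tags, accepting tags anywhere in the query) are assumptions made for illustration, not the library's own parser.

```python
import re

# Tag names taken from the list above; the parsing logic itself is an assumption.
CATEGORY_TAGS = {"screenshot", "drawing", "photo", "schema", "selfie"}
OTHER_TAGS = {"groups"}


def split_query_tags(query: str):
    """Split 'A bird singing #photo' into ('A bird singing', ['photo'])."""
    tags = re.findall(r"#(\w+)", query)
    # Remove every '#tag' token (and the whitespace before it) from the text.
    text = re.sub(r"\s*#\w+", "", query).strip()
    # Keep only tags this sketch knows about, in the order they were written.
    known = [t for t in tags if t in CATEGORY_TAGS | OTHER_TAGS]
    return text, known


text, tags = split_query_tags("A bird singing #photo")
```

In a design like this, the free-text part would go to the semantic search while the tag list restricts which indexed images are eligible.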

Running the local web interface and API

After having indexed the images of interest, the Flask application loads models once and serves both the API and a browser interface for navigating results.

Specify a Config YAML file:

image_dir_path: /home/manu/Downloads/facebook_logs/messages/inbox/
save_path: /home/manu/
traverse: true
include_faces: true
reindex: false
n: 42

port:
host:
debug:
threaded:

Start a server:

from image_searcher.api import run

# Option 1: Through a config file
run(config_path="path_to_config_file.yml")

# Option 2: Through an instantiated Search object
from image_searcher import Search

run(searcher=Search(image_dir_path="/home/manu/perso/ImageSearcher/data/", 
                    traverse=True, 
                    include_faces=False))

For local browser use, launch the application with Gunicorn from the repository root:

./venv/bin/gunicorn "api.run_flask_gunicorn:create_app('config.yaml')" \
    --name image_searcher \
    --bind 127.0.0.1:${GUNICORN_PORT:-5000} \
    --worker-tmp-dir /dev/shm \
    --workers=${GUNICORN_WORKERS:-2} \
    --threads=${GUNICORN_THREADS:-4} \
    --worker-class=gthread \
    --log-level=info \
    --log-file '-' \
    --timeout 30

Note: Increase the timeout parameter (in seconds) if a lot of new images are being indexed.

Use the web interface

Open http://127.0.0.1:5000/ in a browser after starting the server.

The web interface supports:

Semantic text search and supported tag shortcuts such as #photo, #selfie, and #groups.
A detail view for each result, including dimensions, file size, and the best available date metadata.
Face-similarity navigation when include_faces: true.
Related photos from the same Messenger conversation when the collection follows an inbox/<conversation>/photos/ export layout.
Related photos captured or shared at nearby timestamps.
Picture-by-picture history navigation with a back button while following related images.
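
As an illustration of the `inbox/<conversation>/photos/` layout mentioned in the list above, the following sketch extracts the conversation folder name from an image path so that two images can be compared. The helper `conversation_of` and the example paths are hypothetical; the actual grouping logic in the application may differ.

```python
from pathlib import PurePosixPath
from typing import Optional


def conversation_of(image_path: str) -> Optional[str]:
    """Return <conversation> for paths shaped like .../inbox/<conversation>/photos/<file>."""
    parts = PurePosixPath(image_path).parts
    for i, part in enumerate(parts):
        # Need 'inbox', then a conversation folder, then 'photos', then the file.
        if part == "inbox" and i + 3 < len(parts) and parts[i + 2] == "photos":
            return parts[i + 1]
    return None


a = conversation_of("/exports/messages/inbox/alice_123/photos/img_1.jpg")
b = conversation_of("/exports/messages/inbox/alice_123/photos/img_2.jpg")
c = conversation_of("/home/me/pictures/holiday.jpg")
same_conversation = a is not None and a == b
```

Images outside such an export layout return `None` here, so they would simply have no conversation-based neighbours.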

For timestamp metadata, Messenger exports are read from adjacent message_*.json files when available. For regular image folders, EXIF capture dates are used when present; otherwise the filesystem modification date is displayed as a fallback.
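
The fallback order described above can be sketched as a small function that returns the first available timestamp together with its source. Everything here is illustrative: `best_available_date` is a hypothetical helper, the Messenger and EXIF values are passed in as already-parsed Unix timestamps rather than read from `message_*.json` or image metadata, and preferring Messenger data over EXIF is an assumption.

```python
import os
import tempfile
from datetime import datetime, timezone
from typing import Optional, Tuple


def best_available_date(
    image_path: str,
    messenger_timestamp: Optional[float] = None,
    exif_timestamp: Optional[float] = None,
) -> Tuple[datetime, str]:
    """Pick the first available timestamp and report which source it came from."""
    if messenger_timestamp is not None:
        return datetime.fromtimestamp(messenger_timestamp, tz=timezone.utc), "messenger"
    if exif_timestamp is not None:
        return datetime.fromtimestamp(exif_timestamp, tz=timezone.utc), "exif"
    # Last resort: the filesystem modification time.
    mtime = os.path.getmtime(image_path)
    return datetime.fromtimestamp(mtime, tz=timezone.utc), "filesystem"


with tempfile.NamedTemporaryFile(suffix=".jpg") as tmp:
    date, source = best_available_date(tmp.name)
    exif_date, exif_source = best_available_date(tmp.name, exif_timestamp=0.0)
```

Returning the source alongside the date lets an interface show when a date is only a filesystem fallback and therefore less trustworthy as a capture time.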

Query the API

Through the search endpoint: http://127.0.0.1:5000/get_best_images?q=a+photo+of+a+bird
For contextual metadata and related photos: http://127.0.0.1:5000/image_context?path=/absolute/path/to/an/image.jpg

In Python:


import requests
import json
import urllib.parse

query = "a photo of a bird"
r = requests.get(f"http://127.0.0.1:5000/get_best_images?q={urllib.parse.quote(query)}")
print(json.loads(r.content)["results"])
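
When a query contains a tag, the `#` character must be percent-encoded, otherwise everything after it is treated as a URL fragment and is not sent to the server. The sketch below builds URLs for the two endpoints listed above using only the standard library. `build_url` and `BASE_URL` are hypothetical names, and the base address assumes the default local bind shown earlier.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

BASE_URL = "http://127.0.0.1:5000"  # assumes the default local bind address


def build_url(endpoint: str, **params: str) -> str:
    """Build an API URL with percent-encoded query parameters."""
    return f"{BASE_URL}/{endpoint}?{urlencode(params)}"


search_url = build_url("get_best_images", q="a photo of a bird #photo")
context_url = build_url("image_context", path="/absolute/path/to/an/image.jpg")

# Decoding the query string recovers the original text, tag included.
decoded_query = parse_qs(urlsplit(search_url).query)["q"][0]
```

The resulting strings can be passed directly to `requests.get`, as in the example above.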

Tips

Using this tool on vacation photos or on Messenger and WhatsApp photo archives is a great way to rediscover old photos and to locate long-lost ones.

Tests

Run the tests with:

python -m unittest

The local web interface route tests can be run with:

python -m unittest tests.test_api.test_web_interface

and lint with:

pylint image_searcher

Contributing

This repo is a recently started work in progress. As is, it indexes about 10 images per second during the initial indexing phase, and answers queries almost instantaneously afterwards.

Feature requests and contributions are welcome. Improvements to the search web interface would also be greatly appreciated!

Todo list

Simplify the Search class instantiation and make it more robust:

Check indexation arguments are compatible with pre-loaded file
Store indexation arguments in pre-loaded file and give option to index new pictures with these options
Add the option to index for faces on previously CLIP indexed images

Speed:

Parallel indexation / dynamic batching based on image size
Data loader before indexation
Faster vector similarity computation with an optimized engine (FAISS)

Features:

Image auto-tagging (screenshot, drawing, photo, nature, group picture, selfie, etc)
Image deduplication (perceptual hashing)

Embedding files:

Integrate with local version control (git-lfs?)

Frontend:

Enable Image upload

Deployment:

Dockerize and orchestrate containers (image uploader, storage, indexation pipeline, inference)
