Using LLMs for Image Annotation
Overview

Image annotation has long been a challenging task, especially for domain-specific datasets that require accurate class assignments. Models such as Grounding DINO and SAM (Segment Anything Model) have made localization and segmentation far more efficient and accessible, yet assigning the correct classes for specific datasets remains a significant challenge.
This repository bridges that gap by leveraging LLMs to streamline and enhance the image annotation process.
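The core idea is that detectors localize objects while an LLM decides which dataset-specific class each crop belongs to. Below is a minimal sketch of that last step using the OpenAI Python client; it is an illustration, not the repository's exact implementation, and the class list, prompt, and file paths are hypothetical:

```python
import base64
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def classify_crop(crop_path: str, classes: list[str]) -> str:
    """Ask a vision-capable GPT model to pick one class for a detected crop."""
    with open(crop_path, "rb") as f:
        b64 = base64.b64encode(f.read()).decode()
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": f"Assign this crop to exactly one of: {', '.join(classes)}. "
                         "Answer with the class name only."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{b64}"}},
            ],
        }],
    )
    return response.choices[0].message.content.strip()

# Hypothetical usage: a crop produced by the detector, classes from your dataset
print(classify_crop("output/crop_0.png", ["ripe olive", "unripe olive"]))
```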
Installation

conda create --name myenv python=3.10
conda activate myenv
git clone https://github.com/mojaravscki/llmanotator
cd llmanotator
pip install -r requirements.txt
cd GroundingDINO
pip install -q -e .
cd ..
mkdir weights
cd weights
curl -L -o groundingdino_swint_ogc.pth https://huggingface.co/ShilongLiu/GroundingDINO/resolve/main/groundingdino_swint_ogc.pth
cd ..
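To sanity-check the checkpoint, you can run a quick zero-shot detection with the groundingdino package installed above. This is a sketch; the image path and text prompt are placeholders:

```python
from groundingdino.util.inference import load_model, load_image, predict

model = load_model(
    "GroundingDINO/groundingdino/config/GroundingDINO_SwinT_OGC.py",
    "weights/groundingdino_swint_ogc.pth",
)
image_source, image = load_image("input/example.jpg")  # placeholder image

# Text-prompted detection: boxes are normalized cxcywh, phrases are matched terms
boxes, logits, phrases = predict(
    model=model,
    image=image,
    caption="olive fruit",
    box_threshold=0.35,
    text_threshold=0.25,
)
print(len(boxes), "detections:", phrases)
```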
git clone https://github.com/facebookresearch/segment-anything.git
cd segment-anything
pip install -e .
cd ..
curl -L -o sam_vit_h_4b8939.pth https://dl.fbaipublicfiles.com/segment_anything/sam_vit_h_4b8939.pth
curl -L -o sam_vit_l_0b3195.pth https://dl.fbaipublicfiles.com/segment_anything/sam_vit_l_0b3195.pth
curl -L -o sam_vit_b_01ec64.pth https://dl.fbaipublicfiles.com/segment_anything/sam_vit_b_01ec64.pth
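With any of these checkpoints in place, segmentation from a detector's box works roughly as follows. This is a sketch using the segment-anything API; the image path and box coordinates are placeholders:

```python
import cv2
import numpy as np
from segment_anything import sam_model_registry, SamPredictor

# "vit_h" / "vit_l" / "vit_b" must match the checkpoint you downloaded
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

image = cv2.cvtColor(cv2.imread("input/example.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)

# Prompt SAM with a bounding box in pixel coordinates (x0, y0, x1, y1)
box = np.array([100, 100, 300, 300])  # placeholder box, e.g. from Grounding DINO
masks, scores, _ = predictor.predict(box=box, multimask_output=False)
print(masks.shape, scores)
```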
mkdir -p clip-vit-base-patch32
cd clip-vit-base-patch32
curl -L -o pytorch_model.bin https://huggingface.co/openai/clip-vit-base-patch32/resolve/main/pytorch_model.bin
curl -L -o flax_model.msgpack https://huggingface.co/openai/clip-vit-base-patch32/resolve/main/flax_model.msgpack
curl -L -o tf_model.h5 https://huggingface.co/openai/clip-vit-base-patch32/resolve/main/tf_model.h5
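Note that only the weights for your framework are strictly needed (with PyTorch, pytorch_model.bin alone suffices). If you load the model from this local folder with Hugging Face transformers, the config and tokenizer files are also required; they live in the same Hugging Face repository:

curl -L -o config.json https://huggingface.co/openai/clip-vit-base-patch32/resolve/main/config.json
curl -L -o preprocessor_config.json https://huggingface.co/openai/clip-vit-base-patch32/resolve/main/preprocessor_config.json
curl -L -o tokenizer_config.json https://huggingface.co/openai/clip-vit-base-patch32/resolve/main/tokenizer_config.json
curl -L -o vocab.json https://huggingface.co/openai/clip-vit-base-patch32/resolve/main/vocab.json
curl -L -o merges.txt https://huggingface.co/openai/clip-vit-base-patch32/resolve/main/merges.txt
curl -L -o special_tokens_map.json https://huggingface.co/openai/clip-vit-base-patch32/resolve/main/special_tokens_map.json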
cd ..
Usage

Run the pipeline from the repository root, replacing sk-OPEN_AI_API_KEY with your own OpenAI API key:

python gpt.py \
--config_file config.txt \
--reference_images_folder references/ \
--input_images_folder input/ \
--output_folder output/ \
--groundingdino_config GroundingDINO/groundingdino/config/GroundingDINO_SwinT_OGC.py \
--groundingdino_weights weights/groundingdino_swint_ogc.pth \
--clip_model_dir clip-vit-base-patch32/ \
--prompt_file prompt.txt \
--openai_key sk-OPEN_AI_API_KEY \
--use_lab \
--patch_width 150 \
--patch_height 150 \
--gpt_model "gpt-4o" \
--target_objects "olive fruit" \
--persistent
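A plausible role for --clip_model_dir and --reference_images_folder is matching detected patches against labeled reference images by embedding similarity. Below is a minimal sketch of that comparison with transformers, shown as an illustration of the technique rather than the script's exact logic; the patch and reference paths are hypothetical:

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("clip-vit-base-patch32/")
processor = CLIPProcessor.from_pretrained("clip-vit-base-patch32/")

def embed(path: str) -> torch.Tensor:
    """L2-normalized CLIP image embedding for one image file."""
    inputs = processor(images=Image.open(path), return_tensors="pt")
    with torch.no_grad():
        features = model.get_image_features(**inputs)
    return features / features.norm(dim=-1, keepdim=True)

# Cosine similarity between a detected patch and a labeled reference image
patch, reference = embed("output/patch_0.png"), embed("references/olive.jpg")
print(float(patch @ reference.T))
```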