What number and quality of images are needed for labeling and fine-tuning a pretrained model?

roman_dimakov · December 25, 2024, 4:55pm

Hello!

I am new to Label Studio and Layout Parser and would greatly appreciate your help with a couple of questions about image annotation for subsequent model fine-tuning (using scripts from this page: GitHub - Layout-Parser/layout-model-training: The scripts for training Detectron2-based Layout Models on popular layout analysis datasets).

How many images would be sufficient to label in Label Studio to fine-tune a pre-trained model (e.g., Faster R-CNN)?
My dataset contains approximately 15,000 images. ChatGPT suggests labeling 100–200 images initially and 500+ for better performance. In contrast, Copilot recommends labeling 2,000–3,000 images to start.
Do the quality of images and the number/types of labels per image affect the speed of model fine-tuning?
My images are in PNG format, RGB color space, and have a resolution of 1800x1200.

I look forward to your response!

Best regards,
Roman

Topic		Replies	Views
Can't to annotate images larger than 20 000x20 000 pixels Label Studio Support	1	93	October 23, 2024
Best Practices for Large-Scale Annotation Projects in Label Studio? General Discussion	0	122	February 20, 2025
Advice Needed for Handling Large Video Annotations in Label Studio Label Studio Support	0	151	September 19, 2024
Using Label Configurations with Multiple Images Slack Archivist	0	470	December 1, 2023
Applying a numerical label to an image Slack Archivist	0	210	August 28, 2023