ComputerVision

CV team project (A+)

Period:

2023.10.10 ~ 2023.11.20

Problem Description

Improve fine-grained classification accuracy by modifying the model.
Fine-grained dataset: a dataset whose categories are finely subdivided, so that classes differ only in subtle details. Such datasets are used for tasks such as object recognition, image classification, segmentation, and attribute recognition.

Baseline Code

Best Accuracy: 92.9054%
Test Loss: 0.3288, Accuracy: 90.2685%
Elapsed Time: 22m 30s

Experiments

1. Augmentation - Geometric Transformation + Visual Corruptions

Geometric Transformation: transformations of position, size, rotation, and symmetry (translation, scaling, rotation, flipping).
-> Apply horizontal and vertical flips (left-right and top-bottom mirroring) and rotation.
Visual Corruptions:
Real-world images are exposed to various kinds of environmental noise, and noisy images degrade model performance.
We aim to make the model more robust by training with visual corruptions, i.e. disturbances such as noise and blur.
-> Gaussian noise, contrast, brightness
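The two augmentation families above can be illustrated with a small dependency-free sketch. It is not the code from the notebooks: it assumes a grayscale image stored as a nested list of floats in [0, 1], and the function names and the parameter values (`sigma`, `delta`, `factor`, `p`) are hypothetical choices made only for illustration.

```python
import random


def hflip(img):
    """Left-right mirroring: reverse every row."""
    return [row[::-1] for row in img]


def vflip(img):
    """Top-bottom mirroring: reverse the order of the rows."""
    return [row[:] for row in img[::-1]]


def rotate90(img):
    """Rotate the image 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]


def clip(value):
    """Keep a pixel value inside the valid [0, 1] range."""
    return max(0.0, min(1.0, value))


def gaussian_noise(img, sigma, rng):
    """Add zero-mean Gaussian noise with standard deviation sigma."""
    return [[clip(p + rng.gauss(0.0, sigma)) for p in row] for row in img]


def brightness(img, delta):
    """Shift every pixel by delta."""
    return [[clip(p + delta) for p in row] for row in img]


def contrast(img, factor):
    """Scale each pixel's distance from the mean intensity by factor."""
    mean = sum(sum(row) for row in img) / (len(img) * len(img[0]))
    return [[clip(mean + factor * (p - mean)) for p in row] for row in img]


def augment(img, rng, p=0.5):
    """Apply each geometric transform with probability p, then corruptions."""
    if rng.random() < p:
        img = hflip(img)
    if rng.random() < p:
        img = vflip(img)
    if rng.random() < p:
        img = rotate90(img)
    img = gaussian_noise(img, sigma=0.05, rng=rng)  # hypothetical strength
    img = brightness(img, delta=rng.uniform(-0.1, 0.1))
    img = contrast(img, factor=rng.uniform(0.8, 1.2))
    return img
```

In practice an image library would do this on tensors; the sketch only shows what each operation does to the pixel grid.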

2. Optimizer - AdamP

An optimizer that suppresses the excessive weight-norm growth caused by momentum: it uses a projection to remove the component of the update that is parallel to the weight direction.
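The projection idea can be sketched in a few lines. This shows only the projection step, not the full AdamP optimizer (which also decides when the projection should be applied); the function name `project_out_radial` and the `eps` constant are assumptions for illustration.

```python
def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def project_out_radial(update, weight, eps=1e-8):
    """Remove the component of `update` that is parallel to `weight`.

    The result is orthogonal to the weight vector, so a step along it
    changes the direction of the weights without directly pushing their
    norm outward.
    """
    scale = dot(weight, update) / (dot(weight, weight) + eps)
    return [u - scale * w for u, w in zip(update, weight)]


weight = [3.0, 4.0]
update = [1.0, 2.0]
projected = project_out_radial(update, weight)
# `projected` is (numerically) orthogonal to `weight`.
```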

3. Hyperparameters (BATCH_SIZE, EPOCH, lr)

Smaller batch sizes gave better performance.
Training for more epochs gave better performance.
Lower learning rates gave better performance.

4. Scheduler: get_cosine_schedule_with_warmup

The learning rate first increases linearly over a warm-up period of the specified length (three times the length of train_loader).
After the warm-up, the scheduler lowers the optimizer's learning rate along a cosine curve.
-> This helps training escape saddle points quickly.
Plateaus that occur in the middle of training can also be passed through quickly.
The goal is to maximize the model's generalization performance.
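The shape of such a schedule can be written as a plain function of the step count. The sketch below computes the multiplier applied to the base learning rate for a single half-cosine cycle; it is a stand-alone illustration rather than the library implementation, and `steps_per_epoch` and `num_epochs` are hypothetical values, not the ones used in this project.

```python
import math


def cosine_warmup_multiplier(step, num_warmup_steps, num_training_steps):
    """Learning-rate multiplier: linear warm-up, then cosine decay to 0."""
    if step < num_warmup_steps:
        # Linear ramp from 0 to 1 during warm-up.
        return step / max(1, num_warmup_steps)
    # Fraction of the post-warm-up phase that has been completed.
    progress = (step - num_warmup_steps) / max(
        1, num_training_steps - num_warmup_steps
    )
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


steps_per_epoch = 100                    # hypothetical len(train_loader)
num_epochs = 30                          # hypothetical
warmup_steps = 3 * steps_per_epoch       # "three times the train_loader"
total_steps = num_epochs * steps_per_epoch

schedule = [
    cosine_warmup_multiplier(s, warmup_steps, total_steps)
    for s in range(total_steps + 1)
]
```

The multiplier peaks at 1.0 exactly when the warm-up ends and reaches 0 at the final step.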

5. Label Smoothing

A method that reduces overconfidence in a deep learning model's predictions by softening the hard label (a one-hot vector with 1 at the correct class index and 0 everywhere else).
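A minimal sketch of the softening step, using the common formulation in which a fraction `epsilon` of the probability mass is spread uniformly over all classes. The function name and the value `epsilon=0.1` are assumptions for illustration; the smoothing factor actually used in the notebooks is not stated here.

```python
def smooth_labels(target_index, num_classes, epsilon=0.1):
    """Turn a class index into a smoothed target distribution.

    The one-hot vector is scaled by (1 - epsilon) and epsilon / num_classes
    is added to every class, so the entries still sum to 1.
    """
    uniform = epsilon / num_classes
    return [
        (1.0 - epsilon) + uniform if i == target_index else uniform
        for i in range(num_classes)
    ]


hard = smooth_labels(2, 4, epsilon=0.0)   # plain one-hot vector
soft = smooth_labels(2, 4, epsilon=0.1)   # correct class gets 0.925, others 0.025
```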

Model

EfficientNet-B0 model

There are three ways to scale a network to improve performance, which EfficientNet combines in "Compound Scaling":
1. Increase the depth of the network (increase the number of layers)
2. Increase channel width (increase the number of filters)
3. Increase the resolution of the input image
EfficientNet searches for the best balance among these three dimensions and scales all of them together, achieving strong performance with fewer FLOPs (amount of computation) than conventional models.
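Compound scaling can be made concrete with a short calculation. The sketch below uses the coefficients reported for the EfficientNet family (depth 1.2, width 1.1, resolution 1.15 per unit of the compound coefficient phi), with B0 as the phi = 0 baseline. The function names are hypothetical, and the calculation only illustrates the scaling rule, not how the released models were rounded to actual layer counts.

```python
ALPHA = 1.2   # depth multiplier per unit of phi
BETA = 1.1    # width multiplier per unit of phi
GAMMA = 1.15  # resolution multiplier per unit of phi


def compound_scale(phi):
    """Return (depth, width, resolution) multipliers relative to B0."""
    return ALPHA ** phi, BETA ** phi, GAMMA ** phi


def flops_factor(phi):
    """Approximate FLOPs growth: linear in depth, quadratic in width and resolution."""
    depth, width, resolution = compound_scale(phi)
    return depth * width ** 2 * resolution ** 2


for phi in range(4):
    depth, width, resolution = compound_scale(phi)
    print(
        f"phi={phi}: depth x{depth:.2f}, width x{width:.2f}, "
        f"resolution x{resolution:.2f}, FLOPs x{flops_factor(phi):.2f}"
    )
```

Because ALPHA * BETA^2 * GAMMA^2 is close to 2, each increment of phi roughly doubles the computation.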

Final Result

[FINAL]

Test Loss: 0.3447, Accuracy: 94.6309%
Best Accuracy: 94.5946%
Elapsed Time: 2h 12m 26s

Repository files (52 commits)

CV_AL(DA2).ipynb
CV_AL1(Data_Augmentation_1).ipynb
CV_AL2_EfficientNet(2)_1.ipynb
CV_AL2_EfficientNet(3).ipynb
CV_AL2_EfficientNet_final.ipynb
CV_AL2_EfficientNet_final2.ipynb
CV_AL2_EfficientNetv2_2.ipynb
CV_AL2_EfficientNetv2_3.ipynb
CV_AL2_EfficientNetv2_4.ipynb
CV_AL2_EfficientNetv2_5.ipynb
CV_AL2_FINAL CODE.ipynb
CV_최종1(mixup제거).ipynb
CV_최종1.ipynb
CV_최종2(g_v).ipynb
CV_최종3(g_v+mixup).ipynb
Mixup Augmentation+RAdam.ipynb
README.md
RandAug+g_v+SGDP.ipynb
RandAugment(2,8).ipynb
RandAugment(N=5,M=3).ipynb
RandAugment+다른거 추가.ipynb
RandAugment+증강기법 추가+AdamP.ipynb
RandAugment2(3,5).ipynb
RandAugmentation
randaugment.ipynb
