Computer Vision SpecializationΒΆ
πΌοΈ OverviewΒΆ
Master computer vision from classification to generative models!
Time: 2-3 months | 150-200 hours
Prerequisites: Phases 1-8 complete
Outcome: Build production CV applications
π What Youβll LearnΒΆ
Image classification (ResNet, Vision Transformers)
Object detection (YOLO, DETR)
Image embeddings (CLIP, DINO)
Semantic segmentation
Generative models (Stable Diffusion, DALL-E)
Multimodal AI (text + vision)
Video understanding
OCR and document AI
ποΈ Module StructureΒΆ
computer-vision/
βββ 00_START_HERE.ipynb
βββ 01_image_classification.ipynb
βββ 02_object_detection.ipynb
βββ 03_clip_embeddings.ipynb
βββ 04_stable_diffusion.ipynb
βββ 05_multimodal_rag.ipynb
βββ projects/
β βββ visual_search/
β βββ image_qa/
β βββ content_moderation/
βββ README.md
π― Key ProjectsΒΆ
Visual Search Engine - Find similar images using CLIP
Image Q&A System - Chat with images
Content Moderation - Classify safe/unsafe images
AI Art Generator - Creative tool with Stable Diffusion
Start here: 00_START_HERE.ipynb