Birds 525 Quality Score 77: DataClinic Diagnosis Report

Metric	Birds 450 (#11)	Birds 525 (#116)	Change
Overall Score	Quality score 65 (Fair)	Quality score 77 (Fair)	+12 pts ↑
Classes	450 species	525 species	+75 species ↑
Total Images	75,100	89,880	+14,780 ↑
Avg per Class	150.6	161.2	+10.6 ↑
Class Balance Std Dev	15.7	20.6	-4.9 ↓
Min Resolution	170×196px	45×109px	Smaller
Max Resolution	224×224px	4,763×3,421px	Much more diverse ↑
L1 Statistics Grade	Poor	Good	Major improvement ↑
L2 Analysis Lens	BLIP (Multimodal)	Wolfram ImageIdentify Net V2	Lens changed
L3 Dimensions	41 dimensions	81 dimensions	2x more precise ↑
Commercial Use	Allowed ✓	Not allowed ✗	⚠️ Caution

Level 1: Basic Quality Check — The Decisive Difference from 450

Level 1 examines image integrity, missing values, class balance, and pixel statistics. This stage reveals the core reason why Birds 525 significantly outperforms Birds 450.

✅ Missing Values: None

No missing values were found across all 89,880 images. This is on par with Birds 450's equally clean record.

✅ Channel Composition: Consistent RGB Throughout

All 89,880 images across 525 classes share identical RGB channel composition. With no grayscale images or alpha channel contamination, preprocessing overhead is minimized.

✅ Class Balance: Good

The number of images per class ranges from a minimum of 130 to a maximum of 263, with a mean of 161.2 and standard deviation of 20.6. While the deviation is slightly higher than Birds 450 (mean 150.6, std dev 15.7), the bias risk remains low for large-scale multi-class classification tasks.

🌟 Image Resolution: The Decisive Difference from Birds 450

The most distinctive feature of Birds 525 is its resolution diversity:

Minimum: 45×109px (very small thumbnails)
Maximum: 4,763×3,421px (high-resolution professional photographs)

In contrast, Birds 450 had a minimum of 170×196px and a maximum of 224×224px, with nearly all images uniformly preprocessed.

💡 Why Does Preserving Original Resolution Matter?

Intuitively, uniform resolution seems convenient for AI training, but DataClinic's perspective differs. The key is not resolution size itself, but preserving the statistical diversity created by real-world shooting conditions. Birds 450, uniformly resized to 224px, lost the pixel-level variations from different cameras, distances, and lighting — resulting in a "Poor" L1 grade. The best practice is to resize during model input preprocessing while keeping the dataset itself at original resolution. This is the key L1 factor among the multiple contributors to the 12-point improvement, alongside expanded class diversity and more precise L3 analysis.

Level 2: The Bird World Through a General-Purpose AI Lens

Level 2 extracts features from all images using Wolfram ImageIdentify Net V2 (1,280 dimensions). Unlike Birds 450, which used BLIP (a multimodal language-image model) for L2, Birds 525 employs a pure vision model to analyze distributions based solely on visual features.

🔵 Two High-Density Clusters

PCA and density topography analysis reveal two high-density clusters. Interpreting these with domain knowledge:

Cluster A — Tropical and subtropical birds with vivid colors and distinctive forms. Visually striking species such as Peacock, Bird of Paradise, Emerald Cuckoo, and Crowned Crane
Cluster B — Brown and gray forest/grassland birds. Passeriformes (perching birds) with relatively similar appearances

This two-cluster structure was similarly observed in Birds 450. General-purpose AI models tend to first classify birds by the strongest visual signal: "colorful birds vs. plain birds."

L2 PCA — Birds 525 mean feature distribution by class

📊 Density Distribution Comparison: Birds 450 vs. 525

Comparing the L2 density distributions side by side reveals structural differences between the two datasets. Birds 450 (left) shows a peak density of 4.26 (COMMON POORWILL) with a narrow, concentrated range, while Birds 525 (right) shows a peak density of 0.34 (PEACOCK) with a broader, more dispersed range.

This difference indicates that the feature space expanded with the addition of 75 species. The lower density is not a negative signal — it is a positive sign that more species occupy the feature space more evenly.

Birds 450 L2 density · peak 4.26 · narrow distribution

Birds 525 L2 density · peak 0.34 · broad distribution

🔬 Per-Class Density Comparison: Crowned Crane vs. Firefinch

AFRICAN CROWNED CRANE density distribution

AFRICAN CROWNED CRANE — features concentrated due to distinctive crown shape

AFRICAN FIREFINCH — broad distribution typical of small passerines

Level 3: Bird-Specific Lens — More Precise with 81 Dimensions

Level 3 applies a domain-specific lens optimized to 81 dimensions based on Wolfram ImageIdentify Net V2, tailored for the Birds 525 dataset. Compared to Birds 450's L3 at 41 dimensions, Birds 525 uses a representation space that is 2x more precise.

🔵 Cluster Structure Preserved

The domain-specific lens (L3) maintains a two-cluster structure similar to L2. This demonstrates that the visual dichotomy of "colorful birds vs. plain birds" remains a core distinguishing criterion even in domain-specific models.

This contrasts with the Korean Food dataset (2 clusters in L2 → merged into 1 cluster in L3). With Korean food, the domain-specific lens highlights the commonality of "all being Korean food," whereas for birds, distinguishing between colorful and plain species remains visually meaningful even to domain experts.

📈 Density Distribution Comparison: 450 vs. 525 at L3

The L3 density comparison shows a similar pattern to L2. Birds 450 (left, 41 dimensions) has a concentrated density distribution, while Birds 525 (right, 81 dimensions) is more broadly spread.

Notably, the bell-shaped distribution remains stable despite the dimensionality doubling (41→81). This indicates that the structural integrity of the data is preserved even in the 81-dimensional space. However, embedding boundaries between some classes are not fully separated, earning a "Fair" consistency rating.

L3 density distribution — Birds 450 (41 dimensions)

Birds 450 L3 density · 41 dimensions · narrow distribution

L3 density distribution — Birds 525 (81 dimensions)

Birds 525 L3 density · 81 dimensions · broad distribution

Outlier Analysis — Why the Peacock Is the Most "Typical" Bird

🏆 High-Density Samples — The "Typical Birds" as Seen by AI

The PEACOCK overwhelmingly dominates the top high-density samples, with densities of 0.33-0.34. This contrasts with Birds 450, where COMMON POORWILL recorded the highest density.

Ironically, the Peacock is the most typical precisely because it is the most flamboyant:

Distinctive yet consistent visual pattern — The male's fan-shaped tail feathers are highly distinctive, yet Peacock photographs are remarkably similar to each other
Color vividness — The blue-green iridescent body and eye-patterned tail are visual patterns that AI models recognize strongly
Compositional consistency — Front-facing photographs with spread tails are overwhelmingly common, resulting in low feature vector variance

PEACOCK (High density)

density: 0.3402

PEACOCK (High density)

density: 0.3363

PEACOCK (High density)

density: 0.3332

PEACOCK (High density)

density: 0.3308

⚠️ Low-Density Outliers — Birds That Break the Mold

Low-density outliers include species that look dramatically different from "typical birds":

SNOWY SHEATHBILL (0.0529) — Nearly all-white with a sharp beak. Frequently photographed against snowy backgrounds, placing it at an extreme position in feature space
EMU (0.0532) — A large flightless bird. Its long neck and massive body create a silhouette entirely unlike typical birds
BIRD OF PARADISE (0.0541) — Males have extremely ornate and bizarre plumage. As flamboyant as Peacocks but with far more varied photography compositions
RED BILLED TROPICBIRD (0.0545) — Distinctive long tail feathers and white body, with many in-flight capture photographs

SNOWY SHEATHBILL

density: 0.0529 ⚠️

EMU

density: 0.0532 ⚠️

BIRD OF PARADISE

density: 0.0541 ⚠️

RED BILLED TROPICBIRD

density: 0.0545 ⚠️

↔️ Most Different Pair: OILBIRD vs. JACOBIN PIGEON

The most distant pair in feature space is OILBIRD and JACOBIN PIGEON. The Oilbird is a cave-dwelling nocturnal bird with a sharp beak and dark brown body, while the Jacobin Pigeon is an ornamentally bred pigeon with white plumage and a distinctive head crest. They are polar opposites in color, form, and habitat.

OILBIRD — reference image of the most distant pair

OILBIRD
(Cave-dwelling, nocturnal)

VS

JACOBIN PIGEON — most distant from OILBIRD

JACOBIN PIGEON
(Ornamental pigeon)

▲ The most distant pair in feature space. Polar opposites in color, form, and habitat.

Improvement Suggestions — Data Diet + Bulk-up Combined

DataClinic recommends combining Data Diet (duplicate removal) and Data Bulk-up (data augmentation) for this dataset. A quality score of 77 is a solid starting point, and applying both prescriptions together could push the score into the 85-90 range.

🥗 Data Diet — Cleaning Up Peacock Class Duplicates

The PEACOCK class is a classic Data Diet candidate with clustered high-density outliers. If front-facing spread-tail photographs are excessively repetitive, replacing them with varied angles, behaviors, and non-tail photographs would improve model robustness.

💉 Data Bulk-up — Reinforcing Outlier Classes

Classes that appeared as low-density outliers need data augmentation:

EMU — Add images from diverse shooting angles (side, front, group) and varied environments (grassland, farm)
BIRD OF PARADISE — Add various male courtship display scenes and female images to increase intra-class diversity
SNOWY SHEATHBILL — Add non-snowy environment photographs to reduce background bias

🎯 Key Takeaways

✅ Overall quality score: 77 (+12 over Birds 450, multiple factors)

✅ L1 statistics: Good (original resolution preserved) / L2·L3: broad feature space distribution

✅ Missing values: None / Channels: All RGB / L3 81 dimensions (2x more precise)

⚠️ Minimum resolution 45px — preprocessing standardization needed

⚠️ Commercial use not permitted — research and education only

📈 Data Diet + Bulk-up could target 85-90 score range

The full diagnosis results and detailed per-class analysis for all 525 classes are available at DataClinic Report #116.

Executive Summary

Birds 450 vs Birds 525 — What Changed

🔑 The Key to Score Improvement: Multiple Factors

⚠️ Important: Commercial Use Not Permitted

Dataset Overview — The World of 525 Bird Species

Level 1: Basic Quality Check — The Decisive Difference from 450

✅ Missing Values: None

✅ Channel Composition: Consistent RGB Throughout

✅ Class Balance: Good

🌟 Image Resolution: The Decisive Difference from Birds 450

Level 2: The Bird World Through a General-Purpose AI Lens

🔵 Two High-Density Clusters

📊 Density Distribution Comparison: Birds 450 vs. 525

🔬 Per-Class Density Comparison: Crowned Crane vs. Firefinch

Level 3: Bird-Specific Lens — More Precise with 81 Dimensions

🔵 Cluster Structure Preserved

📈 Density Distribution Comparison: 450 vs. 525 at L3

Outlier Analysis — Why the Peacock Is the Most "Typical" Bird

🏆 High-Density Samples — The "Typical Birds" as Seen by AI

⚠️ Low-Density Outliers — Birds That Break the Mold

↔️ Most Different Pair: OILBIRD vs. JACOBIN PIGEON

Improvement Suggestions — Data Diet + Bulk-up Combined

🥗 Data Diet — Cleaning Up Peacock Class Duplicates

💉 Data Bulk-up — Reinforcing Outlier Classes