This project implements an animal weight classification system using deep learning and traditional machine learning approaches. The system classifies animals into three weight categories (Underweight, Healthy, and Overweight) based on body proportions (body length) extracted from the AP-10K animal pose dataset.
## Table of Contents

- Project Overview
- Features
- Prerequisites
- Installation & Setup
- Dataset Download
- Project Structure
- Usage Instructions
- Model Architecture
- Results
- Troubleshooting
- Contributing
## Project Overview

This project uses two complementary approaches to animal weight classification:
- Traditional ML Approach: Histogram of Oriented Gradients (HOG) features + Random Forest
- Deep Learning Approach: Custom CNN with data augmentation
The classification is based on body length measurements calculated from animal keypoint annotations, creating proxy labels for weight categories.
## Features

- Multi-approach Classification: Both traditional ML and deep learning methods
- Robust Data Processing: Handles missing data and various image formats
- Comprehensive Evaluation: Detailed metrics, confusion matrices, and visualizations
- Custom CNN Architecture: Lightweight CNN model optimized for animal classification
- HOG Feature Extraction: Traditional computer vision features for comparison
- Data Augmentation: Improves model generalization with rotation, shifts, and flips
- Model Persistence: Save and load trained models with weights and architecture
## Prerequisites

- Python 3.8+
- Anaconda or Miniconda
- Jupyter Notebook
- 8GB+ RAM recommended
- GPU support optional but recommended for faster training
## Installation & Setup

Create and activate a conda environment:

```bash
conda create -n animal_classification python=3.9
conda activate animal_classification
conda install jupyter notebook
```
Clone this project and navigate to the project directory:

```bash
git clone https://github.com/Dkplucas/model.git
cd model
```

Install the dependencies, then launch Jupyter Notebook:

```bash
pip install -r requirements_ml.txt
jupyter notebook
```
## Dataset Download

1. Visit the official AP-10K Dataset repository.
2. Download the dataset following the instructions in the repository.
3. Extract the dataset to a directory outside your git repository (to avoid large-file issues), with the following structure:
```
data/
└── ap-10K/
    ├── annotations/
    │   ├── ap10k-train-split1.json
    │   ├── ap10k-train-split2.json
    │   ├── ap10k-train-split3.json
    │   ├── ap10k-val-split1.json
    │   ├── ap10k-val-split2.json
    │   ├── ap10k-val-split3.json
    │   ├── ap10k-test-split1.json
    │   ├── ap10k-test-split2.json
    │   └── ap10k-test-split3.json
    └── data/
        ├── 000000000001.jpg
        ├── 000000000002.jpg
        └── ... (all image files)
```
**Important:** Ensure the dataset is placed in the `data/ap-10K/` directory relative to your Jupyter notebook working directory.
## Project Structure

```
animal_classification/
├── dataprocess.py        # Data preprocessing and label creation
├── hogfeatures.py        # HOG feature extraction
├── model.py              # Deep learning model training
├── projection.py         # Additional analysis and projections
├── requirements_ml.txt   # Project dependencies
├── .gitignore            # Git ignore file for dataset exclusion
├── README.md             # This file
└── data/                 # Dataset directory (git-ignored; see Dataset Download)
    └── ap-10K/
        ├── annotations/  # JSON annotation files
        └── data/         # Image files
```
## Usage Instructions

### Step 1: Data Preprocessing

Open and run `dataprocess.py` in Jupyter Notebook:

```python
%run dataprocess.py
```

**What this does:**
- Loads AP-10K annotations from JSON files
- Calculates body length from keypoint coordinates (nose to tail distance)
- Creates weight classification labels (Underweight/Healthy/Overweight)
- Splits data into train/validation/test sets
- Saves processed data as CSV files
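
Since the proxy labels are the crux of the project, here is a minimal sketch of the idea, assuming AP-10K's keypoint ordering for the nose and root-of-tail points and illustrative thresholds; the actual indices and cutoffs live in `dataprocess.py`:

```python
import numpy as np

# Keypoint indices are assumptions based on the AP-10K keypoint order;
# verify against dataprocess.py before relying on them.
NOSE, TAIL_ROOT = 2, 4

def body_length(keypoints: np.ndarray) -> float:
    """Euclidean nose-to-tail distance from a (num_keypoints, 3) array."""
    nose, tail = keypoints[NOSE, :2], keypoints[TAIL_ROOT, :2]
    return float(np.linalg.norm(nose - tail))

def weight_label(length: float, low: float = 150.0, high: float = 300.0) -> str:
    """Bucket a body length into the three proxy classes (thresholds illustrative)."""
    if length < low:
        return "Underweight"
    if length > high:
        return "Overweight"
    return "Healthy"
```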
**Expected outputs:**

- `data/train_split.csv`
- `data/val_split.csv`
- `data/test_split.csv`
### Step 2: HOG Feature Extraction

Run `hogfeatures.py` in Jupyter Notebook:

```python
%run hogfeatures.py
```

**What this does:**
- Extracts Histogram of Oriented Gradients (HOG) features from images
- Processes images in batches for memory efficiency
- Handles various image path structures in the dataset
- Prepares traditional ML features for classification
**Expected outputs:**
- HOG feature arrays for train/validation/test sets
- Progress logs showing successful feature extractions
### Step 3: Deep Learning Model Training

Run `model.py` in Jupyter Notebook:

```python
%run model.py
```

**What this does:**
- Builds and trains the custom CNN model
- Implements data augmentation and callbacks
- Evaluates model performance on test set
- Generates confusion matrices and classification reports
- Saves trained model and training history
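
For orientation, a hedged sketch of what the augmentation and callbacks could look like; the parameter values are assumptions, and `model.py` may configure them differently:

```python
# Illustrative augmentation (rotation, shifts, flips, as listed under Features)
# and callbacks matching the output files below; all values are assumptions.
from tensorflow.keras.callbacks import CSVLogger, ModelCheckpoint
from tensorflow.keras.preprocessing.image import ImageDataGenerator

train_datagen = ImageDataGenerator(
    rescale=1.0 / 255,       # scale pixels to [0, 1]
    rotation_range=15,       # small random rotations
    width_shift_range=0.1,   # horizontal shifts
    height_shift_range=0.1,  # vertical shifts
    horizontal_flip=True,    # mirror flips
)

callbacks = [
    ModelCheckpoint("best_weights.weights.h5",
                    save_best_only=True, save_weights_only=True),
    CSVLogger("training_log.csv"),
]
```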
**Expected outputs:**

- `best_weights.weights.h5` - Best model weights saved during training
- `animal_weight_classifier_weights.h5` - Final model weights
- `animal_weight_classifier_architecture.json` - Model architecture
- `training_log.csv` - Training history
- `confusion_matrix.png` - Confusion matrix visualization
- `training_history.png` - Training/validation curves
## Model Architecture

### Deep Learning Approach (Custom CNN)

- Base Model: Custom CNN trained from scratch
- Input: 224×224×3 RGB images
- Architecture:
  - Conv2D(32 filters, 3×3 kernel, ReLU activation)
  - MaxPooling2D(2×2)
  - Conv2D(64 filters, 3×3 kernel, ReLU activation)
  - MaxPooling2D(2×2)
  - Conv2D(64 filters, 3×3 kernel, ReLU activation)
  - GlobalAveragePooling2D()
  - Dropout(0.5)
  - Dense(64, ReLU activation)
  - Dropout(0.3)
  - Dense(3)  # 3 output classes
- Optimizer: Adam (learning_rate=1e-4)
- Loss: Sparse Categorical Crossentropy
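
A minimal Keras sketch of this architecture; since the final `Dense(3)` layer has no activation, the loss is applied with `from_logits=True` (an assumption; `model.py` may instead use a softmax output):

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_model(num_classes: int = 3) -> tf.keras.Model:
    """Sketch of the CNN described above; details may differ from model.py."""
    model = models.Sequential([
        layers.Input(shape=(224, 224, 3)),
        layers.Conv2D(32, (3, 3), activation="relu"),
        layers.MaxPooling2D((2, 2)),
        layers.Conv2D(64, (3, 3), activation="relu"),
        layers.MaxPooling2D((2, 2)),
        layers.Conv2D(64, (3, 3), activation="relu"),
        layers.GlobalAveragePooling2D(),
        layers.Dropout(0.5),
        layers.Dense(64, activation="relu"),
        layers.Dropout(0.3),
        layers.Dense(num_classes),  # logits for the 3 weight classes
    ])
    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
        loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
        metrics=["accuracy"],
    )
    return model
```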
### Traditional ML Approach (HOG + Random Forest)

- Feature Extraction: HOG (Histogram of Oriented Gradients)
- Parameters:
  - 9 orientations
  - 16×16 pixels per cell
  - 2×2 cells per block
  - L2-Hys block normalization
- Classifier: Random Forest (can be extended to other classifiers)
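
A minimal sketch of this pipeline with scikit-image and scikit-learn, assuming grayscale conversion and a 224×224 working size (both assumptions; see `hogfeatures.py` for the actual preprocessing):

```python
import numpy as np
from skimage.color import rgb2gray
from skimage.feature import hog
from skimage.transform import resize
from sklearn.ensemble import RandomForestClassifier

def extract_hog(image: np.ndarray, size=(224, 224)) -> np.ndarray:
    """Return a 1-D HOG feature vector for one RGB image."""
    gray = rgb2gray(resize(image, size))
    return hog(
        gray,
        orientations=9,
        pixels_per_cell=(16, 16),
        cells_per_block=(2, 2),
        block_norm="L2-Hys",
    )

# Illustrative classifier; n_estimators is an assumed default.
clf = RandomForestClassifier(n_estimators=100, random_state=42)
# clf.fit(np.stack([extract_hog(img) for img in train_images]), train_labels)
```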
## Results

The evaluation pipeline provides:
- Classification metrics: Precision, Recall, F1-score for each class
- Confusion matrix: Visual representation of classification performance
- Training curves: Loss and accuracy over training epochs
- Class distribution analysis: Understanding of dataset balance
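
These artifacts are standard scikit-learn outputs; a small self-contained example with placeholder labels (real values would come from the trained model's predictions on the test set):

```python
import numpy as np
from sklearn.metrics import classification_report, confusion_matrix

# Placeholder labels for illustration only: 0=Underweight, 1=Healthy, 2=Overweight.
y_true = np.array([0, 1, 2, 1, 0, 2])
y_pred = np.array([0, 1, 1, 1, 0, 2])

print(classification_report(
    y_true, y_pred, target_names=["Underweight", "Healthy", "Overweight"]))
print(confusion_matrix(y_true, y_pred))
```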
## Troubleshooting

**"Image not found" errors:**
- Ensure the dataset is extracted to the correct `data/ap-10K/` directory
- Check that both annotation files and images are present

**Memory errors during training:**
- Reduce the batch size in `model.py` (the default is 8-16)
- Close other applications to free up RAM

**Slow training:**
- Consider GPU acceleration with tensorflow-gpu
- Reduce the image resolution if necessary

**Import errors:**
- Ensure all dependencies are installed: `pip install -r requirements_ml.txt`
- Activate the correct conda environment

**TensorFlow/Keras errors:**
- Update TensorFlow: `pip install tensorflow --upgrade`
- Check CUDA compatibility for GPU usage (see the check below)
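
To confirm TensorFlow can see a GPU at all, a quick check with the standard TensorFlow API:

```python
import tensorflow as tf

# An empty list means TensorFlow is running on CPU only.
print(tf.config.list_physical_devices("GPU"))
```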
Two dependencies have import names that differ from their install names:

```bash
# If the sklearn package fails, install scikit-learn instead:
pip install scikit-learn

# If the skimage package fails, install scikit-image instead:
pip install scikit-image
```
## Contributing

1. Fork the repository
2. Create a feature branch (`git checkout -b feature/AmazingFeature`)
3. Commit your changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request
## License

This project is for educational purposes. Please refer to the AP-10K dataset license for data usage terms.
## Acknowledgments

- AP-10K Dataset for providing the animal pose dataset
- TensorFlow and scikit-learn communities for the frameworks
**Note:** This project is designed for research and educational purposes. For production use, consider additional validation and testing on diverse datasets.