CTG Neural Network Classifier 📌

The task was to create and train a neural network that classifies the fetal condition from cardiotocography (CTG) examination data. At most 60% of the data may be used for training, with the remaining 40% held out for testing.

🎯 The goal was to:

Achieve a classification accuracy greater than 92% on test data.
Validate the trained network over repeated runs with randomly split data (five repetitions in my case).
Compare at least three different network structures.
Perform testing on samples from each disease category using the best-trained network.

Content of the documentation ⬇️

🔁 Description of Input and Output Data

INPUT DATA

The data comes from the file CTGdata.mat and includes 25 parameters derived from signals measured during a cardiotocography (CTG) examination.

% Loading data from the file
data = load('CTGdata.mat');

These parameters are stored in the variable NDATA and serve as inputs for the neural network.

OUTPUT DATA

The variable typ_ochorenia (Slovak for "disease type") holds the class label of each sample, one of three groups:

1 = Normal condition
2 = Suspect condition
3 = Pathological condition

targets = dummyvar(data.typ_ochorenia);      
% Creating a binary representation of categorical values

These groups are transformed into binary representation (one-hot encoding) using the dummyvar function before being used in the neural network.
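As a minimal sketch of what one-hot encoding does, the snippet below builds the same kind of binary matrix by hand in base MATLAB. The label vector is a hypothetical example, not data from CTGdata.mat; for integer labels 1 to 3 the result should match the layout that dummyvar produces (one column per group).

```matlab
% Hypothetical class labels for four samples (1 = Normal, 2 = Suspect, 3 = Pathological)
labels = [1; 2; 3; 1];

% Each row of the identity matrix is the one-hot code of one class,
% so indexing its rows by the labels yields one encoded row per sample
identity = eye(3);
targets = identity(labels, :);

disp(targets);
```

Each row contains a single 1 in the column of its class, which is the target format a three-neuron output layer is trained against.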

DATA SPLITTING INTO TRAINING AND TESTING

Setting parameters for random data splitting into training and testing sets:

net.divideFcn = 'dividerand';  
net.divideParam.trainRatio = 0.6;                % 60% training data
net.divideParam.valRatio = 0;  
net.divideParam.testRatio = 0.4;                 % 40% test data
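To illustrate the idea behind a random 60/40 split, here is a small sketch in base MATLAB. It is not the internal implementation of dividerand, only an assumption-level illustration; the sample count and variable names are hypothetical.

```matlab
% Hypothetical number of samples (the real count comes from CTGdata.mat)
n_samples = 10;
train_ratio = 0.6;

% Shuffle the sample indices and cut the permutation at the 60% mark
n_train = round(train_ratio * n_samples);
perm = randperm(n_samples);
train_idx = perm(1:n_train);
test_idx = perm(n_train + 1:end);

fprintf('Training samples: %d, test samples: %d\n', numel(train_idx), numel(test_idx));
```

Every sample ends up in exactly one of the two sets, and a different permutation on each run gives a different split, which is what makes repeated runs a meaningful validation.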

📄 MLP Network Structure

Inputs:

The network receives 25 input parameters, stored in the variable NDATA. These parameters are derived from measured signals from cardiotocography (CTG) examination.

Neuron type: Input neurons.
Number of neurons: 25 (same as the number of input parameters in NDATA).

Hidden Layer:

Neuron type: Neurons with a nonlinear activation function (here the default, tansig, the hyperbolic tangent sigmoid function).
Number of neurons: 32 (set by the variable hidden_neurons).

Output Layer:

The network outputs three disease classification categories:

Normal condition
Suspect condition
Pathological condition

Neuron type: Neurons with a softmax activation function (typically used for classification tasks).
Number of neurons: 3 (one for each disease category).
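The structure above can be sketched as a single forward pass in base MATLAB. The weights and the input vector are random placeholders (assumptions for illustration, not the trained network), and tanh stands in for tansig, to which it is mathematically equivalent.

```matlab
% Layer sizes taken from the description above
n_inputs = 25;
n_hidden = 32;
n_outputs = 3;

% Hypothetical random weights and zero biases (a trained network would supply these)
W1 = 0.1 * randn(n_hidden, n_inputs);
b1 = zeros(n_hidden, 1);
W2 = 0.1 * randn(n_outputs, n_hidden);
b2 = zeros(n_outputs, 1);

% Hypothetical input: one sample with 25 parameters
x = randn(n_inputs, 1);

% Hidden layer with hyperbolic tangent activation
h = tanh(W1 * x + b1);

% Output layer with softmax (subtracting the maximum improves numerical stability)
z = W2 * h + b2;
z = z - max(z);
y = exp(z) / sum(exp(z));

disp(y');
```

The three outputs are positive and sum to 1, so they can be read as class probabilities for Normal, Suspect, and Pathological.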

🏄 Training Parameters

net.trainParam.goal = 0.001; % Termination condition for error
net.trainParam.epochs = 1000; % Maximum number of epochs
net.trainParam.max_fail = 12; % Maximum number of failed validations

% Training the neural network and returning training data
[net, tr] = train(net, data.NDATA', targets');

Termination Conditions:

net.trainParam.goal = 0.001;

Termination condition for error: Training stops as soon as the network error drops to 0.001 or below.

net.trainParam.epochs = 1000;

Maximum number of epochs: The network can go through a maximum of 1000 training cycles.

net.trainParam.max_fail = 12;

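Maximum number of validation failures: Training stops once the validation error has failed to improve for 12 consecutive checks. Because valRatio is set to 0 in the data split, no validation set exists, so this criterion does not trigger in the configuration shown.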

Criterion Function:

The criterion function is Mean Squared Error (MSE). This function is used to evaluate the network's error during training.

In the code:

Training stops when the MSE reaches the set target value of 0.001 (net.trainParam.goal = 0.001).
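As a rough illustration of the criterion, the sketch below computes the mean squared error between one-hot targets and network outputs by hand and compares it with the goal. The target and output values are hypothetical, chosen only to make the arithmetic easy to follow.

```matlab
% Hypothetical one-hot targets and network outputs (rows = samples, columns = classes)
targets = [1 0 0; 0 1 0];
outputs = [0.9 0.1 0.0; 0.2 0.7 0.1];

% Mean of the squared differences over all elements
errors = targets - outputs;
mse_value = mean(errors(:) .^ 2);

% Compare against the training goal from net.trainParam.goal
goal = 0.001;
goal_reached = mse_value <= goal;

fprintf('MSE = %.4f, goal reached: %d\n', mse_value, goal_reached);
```

Here the squared errors sum to 0.16 over six elements, so the MSE is about 0.0267 and training would continue.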

Training Process and Contingency Matrix for the Best Network

📉 Training Process Progress Chart:

The chart illustrates the training process of the neural network and the error reduction over training epochs.

Key Elements of the Chart:

🔵 Blue Line (Training Data Error):

Shows the error trend on training data across epochs.
Initially, the error decreases sharply, indicating that the model is learning.
As training progresses, the error stabilizes, suggesting convergence.

🔴 Red Line (Testing Data Error):

Represents the error on unseen test data.
A stable test error suggests strong generalization to new data.
If the test error increases, it may indicate overfitting, where the model performs well on training data but struggles with new data.

📎 Best Performance Point:

Marks the epoch where the model achieved its lowest error value, demonstrating peak accuracy during training.

📎 Overall Trend:

A gradual decrease in error signifies that the model is successfully adjusting its parameters.
If the gap between training and test error is too large, it may indicate poor generalization.

🔢 Contingency Matrix (plotconfusion):

🔷 Purpose of the Contingency Matrix:

Provides a detailed evaluation of the model’s classification accuracy.
Helps identify misclassified instances and improve model performance.

📊 Interpreting the Matrix:

Diagonal Values (Green): Represent correctly classified samples, indicating strong classification accuracy.
Off-Diagonal Values (Red): Show misclassified instances; fewer red values suggest better generalization.
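To make the reading of the matrix concrete, the following sketch builds a small confusion matrix by hand and derives the accuracy from its diagonal. The label vectors are hypothetical; in this sketch rows are true classes and columns are predicted classes, which may differ from the axis orientation that plotconfusion draws.

```matlab
% Hypothetical true and predicted class labels for eight samples
true_labels = [1 1 1 2 2 3 3 3];
predicted_labels = [1 1 2 2 2 3 1 3];

% Count each (true, predicted) pair: rows = true class, columns = predicted class
n_classes = 3;
C = accumarray([true_labels(:), predicted_labels(:)], 1, [n_classes, n_classes]);

% Correct classifications sit on the diagonal
accuracy = trace(C) / sum(C(:));

disp(C);
fprintf('Accuracy: %.2f%%\n', accuracy * 100);
```

Six of the eight hypothetical samples fall on the diagonal, giving an accuracy of 75%; the two off-diagonal counts show exactly which classes were confused.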

🚀 Performance Metrics:

Training Accuracy: 100%
Testing Accuracy: 93.3%
Error Rate: 6.7%

📎 Significance of Results:

High training accuracy confirms effective learning of patterns.
A testing accuracy of 93.3% shows that the network generalizes reasonably well, although the gap to the 100% training accuracy points to some degree of overfitting.

🔷 Overall Classification Accuracy:

This contingency matrix provides a comprehensive assessment of the neural network’s performance, considering both training and testing results.
The classification accuracy across all data is 97.3%, meeting the expected requirements.
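The overall figure is consistent with a weighted combination of the two partial accuracies. The short check below assumes the data were split exactly 60/40, as configured in the data splitting section.

```matlab
% Split ratios from the data splitting configuration
train_ratio = 0.6;
test_ratio = 0.4;

% Accuracies reported for the best network
train_acc = 1.000;
test_acc = 0.933;

% Overall accuracy as the sample-weighted average of both sets
overall_acc = train_ratio * train_acc + test_ratio * test_acc;

fprintf('Overall accuracy: %.1f%%\n', overall_acc * 100);
```

This gives 0.6 · 100% + 0.4 · 93.3% ≈ 97.3%, matching the value read from the combined matrix.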

📊 Evaluation Summary:

Training Accuracy: 100%
Testing Accuracy: 93.3%
Overall Accuracy: 97.3%

📎Key Insights:

The high classification accuracy indicates that the model effectively learns patterns and generalizes well.
A minimal error percentage suggests strong reliability and robustness in predictions.

📋 Neural Network Testing:

📊 Average Results Across 5 Runs:

Training Accuracy: min = 99.9%, average = 99.9%, max = 99.9%
Testing Accuracy: min = 92.56%, average = 92.56%, max = 92.56%

📎 Performance Evaluation:

The model successfully meets the requirement of exceeding 92% classification accuracy on test data.
The reported minimum, average, and maximum are identical, so no variation between runs was recorded in these results.

📊 MATLAB Implementation:

% Inside the loop over runs (the loop index "run" is an assumption here):
% store one success rate per run so that min/mean/max summarize all runs
test_accuracies(run) = 1 - confusion(test_target, test_outputs);

% After the loop: report minimum, average, and maximum across runs
fprintf('Training Accuracy: min = %.2f%%, avg = %.2f%%, max = %.2f%%\n', ...
    min(train_accuracies) * 100, mean(train_accuracies) * 100, max(train_accuracies) * 100);
fprintf('Testing Accuracy: min = %.2f%%, avg = %.2f%%, max = %.2f%%\n', ...
    min(test_accuracies) * 100, mean(test_accuracies) * 100, max(test_accuracies) * 100);

📄 Training Process and Contingency Matrix for Different Neuron Counts

First Variant: 100 Neurons

The number of neurons was set to 100 as an illustrative example.

Key Elements of the Chart:

🔵 Blue Line (Training Data Error):

The training error dropped only during the first epoch, to 2.627.
The curve then stayed flat for the remaining epochs up to the 1000-epoch limit, so no further learning took place.
This suggests that training stagnated and the weights in the hidden layer were not optimized effectively.

🔴 Red Line (Testing Data Error):

The test error remained at the same level as the training error, indicating poor generalization.
No improvement was observed in test accuracy, suggesting the model failed to capture the underlying structure of the test data.

📊 Contingency Matrix Analysis (100 Neurons):

📎 Key Insights:

The network did not learn effectively, resulting in high error rates.
The model failed to optimize correctly, leading to poor performance on unseen data.
Increasing the neuron count beyond an optimal number does not necessarily improve classification accuracy.

Second Variant: 10 Neurons

Neuron Configuration: For the second variant, the number of hidden neurons was set to 10 to evaluate its impact on performance.

Key Elements of the Chart:

🔵 Blue Line (Training Data Error):

The error stabilized after a few epochs but remained higher than when using 32 neurons.
The model learned slowly and failed to optimize weights effectively, indicating that 10 neurons were insufficient for capturing data complexity.

🔴 Red Line (Testing Data Error):

Higher test error compared to training error suggests weaker generalization.
Though the test error stabilized, it remained significantly worse than results with 32 neurons.

📊 Contingency Matrix (10 Neurons):

📎 Overall Classification Performance:

The matrix considers both training and testing results.
Total Classification Accuracy: 95.3% → Acceptable but still below desired levels

🚀 Final Observations:

The low neuron count limited the learning capacity, leading to higher error rates than with 32 neurons.
While classification accuracy improved over training, the network struggled with unseen data, limiting generalization.

💊 Testing Samples from Each Disease Type for the Best-Trained Network

🔎 Overview:
The image displays the classification results for one sample from each disease category using the best-trained neural network (32 hidden neurons).

📊 Sample Classification Details:

1️⃣ Normal Sample

Predicted Probabilities: [1, 2.9031e-21, 5.3626e-29]
- The first value (1) indicates nearly 100% certainty that the sample belongs to Group 1 (Normal).
- The remaining probabilities are close to zero, showing that the model is highly confident in this classification.

2️⃣ Suspect Sample

Predicted Probabilities: [5.0451e-14, 0.99996, 3.7645e-05]
- The second value (0.99996) indicates a very high probability that the sample belongs to Group 2 (Suspect).
- The first and third probabilities are nearly zero, suggesting the model is strongly confident in this classification.

3️⃣ Pathological Sample

Predicted Probabilities: [1.3813e-16, 3.2275e-12, 1]
- The third value (1) demonstrates almost 100% certainty that the sample belongs to Group 3 (Pathological).
- The other probability values are negligible, reinforcing the model’s confidence in its decision.

📎 Key Insights:

For these three samples, the probability assigned to the correct class is at or near 1, so the network separates the disease types cleanly.
The probabilities assigned to the other classes are negligible, which shows that the predictions were made with high confidence.
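Turning such a probability vector into a class decision amounts to picking the index of the largest value. The sketch below does this for the Suspect sample's probabilities quoted above; the class_names cell array is a hypothetical helper introduced for readability.

```matlab
% Class names in the order of the network outputs (hypothetical helper)
class_names = {'Normal', 'Suspect', 'Pathological'};

% Predicted probabilities for the Suspect sample, as listed above
probabilities = [5.0451e-14, 0.99996, 3.7645e-05];

% The predicted class is the output with the highest probability
[confidence, class_idx] = max(probabilities);

fprintf('Predicted class: %d (%s), p = %.5f\n', ...
    class_idx, class_names{class_idx}, confidence);
```

For a whole matrix of network outputs, the Deep Learning Toolbox function vec2ind performs the same conversion column by column.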

📄 Classification Accuracy, Sensitivity, and Specificity

📊 Overview:
The image presents the evaluation metrics for classification accuracy, sensitivity, and specificity.

🔵 Sensitivity:

99.32% on the training set → The model correctly identified almost all positive cases.
85.88% on the test set → Sensitivity is noticeably lower on unseen data: about 14% of positive cases are missed.

🔴 Specificity:

99.69% on the training set → The model correctly rejects almost all negative cases.
95.54% on the test set → Specificity remains high on the test data.

📎 Overall Accuracy:

99.22% on the training set → The model achieves near-perfect accuracy on training data.
91.88% on the test set → Accuracy declined on unseen data and falls just short of the 92% target set for this task.
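For a three-class problem, sensitivity and specificity are usually computed per class in a one-vs-rest fashion. The sketch below shows that calculation on a hypothetical confusion matrix; the counts are invented for illustration, and the README does not state how the reported values were aggregated across classes.

```matlab
% Hypothetical confusion matrix: rows = true class, columns = predicted class
C = [50  2  1;
      3 20  2;
      0  1 10];

total = sum(C(:));

% One-vs-rest counts for each class
TP = diag(C);               % correctly predicted as this class
FN = sum(C, 2) - TP;        % this class, predicted as another
FP = sum(C, 1)' - TP;       % another class, predicted as this one
TN = total - TP - FN - FP;  % everything else

sensitivity = TP ./ (TP + FN);
specificity = TN ./ (TN + FP);
accuracy = sum(TP) / total;

for k = 1:3
    fprintf('Class %d: sensitivity = %.2f%%, specificity = %.2f%%\n', ...
        k, sensitivity(k) * 100, specificity(k) * 100);
end
fprintf('Accuracy: %.2f%%\n', accuracy * 100);
```

Averaging the per-class values is one common way to arrive at a single sensitivity and specificity figure per data set.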
