Test Accuracy: -- AUC-ROC: -- F1 Score: -- 5-CV Mean: -- Total Samples: -- Features: -- Test Accuracy: -- AUC-ROC: -- F1 Score: -- 5-CV Mean: -- Total Samples: -- Features: --
Connecting
--:--:--
Model Evaluation
Complete performance analysis — Random Forest trained on 80,791 financial tweets with FinBERT auto-labelling
Test Accuracy
--
held-out test set
CV Accuracy
--
5-fold mean
AUC-ROC
--
binary class
F1 Score
--
weighted avg
Precision
--
weighted avg
Recall
--
weighted avg
Confusion matrix
-- predictions
← Predicted label → Actual label ↕
Predicted Rise
Predicted Fall
--% of Rise
--
True Positive
Actual Rise
Predicted Rise ✓
--% of Rise
--
False Negative
Actual Rise
Predicted Fall ✗
--% of Fall
--
False Positive
Actual Fall
Predicted Rise ✗
--% of Fall
--
True Negative
Actual Fall
Predicted Fall ✓
Sensitivity
--
TP/(TP+FN)
Specificity
--
TN/(TN+FP)
MCC
--
Matthews Corr.
ROC curve
AUC = --
Interpretation
AUC > 0.7 indicates good discriminative ability. Our model -- the 0.7 threshold, suggesting the Random Forest meaningfully differentiates Rise vs Fall signals beyond random chance.
Classification report
ClassPrecisionRecallF1-ScoreSupport
Rise (1)--------
Fall (0)--------
Weighted avg -- -- -- --
5-fold cross-validation
Mean: --
Fold 1
--
Fold 2
--
Fold 3
--
Fold 4
--
Fold 5
--
Mean: -- Std: -- Stratified K-Fold
Feature importance (Gini impurity)
Model configuration
AlgorithmRandom Forest
n_estimators200
max_depth12
max_featuressqrt
min_samples_split5
class_weightbalanced
criteriongini
SMOTEenabled
train/test split80% / 20%
Label sourceFinBERT auto
NLP modelProsusAI/FinBERT
Total features36