Bài học 6 trong số 7

Evaluate and Test

5 phút để hoàn thành

Evaluate and Test

How to interpret the output of your model and evaluate its performance

Precision and Recall

Once the model is trained, you will see a summary of the model performance with scores for "Precision" and "Recall".

Precision tells us what proportion of the images identified by the model as positive should indeed have been categorised as such. Recall instead tells us what proportion of actual positive images were correctly identified.

Our model performed very well in both categories, with scores above 97%. Let's see what that means in more detail.

Evaluate the model performance

Click on "Evaluate" on the top menu and let's explore the interface. First, it shows us again the scores on precision and recall. In our case, the precision score tells us that 97% of the test images that the model identified as examples of amber mining were indeed showing traces of amber mining.

Bước 1
The recall score instead tells us that 97% of the test images showing examples of amber mining were correctly labelled as such by the model.

Bước 2
Confidence threshold is the level of confidence the model must have to assign a label. The lower it is, the more images the model will classify, but the higher the risk of misclassifying some images.

Bước 3
If you want to dig deeper and also explore the precision-recall curves, follow the link on the interface to learn more.

False positives and False negatives

Next, let's look at the Confusion Matrix. The higher the scores on blue background, the better the model performed. In this example, the scores are very good.

Bước 1
All images that should have been labelled as negative (no amber mining) were recognised by the model and 82% of the images that included traces of amber mining were correctly labelled as such.

Bước 2
We have no false positives – no images were wrongly labelled as examples of amber mining – and only 12% of false negatives: images showing traces of amber mining that the model failed to recognise.

Bước 3
This is good for the purpose of our investigation into illegal amber mining: it's better to miss some positive examples than to bring as proof of amber mining images that do not actually show that.

Bước 4
Click on the left filters if you want to see which test images were correctly or wrongly classified by the model.

Bước 5
Not yet sure if you can trust the model? By clicking on “Test & Use”, you can upload brand-new satellite images – with or without traces of amber mining – to see if the model labels them correctly.

Test and train again

A few final considerations before we wrap up:

You might be wondering how the model is getting some wrong answers when we told it all the right answers to begin with. If you are, you might want to review the split into training, validation, and test sets described in the previous lesson.

For this example, almost all of the images were classified correctly. But that will not always be the case. If you are not satisfied with your model's performance, you can always update and improve your dataset and train the model again. You could carefully analyse what went wrong in the first iteration and, for example, add to your training set more images similar to those that were misclassified by the model.

As for humans, learning is an iterative process.

Xin chúc mừng! Bạn đã hoàn thành Evaluate and Test Rồi, tôi đang thực hiện

Đề xuất cho bạn

open_in_new

Google Xu hướng nâng cao

Bài học

Trở thành bậc thầy về công cụ Khám phá xu hướng bằng các mẹo đơn giản này để trích xuất dữ liệu chính xác.

Bắt đầu

Xóa khỏi tài khoản

Lưu vào tài khoản của bạn

None
open_in_new

Google Sheets: Scraping data from the internet

Bài học

Build your own data sets using Google Sheets.

Bắt đầu

Xóa khỏi tài khoản

Lưu vào tài khoản của bạn

None
open_in_new

YouTube: A storytelling tool.

Bài học

Find out how to cultivate and maintain a YouTube audience.

Bắt đầu

Xóa khỏi tài khoản

Lưu vào tài khoản của bạn

None

Bạn đánh giá như thế nào về bài học này?

Ý kiến phản hồi của bạn sẽ giúp chúng tôi không ngừng cải thiện các bài học!

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

TITLE

Evaluate and Test

Precision and Recall

Evaluate the model performance

False positives and False negatives

Test and train again

Google Xu hướng nâng cao

Google Sheets: Scraping data from the internet

YouTube: A storytelling tool.

Tôi đang tìm kiếm tài nguyên trong

Evaluate and Test

Precision and Recall

Evaluate the model performance

False positives and False negatives

Test and train again

Google Xu hướng nâng cao

Google Sheets: Scraping data from the internet

YouTube: A storytelling tool.