Home  |   Tutorial  |   Summary  |   Detect  |   Donate Data  |   Download Pre-train Data

Summary

Information about the tools used by HEAP

The goal of HEAP is to help professionals find the target eggs quickly. The internal implementation method is to provide tools to predict the target from the pictures uploaded by the user, so that the user can quickly identify the target eggs from the candidate eggs.

The tools provided by the platform mainly use deep learning to predict the position of candidate eggs from the map.At present, two models of SSD300 and U-net are used. Before predicting candidate eggs from the graph, we need to prepare a training data set for the model to learn identification rules.



Here is a summary of the number of parasite egg types provided by the platform, the number of tools provided, and the number of models.


Table 1. Introduction table of identification tools provided by the platform.

Number of parasite egg typesNumber of toolsNumber of models
17322
  • Trichuris trichiura egg
  • Ascaris lumbricoides egg(fertilized)
  • Diphyllobothrium latum egg
  • Enterobius vermicularis egg
  • Taenia/Echinococcus egg
  • Fasciola hepatica egg
  • Schistosoma japonicum egg
  • Toxocara canis egg
  • Clonorchis sinensis egg
  • Fasciolopsis buski egg
  • Paragonimus westermani egg
  • Echinostoma spp. egg
  • Hymenolepis diminuta egg
  • Hookworm egg
  • Hymenolepis nana egg
  • Schistosoma haematobium egg
  • Schistosoma mansoni egg
  • SSD300
  • U-net
  • Faster R-CNN
  • Generate various models according to different training parameters




Here is an introduction to the types of parasite eggs identified by the platform, the source of the training data set, and the training data set information.

Identified parasite egg type

This platform currently provides identification of 17 kinds of parasite eggs, and the available tools are shown in the table. Based on the scalability of the platform, more identifiable types of parasite eggs and available tools will be added in the future to help users find the target eggs.


Table 2. Identification of parasite eggs provided by the platform.

Parasite egg categorySSD300U-netFaster R-CNN
Trichuris trichiura egg
Ascaris lumbricoides egg(fertilized)constructing
Diphyllobothrium latum egg
Enterobius vermicularis eggconstructing
Taenia/Echinococcus egg
Fasciola hepatica egg
Schistosoma japonicum eggconstructing
Toxocara canis eggconstructingconstructing
Clonorchis sinensis eggconstructingconstructing
Fasciolopsis buski eggconstructingconstructing
Paragonimus westermani eggconstructingconstructing
Echinostoma spp. eggconstructingconstructing
Hymenolepis diminuta eggconstructingconstructing
Hookworm eggconstructingconstructing
Hymenolepis nana eggconstructingconstructing
Schistosoma haematobium eggconstructingconstructing
Schistosoma mansoni eggconstructingconstructing



Training data set source

The source of the data set used for training is the slide provided by the Parasitology Department of Chang Gung University. Scan the slides through the instrument to obtain images of different depth of fields. The table shows the parameters used to scan the slides.


Table 3. Equipment and parameters for scanning slides

ParameterValue
Scanner Agent:Yuanli Instrument
Microscope equipment:Olympus BX53
Scanning software:cellSens Dimension
Objective magnification:10X
Eyepiece magnification:10X
Exposure time:Automatic



Training data set information

The images scanned from the slides need to be manually labeled to generate a data set for training the model. During model training, feature values ​​are extracted from pictures of different sizes according to the model settings. The training data set information corresponding to the identification tools trained with different models are shown in tables. Tables on the website indicated how many images captured and the total depth of field covered in each specimen. This is an important feature of HEAP to simulate the actual experience of microscopic observation. (Note: Training data can be obtained from the "Download Pre-train Data" page.)


SSD300

Use pictures with a size of 300*300 and manually labeled .xml files with labelImg for training.


Table 4. Data set information for various types of worm eggs using SSD300 training

Parasite egg categorySlide namedepth of fields(μm)Number of imagesNumber of training sets
Trichuris trichiura eggoriginal11012340
Ascaris lumbricoides egg(fertilized)(Csp-1)8091434
Diphyllobothrium latum egg(92W5257)809159
Enterobius vermicularis egg(0822)808638
Echinococcus granulosus egg(PS1710)10011290
Fasciola hepatica egg(30-6406)10011346
Schistosoma japonicum egg(PS1301)10011159
Fasciolopsis buski eggBuski_egg10011112
Paragonimus westermani egg(PS1415)33034100
Echinostoma spp. egg(E-16)1001188
Hymenolepis diminuta egg(92W5341)10011285
Hookworm egg(HL E-3)901080
Hymenolepis nana egg(92W5361)1001184
Schistosoma haematobium egg(92W5123)10011100
Schistosoma mansoni egg(92W5153)901080



U-net

Use 512*512 pictures and manually labeled .json files with labelme for training.


Table 5. Data set information for various types of worm eggs using U-net training

Parasite egg categorySlide namedepth of fields(μm)Number of imagesNumber of training sets
Trichuris trichiura eggoriginal11012485
Ascaris lumbricoides egg(fertilized)(Csp-1)809655
Diphyllobothrium latum egg(92W5257)809117
Echinococcus granulosus egg(PS1710)10011265
Fasciola hepatica egg(30-6406)10011287



Faster R-CNN

Use 300*300 pictures and manually labeled .xml files with labelImg for training.


Table 6. Data set information for various types of worm eggs using Faster R-CNN training

Parasite egg categorySlide namedepth of fields(μm)Number of imagesFaster R-CNN (default)
Number of training sets
Faster R-CNN (clear)
Number of training sets
Trichuris trichiura eggoriginal11012619332
Diphyllobothrium latum egg(92W5257)8095825
Enterobius vermicularis egg(0822)808480268
Echinococcus granulosus egg(PS1710)1001117745
Fasciola hepatica egg(30-6406)10011217121
Schistosoma japonicum egg(PS1301)100119760
Toxocara canis egg(92W5823)100115617
Clonorchis sinensis egg(PS1218)809674164