UT Arlington

UT Arlington

Department of Electrical Engineering

 

IPNNL logo

Training Data Files for Classification

GRNG.TRN: (16 Inputs, Class Id, 800 Training Patterns, 196KB)

The geometric shape recognition data file consists of four geometric shapes, ellipse, triangle, quadrilateral, and pentagon. Each shape consists of a matrix of size 64*64. For each shape, 200 training patterns were generated using different degrees of deformation. The deformations included rotation, scaling, translation, and oblique distortions. The feature set is ring-wedge energy (RNG), and has 16 features.

For more information on the data file, see

H. C. Yau, M. T. Manry, "Iterative Improvement of a Nearest Neighbor Classifier", Neural Networks, Vol. 4, pp. 517-524, 1991

grng.tra (zipped)

 

GONGTRN.TRA: ( 16 Inputs, Class Id, 3000 Training Patterns, 780KB)

The raw data consists of images from hand printed numerals collected from 3,000 people by the Internal Revenue Service. We randomly chose 300 characters from each class to generate 3,000 character training data. Images are 32 by 24 binary matrices. An image scaling algorithm is used to remove size variation in characters. The feature set contains 16 elements. The 10 classes correspond to 10 Arabic numerals. For more details concerning the features, see

W. Gong, H. C. Yau, and M. T. Manry, "Non-Gaussian Feature Analyses Using a Neural Network," Progress in Neural Networks, vol. 2, 1994, pp. 253-269.

A testing version GONGTST is also available (780K) for download.

gongtrn.tra  (zipped)
gongtst.tst  (zipped)

           
COMF18.TRA : ( 18 Inputs, Class Id, 12,392 Training Patterns, 3.8MB)

The training data file is generated  segmented images. Each segmented region is separately histogram equalized to 20 levels. Then the joint probability density of pairs of pixels separated by a given distance and a given direction is estimated. We use 0, 90, 180, 270 degrees for the directions and 1, 3, and 5 pixels for the separations. The density estimates are computed for each classification window. For each separation, the co-occurrences for for the four directions are folded together to form a triangular matrix. From each of the resulting three matrices, six features are computed: angular second moment, contrast, entropy, correlation, and the sums of the main diagonal and the first off diagonal. This results in 18 features for each classification window.

For more details concerning the features, see

R.R. Bailey, E. J. Pettit, R. T. Borochoff, M. T. Manry, and X. Jiang, "Automatic Recognition of USGS Land Use/Cover Categories Using Statistical and Neural Network Classifiers," Proceedings of SPIE OE/Aerospace and Remote Sensing, April 12-16, 1993, Orlando Florida.

Four regions of land use/cover types were identified in the images per Level I of the US Geological Survey Land Use/Land Cover Classification System : urban areas, fields or open grassy land, trees (forested land), and water ( lakes or rivers).

comf18.tra (zipped)

SPEECH_CLASS.TRA: (39 Inputs, 34 Classes, 2184 Training Patterns, 853 KB)

The speech samples are first preemphasized and it is converted into frequency domain by taking DFT. Then it is passed through Mel filter banks and the inverse DFT is applied on the output to get Mel-Frequency Cepstrum Coefficients (MFCC). Each of MFCC(n), MFCC(n)-MFCC(n-1) and MFCC(n)-MFCC(n-2) would have 13 features, which results in a total of 39 features. Each class corresponds to a phoneme.

Speech_Class (zipped)


F17C.DAT: (17 inputs, 39 Classes, 4745 Training Patterns, 1.33 MB)

This data file consists of parameters that are available in the basic health usage monitoring system (HUMS), plus some others. The data was obtained from the M430 flight load level survey conducted in Mirabel Canada in early 1995. The input features include: (1) CG F/A load factor, (2) CG lateral load factor, (3) CG normal load factor, (4) pitch attitude, (5) pitch rate, (6) roll attitude, (7) roll rate, (8) yaw rate, (9) corrected airspeed, (10) rate of climb, (11) longitudinal cyclic stick position, (12) pedal position, (13) collective stick position, (14) lateral cyclic stick position, (15) main rotor mast torque, (16) main rotor mast pm, (17) density ratio. The 39 classes represents different maneuvers  of the flight like taking off, landing, turning right or left etc. This is an application for prognostics or flight condition recognition.

F17C (zipped)

 

© 2009 The University of Texas at Arlington
© 2009 Image Processing and Neural Networks Lab