Image Processing and Neural Networks Lab
|
|
I. Training Data Files For Regression Or Approximation
This training file is used to
train a neural network to perform demodulation of an FM (frequency modulation)
signal containing a sinusoidal message. The data are generated from the equation For more details, see K. Rohani and M. T. Manry, "The Design of Multi-Layer Perceptrons using Building Blocks," Proc of IJCNN 91, Seattle WA., pp. II-497 to II-502. ▪ fmtrain.dat
(zipped)
This training file is used in the task of inverting the surface scattering parameters from an inhomogeneous layer above a homogeneous half space, where both interfaces are randomly rough. The parameters to be inverted are the effective permittivity of the surface, the normalized rms height, the normalized surface correlation length, the optical depth, and single scattering albedo of an inhomogeneous irregular layer above a homogeneous half space from back scattering measurements. The training data file
contains 1768 patterns. The inputs consist of eight theoretical values of
back scattering coefficient parameters at V and H polarization and four incident
angles. The outputs were the corresponding values of permittivity, upper surface
height, lower surface height, normalized upper surface correlation length,
normalized lower surface correlation length, optical depth and single scattering
albedo which had a joint uniform pdf. M. S. Dawson, A. K. Fung and M. T. Manry, "Surface parameter retrieval using fast learning neural networks," Remote Sensing Reviews, 1993, Vol. 7(1), pp. 1-18. M. S. Dawson, J. Olvera, A. K. Fung and M. T. Manry, "Inversion of surface parameters using fast learning neural networks," Proc. of IGARSS'92, Houston, Texas, May 1992, Vol II, pp 910 - 912. The testing version of the
data file TWOD.TST is also available (Size 138K) This file was generated by Mike Dawson while he worked for Prof.Adrian Fung, at University of Texas at Arlington. Dr.Dawson currently works at Raytheon E-Systems in Garland, Texas.
This training data file consists of 16 inputs and 3 outputs and represents the training set for inversion of surface permittivity, the normalized surface rms roughness, and the surface correlation length found in back scattering models from randomly rough dielectric surfaces. The first 16 inputs represent the simulated back scattering coefficient measured at 10, 30, 50 and 70 degrees at both vertical and horizontal polarization. The remaining 8 are various combinations of ratios of the original eight values. These ratios correspond to those used in several empirical retrieval algorithms. For more details, see A. K. Fung, Z. Li, and K. S. Chen, "Back scattering from a Randomly Rough Dielectric Surface," IEEE Trans. Geo. and Remote Sensing, Vol. 30, No. 2, March 1992. A. K. Fung, Microwave Scattering and Emission Models and Their Applications, Arctec House, 1994. ▪ single2.tra (zipped) This file was generated by Mike Dawson while he worked for Prof.Adrian Fung, at University of Texas at Arlington. Dr.Dawson currently works at Raytheon E-Systems in Garland, Texas.
This data set is given in Oh,
Y., K. Sarabandi, and F.T. Ulaby, "An Empirical Model and an Inversion
Technique for Radar Scattering from Bare Soil Surfaces," in IEEE
Trans. on Geoscience and Remote Sensing, pp. 370-381, 1992. The training set
contains VV and HH polarization at L 30, 40 deg, C 10, 30, 40, 50, 60 deg, and X
30, 40, 50 deg along with the corresponding unknowns rms surface height,
surface correlation length, and volumetric soil moisture content in g /
cubic cm.
This training file was generated using data obtained from TU Electric Company in Texas. The first ten input features are last ten minutes power load in megawatts for the entire TU Electric utility, which covers a large part of north Texas. The output is power load fifteen minutes in the future from the current time. All powers were originally sampled every fraction of a second, and averaged over 1 minute to reduce noise. The last two inputs are respectively, the "True Area Control Error" (TACE) and the "Filtered Area Control Error" (FACE). The FACE is a combination of exponentially filtered TACE and moving average filtered TACE. For more details, see K. Liu, S. Subbarayan, R. R. Shoults, M. T. Manry, C. Kwan, F. L. Lewis, and J. Naccarino, "Comparison of Very Short-Term Load Forecasting Techniques," IEEE Transactions on Power Systems, vol.11, no.2, May 1996, pp. 877-882. M. T. Manry, R. Shoults, and J. Naccarino, "An Automated System for Developing Neural Network Short Term Load Forecasters," Proceedings of the 58th American Power Conference, Chicago, Ill., April 9-11, 1996, vol. 1, pp. 237-241. A testing version POW12TST
(299 K) is also available for download.
This training file provides the data set for inversion of random two-by-two matrices. Each pattern consists of 4 input features and 4 output features. The input features, which are uniformly distributed between 0 and 1, represent a matrix and the four output features are elements of the corresponding inverse matrix. The determinants of the input matrices are constrained to be between .3 and 2.
▪ mattrn.zip (zipped) ▪ mattst.zip (zipped)
The speech samples are first preemphasized and it is converted into frequency domain by taking DFT. Then it is passed through Mel filter banks and the inverse DFT is applied on the output to get Mel-Frequency Cepstrum Coefficients (MFCC). Each of MFCC(n), MFCC(n)-MFCC(n-1) and MFCC(n)-MFCC(n-2) would have 13 features, which results in a total of 39 features.
The desired
outputs are likelihoods for the beginning, middle, and ends of 39 phonemes. ▪ Speech_Map (zipped)
II Training Data Files For Classification
The geometric shape recognition data file consists of four geometric shapes, ellipse, triangle, quadrilateral, and pentagon. Each shape consists of a matrix of size 64*64. For each shape, 200 training patterns were generated using different degrees of deformation. The deformations included rotation, scaling, translation, and oblique distortions. The feature set is ring-wedge energy (RNG), and has 16 features. For more information on the data file, see H. C. Yau, M. T. Manry, "Iterative Improvement of a Nearest Neighbor Classifier", Neural Networks, Vol. 4, pp. 517-524, 1991 ▪ grng.tra (zipped)
The raw data consists of images from hand printed numerals collected from 3,000 people by the Internal Revenue Service. We randomly chose 300 characters from each class to generate 3,000 character training data. Images are 32 by 24 binary matrices. An image scaling algorithm is used to remove size variation in characters. The feature set contains 16 elements. The 10 classes correspond to 10 Arabic numerals. For more details concerning the features, see W. Gong, H. C. Yau, and M. T. Manry, "Non-Gaussian Feature Analyses Using a Neural Network," Progress in Neural Networks, vol. 2, 1994, pp. 253-269. A testing version GONGTST is
also available (780K) for download. ▪ gongtst.tst (zipped)
The training data file is generated segmented images. Each segmented region is separately histogram equalized to 20 levels. Then the joint probability density of pairs of pixels separated by a given distance and a given direction is estimated. We use 0, 90, 180, 270 degrees for the directions and 1, 3, and 5 pixels for the separations. The density estimates are computed for each classification window. For each separation, the co-occurrences for for the four directions are folded together to form a triangular matrix. From each of the resulting three matrices, six features are computed: angular second moment, contrast, entropy, correlation, and the sums of the main diagonal and the first off diagonal. This results in 18 features for each classification window. For more details concerning the features, see R.R. Bailey, E. J. Pettit, R. T. Borochoff, M. T. Manry, and X. Jiang, "Automatic Recognition of USGS Land Use/Cover Categories Using Statistical and Neural Network Classifiers," Proceedings of SPIE OE/Aerospace and Remote Sensing, April 12-16, 1993, Orlando Florida. Four regions of
land use/cover types were identified in the images per Level I of the US
Geological Survey Land Use/Land Cover Classification System : urban areas,
fields or open grassy land, trees (forested land), and water ( lakes or rivers).
The speech samples are first preemphasized and it is converted into frequency domain by taking DFT. Then it is passed through Mel filter banks and the inverse DFT is applied on the output to get Mel-Frequency Cepstrum Coefficients (MFCC). Each of MFCC(n), MFCC(n)-MFCC(n-1) and MFCC(n)-MFCC(n-2) would have 13 features, which results in a total of 39 features. Each class corresponds to a phoneme.
▪ Speech_Class (zipped)
This data file consists of parameters that are available in the basic health usage monitoring system (HUMS), plus some others. The data was obtained from the M430 flight load level survey conducted in Mirabel Canada in early 1995. The input features include: (1) CG F/A load factor, (2) CG lateral load factor, (3) CG normal load factor, (4) pitch attitude, (5) pitch rate, (6) roll attitude, (7) roll rate, (8) yaw rate, (9) corrected airspeed, (10) rate of climb, (11) longitudinal cyclic stick position, (12) pedal position, (13) collective stick position, (14) lateral cyclic stick position, (15) main rotor mast torque, (16) main rotor mast pm, (17) density ratio. The 39 classes represents different maneuvers of the flight like taking off, landing, turning right or left etc. This is an application for prognostics or flight condition recognition.
▪ F17C (zipped)
III Software
This software can be used to generate training data files that can be used to train a neural network to perform demodulation of an FM (Frequency Modulation) signal containing a random waveform. The generated training data file will have the user specified number of input samples which are nothing but values of the FM waveform taken over the specified window size and one output corresponding to the window center sample of the original random waveform. The window size is an odd number and a minimum of 3. The user can generate any number of patterns and save it as desired. Both training and testing files can be generated for the FM Demodulation application.
▪ Source Code (zipped) ▪ Training Files (zipped) ▪
Sample
Testing (zipped)
This software can be used to generate training data files that can be used to train a neural network to perform matrix inversion. The generated training data file will have four input samples which are nothing but elements of a random 2x2 matrix and four output samples which are the elements of the corresponding inverse matrix. The user can generate any number of patterns and save it as desired. Both training and testing files can be generated for the Matrix Inversion application.
▪ Source Code (WinZipped version) ▪ Training Files (WinZipped version) ▪
Sample
Testing (WinZipped version)
This software can be used to generate training data files that can be used to train a neural network to perform estimation of parameters of an exponential waveform, from its KLT coefficients. The exponential waveform is generated as A*exp(-n/T)+n(n), where A and T have uniform densities between (1,4) and (10,50) respectively and n(n) is WGN with standard deviation of 0.1. Each pattern of the training data file consists of 2 KLT coefficients of the exponential waveform as inputs and the corresponding values of A and T as the outputs. The program also generates the Cramer-Rao bounds on the training error.
▪ Source Code (WinZipped version) ▪ Training Files (WinZipped version) ▪ Sample Testing (WinZipped version)
IV Help
|
Click here to email any
questions or comments about this web site.
|