
Pattern Recognition Letters

Volume 30, Issue 3, 1 February 2009, Pages 285-297

Separability of ternary codes for sparse designs of error-correcting output codes

https://doi.org/10.1016/j.patrec.2008.10.002

Abstract

Error-correcting output codes (ECOC) are a successful framework for dealing with multi-class categorization problems by combining binary classifiers. With the extension of the binary ECOC framework to ternary codes, new ECOC designs have been proposed to better adapt to the distribution of the data. To decode ternary matrices, recent works redefined several decoding strategies that were originally formulated for just two symbols. However, the coding step is also affected by the third symbol and therefore needs to be reconsidered. In this paper, we present a new formulation of the ternary ECOC distance and of the error-correcting capabilities in the ternary ECOC framework. Based on the new measure, we show how to design coding matrices that prevent codification ambiguity, and we propose a new sparse random coding matrix that maximizes the ternary distance. Results on a wide set of UCI Machine Learning Repository data sets and on a real speed traffic-sign categorization problem show that when the coding design satisfies the new ternary measures, a significant performance improvement is obtained independently of the decoding strategy applied.

Introduction

In the literature, one can find several powerful types of binary classifiers. However, many of these learning techniques cannot directly handle multi-class classification problems. Instead, it is common to construct classifiers that distinguish between just two classes and to combine them. In this sense, error-correcting output codes (ECOC) were introduced as a general framework for combining binary problems to address the multi-class problem. The strategy was introduced by Dietterich and Bakiri (1995). Based on error-correcting principles (Dietterich and Bakiri, 1995), and because of its ability to correct the bias and variance errors of the base classifiers (Kong and Dietterich, 1995), ECOC has been successfully applied to a wide range of Computer Vision applications, such as face recognition (Windeatt and Ardeshir, 2003), face verification (Kittler et al., 2001), text recognition (Ghani, 2001), and handwritten digit classification (Zhou and Suen, 2005).

The ECOC technique can be broken down into two general stages: encoding and decoding. Given a set of classes, the coding stage designs a codeword for each class based on different binary problems. The decoding stage makes a classification decision for a given test sample based on the value of the output code.

At the coding step, given a set of N classes to be learnt, n different bi-partitions (groups of classes) are formed, and n binary problems (dichotomizers) are trained. As a result, a codeword of length n is obtained for each class, where each bit of the code corresponds to the response of a given dichotomizer (coded by +1 or −1 according to its class set membership). Arranging the codewords as rows of a matrix, we define a coding matrix M, where M ∈ {−1, +1}^(N×n) in the binary case. The most well-known binary coding strategies are the one-versus-all strategy (Nilsson, 1965), where each class is discriminated against the rest of the classes, and the dense random strategy (Allwein et al., 2002), where a random matrix M is generated by maximizing the separability of rows and columns in terms of the Hamming distance (Dietterich and Bakiri, 1995). In Fig. 1a, the one-versus-all ECOC design for a 4-class problem is shown. The white regions of the coding matrix M correspond to positions coded by +1, and the black regions to −1. Thus, the codeword for class c1 is {+1, −1, −1, −1}. Each column j of the coding matrix codifies a binary problem learnt by its corresponding dichotomizer hj. For instance, dichotomizer h1 learns c1 against classes c2, c3, and c4; dichotomizer h2 learns c2 against classes c1, c3, and c4; etc. An example of a dense random matrix for a 4-class problem is shown in Fig. 1c.
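The two binary coding strategies above can be sketched in a few lines of NumPy. This is an illustrative sketch, not the authors' implementation: the function names are invented, and for the dense random strategy only the row-separability criterion is shown (Allwein et al. also consider column separability).

```python
import numpy as np

def one_vs_all_matrix(n_classes):
    """One-versus-all coding: N dichotomizers, each separating one
    class (+1) from the remaining classes (-1)."""
    M = -np.ones((n_classes, n_classes), dtype=int)
    np.fill_diagonal(M, 1)
    return M

def dense_random_matrix(n_classes, n_cols, n_trials=1000, seed=0):
    """Dense random coding (row-separability part only): draw random
    {-1,+1} matrices and keep the one maximizing the minimum pairwise
    Hamming distance between rows."""
    rng = np.random.default_rng(seed)
    best, best_sep = None, -1
    for _ in range(n_trials):
        M = rng.choice([-1, 1], size=(n_classes, n_cols))
        sep = min(
            np.sum(M[i] != M[j])
            for i in range(n_classes) for j in range(i + 1, n_classes)
        )
        if sep > best_sep:
            best, best_sep = M, sep
    return best

print(one_vs_all_matrix(4)[0])  # codeword for class c1: [ 1 -1 -1 -1]
```

For the one-versus-all design, the minimum Hamming distance between any two rows is always 2, which bounds its error-correcting capability.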

The coding step received special attention when Allwein et al. (2002) introduced a third symbol (the zero symbol) into the coding process. By allowing some classes to be ignored, this symbol increases the number of class partitions that can be considered in a ternary ECOC framework, and the coding matrix becomes M ∈ {−1, 0, +1}^(N×n). In this case, the zero symbol means that a particular class is not considered by a certain binary classifier. Thanks to this, strategies such as one-versus-one (Hastie and Tibshirani, 1998) and sparse random coding (Allwein et al., 2002) have been formulated in the ECOC framework. Fig. 1b shows the one-versus-one ECOC configuration for a 4-class problem; here, the grey positions correspond to the zero symbol. A possible sparse random matrix for a 4-class problem is shown in Fig. 1d. Recently, new improvements in ternary ECOC coding have demonstrated the suitability of the ECOC methodology for multi-class classification problems (Pujol et al., 2006, Escalera et al., 2007). These recent designs use knowledge of the problem domain to learn relevant binary problems from ternary codes. The basic idea of these methods is to use the training data to guide the training process and thus to construct the coding matrix M focusing on the binary problems that better fit the decision boundaries of a given data set.
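The one-versus-one design is the simplest ternary coding to write down: one column per pair of classes, with all other classes set to zero. A minimal sketch (function name is illustrative):

```python
import numpy as np
from itertools import combinations

def one_vs_one_matrix(n_classes):
    """One-versus-one ternary coding: one column per class pair (i, j),
    coded +1 for class i, -1 for class j, and 0 for the ignored classes."""
    pairs = list(combinations(range(n_classes), 2))
    M = np.zeros((n_classes, len(pairs)), dtype=int)
    for col, (i, j) in enumerate(pairs):
        M[i, col] = 1
        M[j, col] = -1
    return M

M = one_vs_one_matrix(4)
print(M.shape)  # (4, 6): N(N-1)/2 = 6 binary problems for N = 4
```

Note that every column contains exactly one +1, one −1, and N − 2 zeros, so each dichotomizer is trained on only two classes.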

The decoding step was originally based on error-correcting principles under the assumption that the learning task can be modeled as a communication problem in which class information is transmitted over a channel (Dietterich and Bakiri, 1995). During the decoding process, applying the n binary classifiers yields a code for each data point in the test set. This code is compared to the base codeword of each class defined in the matrix M, and the data point is assigned to the class with the closest codeword. The most frequently applied decoding strategies are the Hamming (HD) (Nilsson, 1965) and the Euclidean (ED) (Hastie and Tibshirani, 1998) decoding distances. With the introduction of the zero symbol, Allwein et al. (2002) showed the advantage of using a loss-based function of the output margin of the base classifier. Recently, Escalera et al. (2008) proposed a loss-weighted decoding strategy, where a set of probabilities based on the performances of the base classifiers is used to weight the final classification decision. In Fig. 1, each ECOC codification is used to classify an input object X. The input X is tested with each dichotomizer hj, obtaining an output Xj, j ∈ {1, ..., n}. The final code {X1, ..., Xn} of the test input X is used by a given decoding strategy to obtain the final classification decision. Note that in both the binary and the ternary ECOC frameworks, no position Xj of the test codeword can take the value zero, since the output of each dichotomizer is hj ∈ {−1, +1}; consequently, positions of M coded by zero automatically increase the distance/error.
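The Hamming and Euclidean decoding steps above can be sketched as follows. This is an illustrative implementation under the common generalization d(y, x) = Σ_j (1 − sign(y_j · x_j)) / 2 for the Hamming case, which makes the effect noted in the text explicit: a zero position in a codeword contributes 0.5 to the distance whatever the classifier outputs.

```python
import numpy as np

def hamming_decode(M, x):
    """Generalized Hamming decoding: for each row y of M, accumulate
    (1 - sign(y_j * x_j)) / 2 over positions. A zero entry in the
    codeword yields sign(0) = 0, contributing a fixed 0.5 penalty."""
    d = np.sum((1 - np.sign(M * x)) / 2, axis=1)
    return int(np.argmin(d))

def euclidean_decode(M, x):
    """Euclidean decoding: assign to the class whose codeword is
    nearest to the test code in L2 norm."""
    return int(np.argmin(np.linalg.norm(M - x, axis=1)))

# One-versus-all matrix for 4 classes and a test code from the dichotomizers:
M = np.array([[ 1, -1, -1, -1],
              [-1,  1, -1, -1],
              [-1, -1,  1, -1],
              [-1, -1, -1,  1]])
x = np.array([1, -1, -1, -1])
print(hamming_decode(M, x), euclidean_decode(M, x))  # both assign class 0 (c1)
```

For binary matrices the two decodings rank the classes identically; they differ once margins or zeros enter the codewords.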

To deal with multi-class categorization problems in the ternary ECOC framework, recent works redefined decoding strategies that were originally formulated for just two symbols (Escalera et al., 2008, Allwein et al., 2002). However, the influence of the zero symbol on the error-correcting capabilities and on the design of coding strategies has not been taken into account. In this paper, we formulate the ternary distance and the ternary error-correcting capabilities of the ternary ECOC framework. We propose a new sparse coding design based on maximizing the new ternary distance. We evaluate the methodology on a wide set of UCI Machine Learning Repository data sets and on a real Computer Vision problem: speed traffic-sign categorization. The results show that when the new ternary distance is considered in sparse designs, a significant performance improvement is obtained.
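The paper's exact ternary distance is defined in Section 2 and is not reproduced in this excerpt. As a rough illustration of why the binary Hamming measure breaks down, one can count disagreements only over positions where both codewords are nonzero, i.e. where some dichotomizer actually sees both classes. The function below is a hypothetical illustrative measure, not the authors' formulation:

```python
import numpy as np

def shared_position_distance(r1, r2):
    """Illustrative measure (not the paper's formula): count disagreements
    only where both codewords are nonzero. A result of 0 with no shared
    active position means no dichotomizer can tell the two classes apart,
    regardless of what the plain Hamming distance suggests."""
    active = (r1 != 0) & (r2 != 0)
    return int(np.sum(r1[active] != r2[active]))

# Two rows with no shared active dichotomizer: codification ambiguity.
r1 = np.array([1, -1, 0, 0])
r2 = np.array([0, 0, 1, -1])
print(shared_position_distance(r1, r2))  # 0 -> classes are indistinguishable
```

A naive Hamming count over all four positions would report these rows as well separated, which is precisely the kind of inconsistency a sparse design must avoid.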

The paper is organized as follows: Section 2 overviews the ECOC random designs and presents a new sparse coding design based on ternary distance maximization. Section 3 presents the experimental results. Finally, Section 4 concludes the paper.


Random ECOC designs

In this section, we overview both dense and sparse random ECOC designs (Allwein et al., 2002). We show the inconsistency of the classical sparse random design and introduce a new measure for sparse coding designs.

Results

Before presenting the results, we discuss the data, the comparatives, and the measurements used in the experiments.

  • Data: The data used for the experiments consists of 16 multi-class data sets from the UCI Machine Learning Repository database (Asuncion and Newman, 2007). The details of the data sets are shown in Table 1.

    We also use the video sequences obtained from a Mobile Mapping System (Casacuberta et al., 2004) to test the methods in a real traffic sign categorization problem.

  • Comparative: For the

Conclusions

In this paper, we introduced a new formulation of the ternary distance that defines class separability in the ternary ECOC framework. We showed that row separability in terms of the Hamming distance from the binary ECOC framework cannot be applied in the ternary case. Based on the new measure, we illustrated that the design of the standard sparse random strategy is inconsistent, and we presented a new sparse random construction. The results show that the new design applied with any

Acknowledgments

This work has been supported in part by projects TIN2006-15308-C02, FIS PI061290, and CONSOLIDER-INGENIO CSD 2007-00018.

References (20)

  • Allwein, E., Schapire, R., Singer, Y., 2002. Reducing multiclass to binary: A unifying approach for margin classifiers. ...
  • Asuncion, A., Newman, D., 2007. UCI Machine Learning Repository, University of California, Irvine, School of ...
  • Casacuberta, J., Miranda, J., Pla, M., Sanchez, S., Serra, A., Talaya, J., 2004. On the accuracy and performance of the ...
  • Dietterich, T., Bakiri, G., 1995. Solving multiclass learning problems via error-correcting output codes. J. Artif. Intell. Res.
  • Escalera, S., Pujol, O., Radeva, P., 2006. Decoding of ternary error correcting output codes. In: CIARP, vol. 4225, pp. ...
  • Escalera, S., Pujol, O., Radeva, P., 2007. Boosted landmarks of contextual descriptors and Forest-ECOC: A novel framework to detect and classify objects in cluttered scenes. Pattern Recognition Lett.
  • Escalera, S., Pujol, O., Radeva, P., 2008. Loss-weighted decoding for error-correcting output codes. In: Internat. ...
  • Friedman, J., Hastie, T., Tibshirani, R., 1998. Additive logistic regression: A statistical view of boosting. Ann. Statist.
  • Ghani, R., 2001. Combining labeled and unlabeled data for text classification with a large number of categories. In: ...
  • Hastie, T., Tibshirani, R., 1998. Classification by pairwise coupling. In: NIPS, vol. 26, ...
