Larcher A.,Institute for Infocomm Research |
Bonastre J.-F.,University of Avignon |
Fauve B.,ValidSoft Ltd. |
Lee K.A.,Institute for Infocomm Research |
And 4 more authors.
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | Year: 2013
ALIZE is an open-source platform for speaker recognition. The ALIZE library implements a low-level statistical engine based on the well-known Gaussian mixture modelling. The toolkit includes a set of high level tools dedicated to speaker recognition based on the latest developments in speaker recognition such as Joint Factor Analysis, Support Vector Machine, i-vector modelling and Probabilistic Linear Discriminant Analysis. Since 2005, the performance of ALIZE has been demonstrated in series of Speaker Recognition Evaluations (SREs) conducted by NIST and has been used by many participants in the last NISTSRE 2012. This paper presents the latest version of the corpus and performance on the NIST-SRE 2010 extended task. Copyright © 2013 ISCA.
Urbain J.,University of Mons |
Cakmak H.,University of Mons |
Charlier A.,University of Mons |
Denti M.,Multitel |
And 2 more authors.
IEEE Journal on Selected Topics in Signal Processing | Year: 2014
This paper presents the adaptation of HMM-based speech synthesis to laughter signals. Acoustic laughter synthesis HMMs are built with only 3 minutes of laughter data. An evaluation experiment shows that the method achieves significantly better performance than previous works. In addition, the first method to generate laughter phonetic transcriptions from high-level signals (in our case, arousal signals) is described. This enables to generate new laughter phonetic sequences, that do not exist in the original data. The generated phonetic sequences are used as input for HMM synthesis and reach similar perceived naturalness as laughs synthesized from existing phonetic transcriptions. These methods open promising perspectives for the integration of natural laughs in man-machine interfaces. It could also be used for other vocalizations (sighs, cries, coughs, etc.). © 2014 IEEE.
Carincotte C.,Multitel |
Bremond F.,French Institute for Research in Computer Science and Automation
Proceedings - IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2010 | Year: 2010
In Intelligent Video Systems, most of the recent advanced performance evaluation metrics perform a stage of mapping data between the system results and ground truth. This paper aims to review these metrics using a proposed framework. It will focus on metrics for events detection, objects detection and objects tracking systems. © 2010 IEEE.
Simon C.,Communication and Remote Sensing Laboratory |
Meessen J.,Multitel |
De Vleeschouwer C.,Communication and Remote Sensing Laboratory
Multimedia Tools and Applications | Year: 2010
This paper presents a classifier-based approach to recognize dynamic events in video surveillance sequences. The goal of this work is to propose a flexible event recognition system that can be used without relying on a long-term explicit tracking procedure. It is composed of three stages. The first one aims at defining and building a set of relevant features describing the shape and movements of the foreground objects in the scene. To this aim, we introduce new motion descriptors based on space-time volumes. Second, an unsupervised learning-based method is used to cluster the objects, thereby defining a set of coarse to fine local patterns of features, representing primitive events in the video sequences. Finally, events are modeled as a spatio-temporal organization of patterns based on an ensemble of randomized trees. In particular, we want this classifier to discover the temporal and causal correlations between the most discriminative patterns. Our system is experimented and validated both on simulated and real-life data. © 2009 Springer Science+Business Media, LLC.
University of Limoges, Multitel and Leukos | Date: 2011-04-08
A supercontinuum source capable of emitting waves between the infrared range and the ultraviolet range, includes a pulsed laser source (