Zhang J.,National Laboratory of Pattern Recognition |
Zhang D.,Toshiba China R and nter |
Hao J.,Toshiba China R and nter
IJCAI International Joint Conference on Artificial Intelligence | Year: 2015
Statistical machine translation models have made great progress in improving the translation quality. However, the existing models predict the target translation with only the source- and target-side local context information. In practice, distinguishing good translations from bad ones does not only depend on the local features, but also rely on the global sentence-level information. In this paper, we explore the source-side global sentence-level features for target-side local translation prediction. We propose a novel bilingually-constrained chunk-based convolutional neural network to learn sentence semantic representations. With the sentencelevel feature representation, we further design a feed-forward neural network to better predict translations using both local and global information. The large-scale experiments show that our method can obtain substantial improvements in translation quality over the strong baseline: the hierarchical phrase-based translation model augmented with the neural network joint model.
Wu B.,National Laboratory of Pattern Recognition |
Lyu S.,University at Albany |
Hu B.-G.,National Laboratory of Pattern Recognition |
Ji Q.,Rensselaer Polytechnic Institute
Proceedings of the IEEE International Conference on Computer Vision | Year: 2013
We describe a novel method that simultaneously clusters and associates short sequences of detected faces (termed as face track lets) in videos. The rationale of our method is that face track let clustering and linking are related problems that can benefit from the solutions of each other. Our method is based on a hidden Markov random field model that represents the joint dependencies of cluster labels and track let linking associations. We provide an efficient algorithm based on constrained clustering and optimal matching for the simultaneous inference of cluster labels and track let associations. We demonstrate significant improvements on the state-of-the-art results in face tracking and clustering performances on several video datasets. © 2013 IEEE.
Zhou G.,Central China Normal University |
He T.,Central China Normal University |
Zhao J.,National Laboratory of Pattern Recognition |
Hu P.,National Laboratory of Pattern Recognition
ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference | Year: 2015
Community question answering (cQA) has become an important issue due to the popularity of cQA archives on the web. This paper is concerned with the problem of question retrieval. Question retrieval in cQA archives aims to find the existing questions that are semantically equivalent or relevant to the queried questions. However, the lexical gap problem brings about new challenge for question retrieval in cQA. In this paper, we propose to learn continuous word embeddings with metadata of category information within cQA pages for question retrieval. To deal with the variable size of word embedding vectors, we employ the framework of fisher kernel to aggregated them into the fixedlength vectors. Experimental results on large-scale real world cQA data set show that our approach can significantly outperform state-of-The-Art translation models and topic-based models for question retrieval in cQA. © 2015 Association for Computational Linguistics.
Zhang X.,Wenzhou University |
Shi X.,National Laboratory of Pattern Recognition |
Hu W.,National Laboratory of Pattern Recognition |
Li X.,National Laboratory of Pattern Recognition |
Maybank S.,Birkbeck College
Neurocomputing | Year: 2011
The appearance model is an important issue in the visual tracking community. Most subspace-based appearance models focus on the time correlation between the image observations of the object, but the spatial layout information of the object is ignored. This paper proposes a robust appearance model for visual tracking which effectively combines the spatial and temporal eigen-spaces of the object in a tensor reconstruction way. In order to capture the variations in object appearance, an incremental updating strategy is developed to both update the eigen-space and mean of the object. Experimental results demonstrate that, compared with the state-of-the-art appearance models in the tracking literature, the proposed appearance model is more robust and effective. © 2011 Elsevier B.V.
Niu B.,National Laboratory of Pattern Recognition |
Cheng J.,National Laboratory of Pattern Recognition |
Bai X.,Beihang University |
Lu H.,National Laboratory of Pattern Recognition
Signal Processing | Year: 2013
Relevance feedback is an effective approach to improve the performance of image retrieval by leveraging the labeling of human. In order to alleviate the burden of labeling, active learning method has been introduced to select the most informative samples for labeling. In this paper, we present a novel batch mode active learning scheme for informative sample selection. Inspired by the method of graph propagation, we not only take the correlation between labeled samples and unlabeled samples, but the correlation among unlabeled samples taken into account as well. Especially, considering the unbalanced distribution of samples and the personalized feedback of human we propose an asymmetric propagation scheme to unify the various criteria including uncertainty, diversity and density into batch mode active learning in relevance feedback. Extensive experiments on publicly available datasets show that the proposed method is promising. © 2012 Elsevier B.V.