Time filter

Source Type

Fang J.,Beijing Institute of Technology | Tao X.,Beijing Institute of Technology | Tang Z.,Beijing Institute of Technology | Qiu R.,Beijing Institute of Technology | And 2 more authors.
Proceedings - 10th IAPR International Workshop on Document Analysis Systems, DAS 2012 | Year: 2012

Table detection is an important task in the field of document analysis. It has been extensively studied since a couple of decades. Various kinds of document mediums are involved, from scanned images to web pages, from plain texts to PDF files. Numerous algorithms published bring up a challenging issue: how to evaluate algorithms in different context. Currently, most work on table detection conducts experiments on their in-house dataset. Even the few sources of online datasets are targeted at image documents only. Moreover, Precision and recall measurement are usual practice in order to account performance based on human evaluation. In this paper, we provide a dataset that is representative, large and most importantly, publicly available. The compatible format of the ground truth makes evaluation independent of document medium. We also propose a set of new measures, implement them, and open the source code. Finally, three existing table detection algorithms are evaluated to demonstrate the reliability of the dataset and metrics. © 2012 IEEE.


PubMed | Founder Lab and Heinrich Heine University Düsseldorf
Type: Journal Article | Journal: Compendium of continuing education in dentistry (Jamesburg, N.J. : 1995) | Year: 2016

Dental erosion is a global oral health problem that can lead to significant functional and esthetic impairments of the affected patients. Treatment of severe cases with augmented loss of the vertical dimension of occlusion (VDO) represents a challenge for both the dental team and the patient. CAD/CAM technology was used in the presented case to analyze the interocclusal space. Based on a virtual wax-up of the final restorations, CAD/CAM-fabricated preparation splints served as a guide and ensured a most minimally invasive preparation design. Milled polymer provisionals enabled the patient to visualize the final treatment outcome and served as a fracture-resistant temporary restoration to test the increased VDO. Monolithic lithium-disilicate ceramic, defect-oriented restorations with reduced ceramic thickness enabled a functional and reliable reconstruction of the severely compromised dentition. This case report documents a practical, digital approach and discusses the advantages related to treatment time, ease of treatment, and predictability.


Xu C.,Beijing Institute of Technology | Xu C.,Founder Lab | Tang Z.,Beijing Institute of Technology | Tang Z.,Founder Lab | And 4 more authors.
Proceedings of SPIE - The International Society for Optical Engineering | Year: 2013

To increase the flexibility and enrich the reading experience of e-book on small portable screens, a graph based method is proposed to perform layout analysis on Portable Document Format (PDF) documents. Digital born document has its inherent advantages like representing texts and fractional images in explicit form, which can be straightforwardly exploited. To integrate traditional image-based document analysis and the inherent meta-data provided by PDF parser, the page primitives including text, image and path elements are processed to produce text and non text layer for respective analysis. Graph-based method is developed in superpixel representation level, and page text elements corresponding to vertices are used to construct an undirected graph. Euclidean distance between adjacent vertices is applied in a top-down manner to cut the graph tree formed by Kruskal's algorithm. And edge orientation is then used in a bottom-up manner to extract text lines from each sub tree. On the other hand, non-textual objects are segmented by connected component analysis. For each segmented text and non-text composite, a 13-dimensional feature vector is extracted for labelling purpose. The experimental results on selected pages from PDF books are presented. © 2013 SPIE SPIE-IS&T.


Xie H.,Beijing Institute of Technology | Xie H.,Founder Lab | Lu X.,Beijing Institute of Technology | Tang Z.,Beijing Institute of Technology | Ye M.,Beijing Founder Apabi Technology Ltd
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries | Year: 2016

The accuracy of the contents of a knowledge base determines the effectiveness of knowledge service applications, thus, it is necessary to evaluate the confidence of triples when a knowledge base is built. This study introduces a generic computational methodology to compute the confidence values of triples in knowledge bases and detect potentially incorrect ones for further verification. The major contributions of the proposed methodology are as follows: (1) A process to compute the confidence values of triples is designed; (2) New algorithms are proposed to adjust the term frequency and inverse document frequency values of each triple; (3) A method to build a support vector machine (SVM) classifier based on the selected triples used for incorrect triple detection is presented. © 2016 ACM.


Liu Y.,Beijing Institute of Technology | Liu Y.,Founder Lab | Lu X.,Beijing Institute of Technology | Xu J.,Founder Lab
2013 9th Asian Control Conference, ASCC 2013 | Year: 2013

Although lots of vehicle detection methods can implement vehicle detection with high performance, most of their application is confined by traffic scenes. The detection precision may change heavily with traffic congestion extent, illumination variance and vehicle moving speed. To overcome the problem of weak traffic scene adaptability, a robust vehicle detection method is proposed using the inter-relationship of consecutive multiframes. The changing of frame content is a process including abrupt and gradual variation caused by the objects' color and intensity changing. Thus, the local maxima of consecutive frames' objective function are constructed to determine the best vehicle detection frame. This function is invariant to traffic congestion and vehicle speed, and avoids vehicle segmentation from frames. For illumination invariance, traditional threshold method is substituted by peak searching method. Experiments show that the proposed method implements stably in different traffic scenes than traditional methods, and with the real-time performance and higher detection precision. © 2013 IEEE.


Xu X.,Beijing Institute of Technology | Xu X.,Founder Lab | Ye M.,Founder Lab | Tang Z.,Beijing Institute of Technology | And 3 more authors.
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | Year: 2015

With the coming of digital newspaper, user-oriented special topic generation becomes extremely urgent to satisfy the users’ requirements both functionally and emotionally. We propose an applicable automatic special topic generation system for digital newspapers based on users’ interests. Firstly, extract subject heading vector of the topic of interest by filtering out function words, localizing Latent Dirichlet Allocation (LDA) and training the LDA model. Secondly, remove semantically repetitive vector component by constructing a synonymy word map. Lastly, organize and refine the special topic according to the similarity between the candidate news and the topic, and the density of topic-related terms. The experimental results show that the system has both simple operation and high accuracy, and it is stable enough to be applied for user-oriented special topic generation in practical applications. © Springer International Publishing Switzerland 2015.


Xu C.,University of Science and Technology Beijing | Xu C.,Founder Lab | Tang Z.,University of Science and Technology Beijing | Tang Z.,Founder Lab | And 2 more authors.
Proceedings of SPIE - The International Society for Optical Engineering | Year: 2013

Converting the PDF books to re-flowable format has recently attracted various interests in the area of e-book reading. Robust graphic segmentation is highly desired for increasing the practicability of PDF converters. To cope with various layouts, a multi-layer concept is introduced to segment graphic composites including photographic images, drawings with text insets or surrounded with text elements. Both image based analysis and inherent digital born document advantages are exploited in this multi-layer based layout analysis method. By combining low-level page elements clustering applied on PDF documents and connected component analysis on synthetically generated PNG image document, graphic composites can be segmented for PDF documents with complex layouts. The experimental results on graphic composite segmentation of PDF document pages have shown satisfactory performance. © 2013 SPIE-IS&T.


Shi C.,University of Science and Technology Beijing | Xiao J.,University of Science and Technology Beijing | Jia W.,University of Science and Technology Beijing | Xu C.,University of Science and Technology Beijing | Xu C.,Founder Lab
Proceedings of SPIE - The International Society for Optical Engineering | Year: 2013

A framework is proposed in this paper to effectively generate a new hybrid character type by means of integrating local contour feature of Chinese calligraphy with structural feature of font in computer system. To explore traditional art manifestation of calligraphy, multi-directional spatial filter is applied for local contour feature extraction. Then the contour of character image is divided into sub-images. The sub-images in the identical position from various characters are estimated by Gaussian distribution. According to its probability distribution, the dilation operator and erosion operator are designed to adjust the boundary of font image. And then new Chinese character images are generated which possess both contour feature of artistical calligraphy and elaborate structural feature of font. Experimental results demonstrate the new characters are visually acceptable, and the proposed framework is an effective and efficient strategy to automatically generate the new hybrid character of calligraphy and font. © 2013 SPIE-IS&T.


Ye M.,Founder Lab | Tang Z.,Founder Lab | Xu J.,Founder Lab | Jin L.,Founder Lab
Information (Switzerland) | Year: 2015

Digital publishing resources contain a lot of useful and authoritative knowledge. It may be necessary to reorganize the resources by concepts and recommend the related concepts for e-learning. A recommender system is presented in this paper based on the semantic relatedness of concepts computed by texts from digital publishing resources. Firstly, concepts are extracted from encyclopedias. Information in digital publishing resources is then reorganized by concepts. Secondly, concept vectors are generated by skip-gram model and semantic relatedness between concepts is measured according to the concept vectors. As a result, the related concepts and associated information can be recommended to users by the semantic relatedness for learning or reading. History data or users' preferences data are not needed for recommendation in a specific domain. The technique may not be language-specific. The method shows potential usability for e-learning in a specific domain. © 2015 by the authors.


Loading Founder Lab collaborators
Loading Founder Lab collaborators