Time filter

Source Type

Zhang X.,Chinese Institute of Scientific and Technical Information
High Technology Letters | Year: 2012

A novel dynamic batch selective sampling algorithm based on version space analysis is presented. In the traditional batch selective sampling, example selection is entirely determined by the existing unreliable classification boundary; meanwhile, within a batch, examples labeled previously fail to provide instructive information for the selection of the rest. As a result, using the examples selected in batch mode for model refinement will jeopardize the classification performance. Based on the duality between feature space and parameter space under the SVM active learning framework, dynamic batch selective sampling is proposed to address the problem. We select a batch of examples dynamically, using the examples labeled previously as guidance for further selection. In this way, the selection of feedback examples is determined by both the existing classification model and the examples labeled previously. Encouraging experimental results demonstrate the effectiveness of the proposed algorithm. © by HIGH TECHNOLOGY LETTERS PRESS. Source

Zhang D.-g.,Tianjin University of Technology | Zhang X.-d.,Chinese Institute of Scientific and Technical Information
Enterprise Information Systems | Year: 2012

With the growth of the amount of information manipulated by embedded application systems, which are embedded into devices and offer access to the devices on the internet, the requirements of saving the information systemically is necessary so as to fulfil access from the client and the local processing more efficiently. For supporting mobile applications, a design and implementation solution of embedded un-interruptible power supply (UPS) system (in brief, EUPSS) is brought forward for long-distance monitoring and controlling of UPS based on Web. The implementation of system is based on ATmega161, RTL8019AS and Arm chips with TCP/IP protocol suite for communication. In the embedded UPS system, an embedded file system is designed and implemented which saves the data and index information on a serial EEPROM chip in a structured way and communicates with a microcontroller unit through I 2C bus. By embedding the file system into UPS system or other information appliances, users can access and manipulate local data on the web client side. Embedded file system on chips will play a major role in the growth of IP networking. Based on our experiment tests, the mobile users can easily monitor and control UPS in different places of long-distance. The performance of EUPSS has satisfied the requirements of all kinds of Web-based mobile applications. © 2012 Copyright Taylor and Francis Group, LLC. Source

Vaughan L.,University of Western Ontario | Yang R.,Chinese Institute of Scientific and Technical Information
Journal of the American Society for Information Science and Technology | Year: 2012

Earlier studies found that web hyperlink data contain various types of information, ranging from academic to political, that can be used to analyze a variety of social phenomena. Specifically, the numbers of inlinks to academic websites are associated with academic performance, while the counts of inlinks to company websites correlate with business variables. However, the scarcity of sources from which to collect inlink data in recent years has required us to seek new data sources. The recent demise of the inlink search function of Yahoo! made this need more pressing. Different alternative variables or data sources have been proposed. This study compared three types of web data to determine which are better as academic and business quality estimates, and what are the relationships among the three data sources. The study found that Alexa inlink and Google URL citation data can replace Yahoo! inlink data and that the former is better than the latter. Alexa is even better than Yahoo!, which has been the main data source in recent years. The unique nature of Alexa data could explain its relative advantages over other data sources. © 2012 ASIS & T. Source

Liping D.,Chinese Institute of Scientific and Technical Information
Applied Energy | Year: 2011

Energy is important for China and for the whole world. Previously, the huge investment in energy-related research and commercialisation made it possible for China to cooperate with its international partners in various channels, and programs involving international cooperation and co-published papers increased annually. In this paper, through the review of intergovernmental cooperation programs and bibliometric analysis of the top energy journals, it was found that: (1) intergovernmental cooperation and non-governmental cooperation are two effective channels for energy R&D. (2) In these two channels, most participants of international cooperation are universities and institutes, and the most important partner countries are the US, Japan, and European Countries. (3) Industries began to be involved in international cooperation gradually. (4) For different areas, the degree of cooperation is not the same. Some areas have been more fruitful in cooperation, some are just beginning hydrogen energy, fuel energy and applied energy are the main co-publication areas with Chinese involvement; while wind energy, solar energy, fuel cells and bio-energy are new areas for China and there has not been so much co-publication until now. © 2011 Elsevier Ltd. Source

Zeng W.,Chinese Institute of Scientific and Technical Information
Electronic Library | Year: 2012

Purpose - The paper aims to explore multilingual thesauri automation construction based on the freely available digital library resources. The key methods and study results are presented in the paper. It also proposes a way that terms are automatically extracted from multilingual parallel corpus. Design/methodology/approach - The study adopted the technology of natural language processing to analyze the linguistics characteristics of terms, and combined this with statistical analyses to extract the terms from technological documents. The methods consist of automatically extracting and filtering terms, judging and building relationship among terms, building the multilingual parallel corpus, and extracting term pairs between Chinese and foreign languages through calculating their associated probability. The experiments run on the Java test platform. Findings - The study obtains the following conclusions: finding the similarities and differences between the Chinese thesaurus standard and international thesaurus standard. The methods for automatically extracting terms and building relationships among them are presented. Eventually the multilingual terms' translation sets are generated based on real corpora. The results of the study show that the proposed methods can obtain better performance. The effect of automatic terms' translation alignment method is better than that of traditional IBM model method. Practical implications - The study results can provide references for further study and application of multilingual thesauri automation construction using Chinese as a pivot. Originality/value - The paper proposes new ideas on thesaurus automation construction in the digital age. The presented method based on linguistics and statistics is a new attempt. According to the experimental results, this exploration and study is innovative and valuable. In addition, these ideas and methods give a good start for improving information services of the PRC's National Science and Technology Digital Library. © Emerald Group Publishing Limited. Source

Discover hidden collaborations