Ontotext is a Bulgarian software company headquartered in Sofia. It is the semantic technology branch of Sirma Group. Its main domain of activity is the development of software products and solutions based on the Semantic Web languages and standards, in particular RDF, OWL and SPARQL. Wikipedia.
Agency: Cordis | Branch: FP7 | Program: CP | Phase: ICT-2011.4.4 | Award Amount: 3.46M | Year: 2012
Non-relational data management is emerging as a critical need for the new data economy based on large, distributed, heterogeneous, and complexly structured data sets. This new data management paradigm also gives research results an opportunity to help young, innovative companies working on new RDF and graph data management technologies play a significant role in this new data economy.
Standards and benchmarking are two of the most important factors for the development of new information technology, yet there is still no comprehensive suite of benchmarks and benchmarking practices for RDF and graph databases, nor is there an authority for setting benchmark definitions and auditing official results. Without them, the future development and uptake of these technologies is at risk, because industry lacks clear, user-driven targets for performance and functionality.
The goal of the Linked Data Benchmark Council (LDBC) project is to create the first comprehensive suite of open, fair and vendor-neutral benchmarks for RDF/graph databases, together with the LDBC foundation, which will define processes for obtaining, auditing and publishing results. The core scientific innovation of LDBC is therefore to define meaningful benchmarks derived from actual usage scenarios combined with the technical insight of top database systems researchers and architects into the choke points of current technology. LDBC will bring together a broad community of researchers and RDF and graph database vendors to establish an independent authority, the LDBC foundation, responsible for specifying benchmarks, benchmarking procedures and verifying/publishing results. The forum created will become a long-surviving, industry-supported association similar to the TPC. Vendors and user organisations will participate in order to influence benchmark design and to make use of the obvious marketing opportunities.
Agency: Cordis | Branch: FP7 | Program: CP | Phase: ICT-2013.4.1 | Award Amount: 4.27M | Year: 2014
Social media poses three major computational challenges, dubbed by Gartner the 3Vs of big data: volume, velocity, and variety. Content analytics methods have faced additional difficulties, arising from the short, noisy, and strongly contextualised nature of social media. In order to address the 3Vs of social media, new language technologies have emerged, e.g. using locality sensitive hashing to detect breaking news stories from media streams (volume), predicting stock market movements from microblog sentiment (velocity), and recommending blogs and news articles based on user content (variety).
PHEME will focus on a fourth crucial, but hitherto largely unstudied, challenge: veracity. It will model, identify, and verify phemes (internet memes with added truthfulness or deception), as they spread across media, languages, and social networks.
PHEME will achieve this by developing novel cross-disciplinary social semantic methods, combining document semantics, a priori large-scale world knowledge (e.g. Linked Open Data) and a posteriori knowledge and context from social networks, cross-media links and spatio-temporal metadata. Key novel contributions are dealing with multiple truths, reasoning about rumour and the temporal validity of facts, and building longitudinal models of users, influence, and trust.
Results will be validated in two high-profile case studies: healthcare and digital journalism. The techniques will be generic with many business applications, e.g. brand and reputation management, customer relationship management, semantic search and knowledge management. In addition to its high commercial relevance, PHEME will also benefit society and citizens by enabling government organisations to keep track of and react to rumours spreading online.
PHEME addresses Objective ICT-2013.4.1 Content analytics and language technologies; a) cross-media analytics.
Agency: Cordis | Branch: H2020 | Program: RIA | Phase: INFRAIA-1-2014-2015 | Award Amount: 7.97M | Year: 2015
The European Holocaust Research Infrastructure (EHRI) project seeks to transform archival research on the Holocaust. The vision of EHRI is to integrate the data, services and expertise of existing Holocaust infrastructures on an unprecedented scale. It will give researchers from across the globe transnational and virtual access to the integrated infrastructure, and provide them with innovative digital tools and methods to (collaboratively) explore and analyse Holocaust sources. EHRI will thereby become an indispensable tool for the study of the Holocaust from a pan-European perspective. EHRI is based on an advanced community that has already achieved a significant co-ordination of its efforts, not least thanks to the activities undertaken during EHRI's first phase. The aim of the second phase is to further expand this community. The EHRI consortium includes 22 partners, spread across Europe and beyond. This consortium, as well as a network of regional contact points, enables EHRI to reach those regions where much valuable Holocaust source material is located, but where access has hitherto been problematic, especially in South-Eastern and Eastern Europe. EHRI includes measures to build capacity in such regions, thereby ensuring that institutions and people across Europe can contribute to, and make use of, the EHRI infrastructure. EHRI will continue to serve as a best practice model for other humanities projects, and its innovative approach to data integration, management and retrieval will have impact in the wider cultural and IT industries. Although EHRI is geared towards scholarly communities, open online availability of reliable Holocaust material is important for the larger public, as the Holocaust is deeply rooted in the development of European societies. European support for the study of this most traumatic historical event is essential to achieve a comprehensive approach to the history of the Holocaust as a shared European phenomenon.
Agency: Cordis | Branch: FP7 | Program: CP | Phase: ICT-2013.4.3 | Award Amount: 2.17M | Year: 2013
While in recent years a large number of datasets have been published as open (and often linked) data, applications utilizing these open and distributed data have been rather few. Reasons include, amongst others, the technical complexity and cost of publishing and providing access to the data, lack of monetization incentives on the provider side, and lack of simplified and unified solutions for data consumption in a multi-platform way. The DaPaaS project directly addresses these challenges by developing a software infrastructure combining Data-as-a-Service (DaaS) and Platform-as-a-Service (PaaS) for open data, with the aim of optimizing publication of Open Data and development of data applications. Addressing the data consumption aspect by developing novel cross-platform interfaces to data applications, DaPaaS extensively covers the life cycle of cost-efficient data publishing and consumption. Backed by the development of a methodology for data use in the DaPaaS infrastructure, the project will deliver an intuitive platform that simplifies data publication, as well as cross-platform data consumption, thus enabling a sustainable infrastructure for efficient and simplified reuse of open data. Core innovations include: an open DaaS and PaaS, unified Linked Data access, integrated DaaS and PaaS for open data, lowering the complexity of open data publishing and consumption for non-experts.
Sustainable exploitation of the project results is ensured through a strong participation of SMEs in the consortium. The participating SMEs are among the world's leading organizations in the field of Open Data, with strong knowledge transfer experience and unique technologies in Linked Data, Semantic Web, data integration and mobile development, with strong links across both public and private sectors, and are committed to a joint development of unique technologies for effectively and efficiently supporting the life cycle of reuse of open data.
Agency: Cordis | Branch: H2020 | Program: IA | Phase: ICT-15-2014 | Award Amount: 3.89M | Year: 2015
The overall objective of the KConnect project is to create a medical text Data-Value Chain with a critical mass of participating companies using cutting-edge commercial cloud-based services for multilingual Semantic Annotation, Semantic Search and Machine Translation of Electronic Health Records and medical publications. The commercial cloud-based services will be the result of productisation of the multilingual medical text processing tools developed in the Khresmoi FP7 project, allowing wide adoption of these tools by industry. The critical mass will be created by the KConnect Professional Services Community, which will consist of at least 30 companies by the end of the project. These companies will be trained to build solutions based on the KConnect Services, hence serving as multipliers for commercial exploitation of the KConnect services. The KConnect project will facilitate the straightforward adaptation of the commercialised services to new languages by providing toolkits enabling the adaptation to be done by people having a software engineering skillset, as opposed to the rarer language engineering skillset. The KConnect services will also be adapted to handle text in Electronic Health Records, which is particularly challenging due to misspellings, neologisms, organisation-specific acronyms, and heavy use of negation and hedging. The consortium is driven by a core group of four innovative SMEs following complementary business perspectives related to medical text analysis and search. These companies will build solutions for their customers based on KConnect technology. Two partners from the medical domain will use KConnect services to solve their medical record analysis challenges. Two highly used medical search portal providers will implement the KConnect services to innovate the services offered by their search portals. Through these search portals, the KConnect technologies will be used by over 1 million European citizens before the end of the project.