Greenplum

San Mateo, CA, United States

Soliman M.A.,Greenplum | Ilyas I.F.,University of Waterloo | Martinenghi D.,Polytechnic of Milan | Tagliasacchi M.,Polytechnic of Milan
Proceedings of the ACM SIGMOD International Conference on Management of Data | Year: 2011

Ranking queries report the top-K results according to a user-defined scoring function. A widely used scoring function is the weighted summation of multiple scores. Often, users cannot specify the weights in such functions precisely enough to produce the preferred order of results. Adopting uncertain/incomplete scoring functions (e.g., using weight ranges and partially specified weight preferences) can better capture users' preferences in this scenario. In this paper, we study two aspects of uncertain scoring functions. The first aspect is the semantics of ranking queries, and the second aspect is the sensitivity of computed results to refinements made by the user. We formalize and solve multiple problems under both aspects, and present novel techniques that compute query results efficiently to comply with the interactive nature of these problems. © 2011 ACM.
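To make the setting concrete, the following minimal sketch ranks records by a weighted sum and then, when each weight is only known to lie in a range, estimates how often each record lands in the top-K by sampling weight vectors. It is an illustration of the problem only, not the techniques proposed in the paper; the function names, the `hotels` data, and the sampling approach are all assumptions made for this example.

```python
import random

def weighted_score(record, weights):
    """Weighted summation of a record's per-attribute scores."""
    return sum(w * x for w, x in zip(weights, record))

def top_k(records, weights, k):
    """Names of the k highest-scoring records for one exact weight vector."""
    ranked = sorted(records,
                    key=lambda name: weighted_score(records[name], weights),
                    reverse=True)
    return ranked[:k]

def top_k_frequency(records, weight_ranges, k, samples=1000, seed=0):
    """Fraction of sampled weight vectors for which each record is in the top-k.

    Each weight is drawn uniformly from its (low, high) range. This is a
    simple Monte Carlo illustration of uncertain weights (hypothetical helper).
    """
    rng = random.Random(seed)
    counts = {name: 0 for name in records}
    for _ in range(samples):
        weights = [rng.uniform(low, high) for low, high in weight_ranges]
        for name in top_k(records, weights, k):
            counts[name] += 1
    return {name: count / samples for name, count in counts.items()}

# Hypothetical records with two scores each (higher is better).
hotels = {"A": (0.9, 0.2), "B": (0.5, 0.6), "C": (0.1, 0.9), "D": (0.1, 0.1)}

print(top_k(hotels, (0.7, 0.3), k=2))
print(top_k_frequency(hotels, [(0.2, 0.8), (0.2, 0.8)], k=2))
```

A record such as `D`, which is no better than the others on either score, never enters the top-2 for any positive weights, whereas membership of the remaining records depends on where in the ranges the weights fall.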


News Article | December 1, 2016
Site: www.newsmaker.com.au

According to Stratistics MRC, the Big Data Analytics & Hadoop Market accounted for $8.48 billion in 2015 and is expected to reach $99.31 billion by 2022, growing at a CAGR of 42.1% from 2015 to 2022. The rise of big data, the growing need for big data analytics, and rapid growth in consumer data are some of the factors fueling market growth. A lack of skilled workers and a lack of security features in the Hadoop framework are restraining it. Venture capital funding is the major opportunity for vendors in the big data analytics and Hadoop market. The consulting services segment commanded the market owing to the enterprise-wide implementation of this technology. Application software is the leading segment owing to rising deployment by developers building real-time applications. The storage segment leads the hardware market in terms of revenue. North America dominates the global market, while the Asia-Pacific region is expected to see significant growth owing to its large IT services industry.
Some of the key players in the market include Dell Inc., Karmasphere Inc., Talend, Inc., DataDirect Networks, Inc., Amazon Web Service LLC, Hortonworks, Inc., Appistry, Inc., NetApp, Inc., Teradata Corporation, Cloudera Inc., The Hewlett-Packard Company, Greenplum, Inc., Datameer, Inc., Zettaset, Inc., Fujitsu Ltd., Pentaho Corporation, DataStax, Inc., Platform Computing, HStreaming LLC, MapR Technologies, Inc., IBM and Hadapt Inc.

End Users Covered:
• Retail
• Healthcare & Life Sciences
• Banking, Financial Service & Insurance
• Government & Public Utilities
• Bioinformatics
• Web
• IT & Security
• Manufacturing
• Transportation
• Media & Entertainment
• Gaming
• University Research & Education
• Telecommunication
• Natural Resources
• Other End User

Regions Covered:
• North America
  o US
  o Canada
  o Mexico
• Europe
  o Germany
  o France
  o Italy
  o UK
  o Spain
  o Rest of Europe
• Asia Pacific
  o Japan
  o China
  o India
  o Australia
  o New Zealand
  o Rest of Asia Pacific
• Rest of the World
  o Middle East
  o Brazil
  o Argentina
  o South Africa
  o Egypt

What our report offers:
- Market share assessments for the regional and country level segments
- Market share analysis of the top industry players
- Strategic recommendations for the new entrants
- Market forecasts for a minimum of 7 years of all the mentioned segments, sub segments and the regional markets
- Market Trends (Drivers, Constraints, Opportunities, Threats, Challenges, Investment Opportunities, and recommendations)
- Strategic recommendations in key business segments based on the market estimations
- Competitive landscaping mapping the key common trends
- Company profiling with detailed strategies, financials, and recent developments
- Supply chain trends mapping the latest technological advancements

About Us
Wise Guy Reports is part of Wise Guy Consultants Pvt. Ltd. and offers premium progressive statistical surveying, market research reports, analysis & forecast data for industries and governments around the globe. Wise Guy Reports understands how essential statistical surveying information is for your organization or association. Therefore, we have associated with the top publishers and research firms, all specialized in specific domains, ensuring you will receive the most reliable and up-to-date research data available.
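As a quick consistency check on the figures quoted above, the compound annual growth rate (CAGR) implied by the 2015 and 2022 market sizes can be recomputed directly. This is a minimal sketch using the standard CAGR formula; the helper names `cagr` and `project` are chosen for this example and do not come from the report.

```python
def cagr(start_value, end_value, years):
    """Compound annual growth rate: (end / start) ** (1 / years) - 1."""
    return (end_value / start_value) ** (1 / years) - 1

def project(start_value, rate, years):
    """Value after compounding `rate` annually for `years` years."""
    return start_value * (1 + rate) ** years

years = 2022 - 2015                      # 7 compounding periods
implied = cagr(8.48, 99.31, years)       # market sizes in billions of USD
print(f"implied CAGR: {implied:.1%}")
print(f"2022 projection at 42.1%: ${project(8.48, 0.421, years):.2f} billion")
```

The implied rate rounds to the quoted 42.1%, and compounding $8.48 billion at that rate for seven years lands within rounding distance of the quoted $99.31 billion.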


Miner D.,Greenplum
ACM International Conference Proceeding Series | Year: 2012

Greenplum is using Hadoop and several other open source tools in interesting ways as part of a big data architecture with their Greenplum Database (a scale-out MPP SQL database). Copyright is held by author/owner(s).


Li K.,University of Florida | Grant C.,University of Florida | Wang D.Z.,University of Florida | Khatri S.,Greenplum | Chitouras G.,Greenplum
Proceedings of the 2nd Workshop on Data Analytics in the Cloud, DanaC 2013 - In Conjunction with ACM SIGMOD/PODS Conference | Year: 2013

Many companies keep large amounts of text data inside relational databases. Several challenges exist in using state-of-the-art systems to perform analysis on such datasets. First, an expensive big data transfer cost must be paid up front to move data between databases and analytics systems. Second, many popular text analytics packages do not scale up to production-sized datasets. In this paper, we introduce GPText, Greenplum's parallel statistical text analysis framework, which addresses the above problems by supporting statistical inference and learning algorithms natively in a massively parallel processing database system. GPText seamlessly integrates the Solr search engine and applies statistical algorithms such as k-means and LDA using MADlib, an open source library for scalable in-database analytics which can be installed on PostgreSQL and Greenplum. In addition, GPText also developed and contributed a linear-chain conditional random field (CRF) module to MADlib to enable information extraction tasks such as part-of-speech tagging and named entity recognition. We show the performance and scalability of the parallel CRF implementation. Finally, we describe an eDiscovery application built on the GPText framework. Copyright © 2013 ACM.


Raghavan V.,Greenplum | Raghavan V.,Worcester Polytechnic Institute | Rundensteiner E.A.,Worcester Polytechnic Institute | Srivastava S.,Worcester Polytechnic Institute
Information Systems | Year: 2011

Growing interest in multi-criteria decision support applications has resulted in a flurry of efficient skyline algorithms. In practice, real-world decision support applications need to access data from disparate sources. Existing techniques define the skyline operation to work on a single set, and therefore treat skylines as an add-on on top of a traditional Select-Project-Join query plan. In many real-world applications, the skyline dimensions can be anti-correlated, such as the attribute pair (price, mileage) for cars and (price, distance) for hotels. Anti-correlated data are particularly challenging for skyline evaluation and therefore have commonly been ignored by existing techniques. In this work, we propose a robust execution framework called SKIN to evaluate skyline over joins. The salient features of SKIN are that it is (a) effective in reducing the two primary costs, namely the cost of generating the join results and the cost of dominance comparisons to compute the final skyline of join results, and (b) shown to be robust for both skyline-friendly (independent and correlated) and skyline-unfriendly (anti-correlated) data distributions. SKIN exploits skyline knowledge both locally within individual data sources and across disparate sources to significantly reduce the above-mentioned costs incurred during the evaluation of skyline over join. Our experimental study demonstrates the superiority of our proposed approach over state-of-the-art techniques in handling a wide variety of data distributions. © 2011 Elsevier B.V.
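For readers unfamiliar with the operation, the sketch below shows the naive "join first, then skyline" plan that the abstract describes as the traditional add-on approach, which pays both the full join cost and the full dominance-comparison cost. It is not the SKIN framework; the relation layout, the summing of attributes across joined rows, and all names are assumptions made for this illustration. Lower values are treated as better in every dimension.

```python
def dominates(a, b):
    """True if a is no worse than b in every dimension and strictly better in one."""
    return (all(x <= y for x, y in zip(a, b))
            and any(x < y for x, y in zip(a, b)))

def skyline(points):
    """Points not dominated by any other point (quadratic-time baseline)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

def join_then_skyline(left, right):
    """Join two relations on their first field, then compute the skyline.

    Each row is (key, price, time); a joined tuple sums price and time
    across the two sources (a hypothetical aggregation for this example).
    """
    joined = [(lp + rp, lt + rt)
              for lk, lp, lt in left
              for rk, rp, rt in right
              if lk == rk]
    return skyline(joined)

# Hypothetical data: two sources sharing a city key.
left = [("BOS", 100, 5), ("BOS", 80, 9), ("NYC", 120, 4)]
right = [("BOS", 50, 3), ("BOS", 70, 2), ("NYC", 30, 6)]

print(join_then_skyline(left, right))  # [(150, 8), (170, 7), (130, 12)]
```

Here five join results are materialized and compared pairwise even though two of them are dominated; reducing exactly these join and comparison costs is the problem the abstract sets out to address.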


Hellerstein J.M.,University of California at Berkeley | Re C.,U. Wisconsin | Schoppmann F.,Greenplum | Wang D.Z.,U. Florida | And 7 more authors.
Proceedings of the VLDB Endowment | Year: 2012

MADlib is a free, open-source library of in-database analytic methods. It provides an evolving suite of SQL-based algorithms for machine learning, data mining and statistics that run at scale within a database engine, with no need for data import/export to other tools. The goal is for MADlib to eventually serve a role for scalable database systems that is similar to the CRAN library for R: a community repository of statistical methods, this time written with scale and parallelism in mind. In this paper we introduce the MADlib project, including the background that led to its beginnings, and the motivation for its open-source nature. We provide an overview of the library's architecture and design patterns, and provide a description of various statistical methods in that context. We include performance and speedup results of a core design pattern from one of those methods over the Greenplum parallel DBMS on a modest-sized test cluster. We then report on two initial efforts at incorporating academic research into MADlib, which is one of the project's goals. MADlib is freely available at http://madlib.net, and the project is open for contributions of both new methods and ports to additional database platforms. © 2012 VLDB Endowment.


News Article | October 28, 2016
Site: www.prweb.com

Powered by the open source Greenplum Database®, Pivotal Greenplum® is a commercial, fully featured data warehouse that is now available on the AWS Marketplace. The BYOL Pivotal Greenplum® distribution allows you to use your Pivotal Greenplum® license to run your database in the cloud the same way you run it on-prem. BYOL Pivotal Greenplum® on AWS is geared towards big data analytics and has many data science features that provide powerful and rapid analytics on petabyte-scale data volumes. zData’s Marketplace product for the BYOL Pivotal Greenplum® distribution offers licensed support from Pivotal®, along with support from zData’s Managed Services team. The product is integrated with zData’s new BURST platform, which offers self-service management augmented by zData’s operations and support staff. This new platform features integrated alerts, data science, dashboards and security on top of the support that you already receive from your Pivotal® license. With BURST, BYOL Pivotal Greenplum® on AWS becomes a “click-to-load” solution that allows organizations to start validating or realizing the benefits of Greenplum® in the cloud in just minutes.
“We are excited to introduce BYOL Pivotal Greenplum® into the AWS Marketplace. This is an exciting opportunity for Pivotal Greenplum® license holders to easily implement Pivotal Greenplum® in the cloud with additional “built in” support from zData’s Managed Services team,” said Dillon Woods, zData CTO.
zData is focused on providing available and stable big data platforms on-prem and in the cloud. zData works to support and implement the newest cloud native applications, allowing company decision makers to quickly and affordably perform complex analysis of terabytes of data to accelerate and improve business decisions.


Paris, New York, February 17, 2017 - Atos, a global leader in digital services, is expanding its expertise in Big Data services with the acquisition of zData, a leader in Big Data consulting and solutions for both commercial and enterprise corporations. Atos has signed a share purchase agreement with zData, bringing in a unique team of software engineers and data scientists to support its customers' digital transformation journeys in all sectors. This strategic acquisition brings a new level of scalability, reliability and performance, giving enterprises all the benefits of the open-source software framework Hadoop through the world's most advanced turnkey Hadoop solution for critical production workloads. zData works with the industry's best software providers for on-site and off-site consulting, with expertise ranging from Greenplum to Hadoop and PIVOTAL HDB (HAWQ). "We are pleased to welcome zData to the Atos team and look forward to offering our customers the right blueprint in their cloud application development needs leveraging zData's PIVOTAL Cloud Foundry experience," said Jerome Sandrini, Atos Vice President and Head of Big Data, North American Operations. "zData's Hadoop experts and data scientists combined with Atos' cognitive solutions will enable Atos to accelerate the deployment of its Big Data and Atos Codex solutions in North America, further strengthening its ability to guide customers through their digital transformation journey." zData's team of experts and innovative capabilities are fully aligned with Atos' Big Data and Atos Codex expansion strategy, notably in the U.S. Atos Codex offers organizations a fast and cost-efficient means to exploit the value of their existing data combined with external data. In this new landscape, the ability to derive insight from massive volumes of structured and unstructured data will be made possible by systems that are able to learn as they perform.
Atos Codex gives customers the techniques, tools and processes they need to make this business-changing step from Business Intelligence to agile analytics. Atos SE (Societas Europaea) is a leader in digital transformation with circa 100,000 employees in 72 countries and pro forma annual revenue of circa € 12 billion. Serving a global client base, the Group is the European leader in Big Data, Cybersecurity and Digital Workplace, and provides Cloud services, Infrastructure & Data Management, Business & Platform solutions, as well as transactional services through Worldline, the European leader in the payment industry. With its cutting-edge technologies, digital expertise and industry knowledge, the Group supports the digital transformation of its clients across different business sectors: Defense, Financial Services, Health, Manufacturing, Media, Utilities, Public sector, Retail, Telecommunications, and Transportation. The Group is the Worldwide Information Technology Partner for the Olympic & Paralympic Games and is listed on the Euronext Paris market. Atos operates under the brands Atos, Atos Consulting, Atos Worldgrid, Bull, Canopy, Unify and Worldline.


News Article | February 17, 2017
Site: globenewswire.com

Paris, New York, February 17, 2017 - Atos, an international leader in digital transformation, is expanding its expertise in Big Data services with the acquisition, through a share purchase agreement, of zData, a leader in Big Data consulting and solutions for enterprises. This acquisition brings Atos an outstanding team of software engineers and data scientists to support its customers throughout their digital transformation in all industries. This strategic acquisition offers a new level of scalability, stability and performance, giving enterprises all the benefits of the open source Hadoop platform, the world's most complete turnkey solution for critical production workloads. zData works with the best on-site and off-site software providers, from Greenplum to Hadoop and PIVOTAL HDB (HAWQ). "We are delighted to welcome zData to the Atos family. We will offer our customers the ideal solution for their cloud application development needs, drawing on zData's extensive experience with the PIVOTAL Cloud Foundry platform," said Jérôme Sandrini, Atos Vice President and Head of Big Data in North America. "By combining zData's Hadoop experts and data scientists with our cognitive solutions, we will accelerate the deployment of our Big Data offerings and our Atos Codex range in North America, further strengthening our ability to guide our customers throughout their digital transformation." With its team of experts and its capacity for innovation, zData is fully aligned with the group's growth strategy for its Big Data and Atos Codex offerings, particularly in the United States. Atos Codex makes it possible to exploit the value of companies' existing data quickly and cost-effectively and to combine it with external data.
Atos SE (Société Européenne) is a leader in digital services with pro forma annual revenue of about 12 billion euros and about 100,000 employees in 72 countries. Atos provides its customers worldwide with consulting and systems integration, managed services, Big Data and security, cloud operations, and transactional services through Worldline, the European leader in payment services. Thanks to its technological expertise and deep industry knowledge, Atos serves customers in various sectors: Defense, Financial Services, Health, Manufacturing, Media, Utilities, Public Sector, Retail, Telecommunications, and Transportation. Atos deploys the technologies that accelerate its customers' development and help them realize their vision of the enterprise of the future. Atos is the Worldwide Information Technology Partner for the Olympic and Paralympic Games. The Group is listed on the Euronext Paris market and operates under the brands Atos, Bull, Canopy, Worldline, Atos Consulting, Atos Worldgrid and Unify.
