SAS Institute is a developer of analytics software based in Cary, North Carolina. SAS develops and markets a suite of analytics software , which helps manage, access, analyze and report on data to aid in decision-making. The company is the world's largest privately held software business and its software is used by most of the Fortune 500.SAS has developed a model workplace environment and benefits program designed to retain employees, allow them to focus on their work, and reduce operating costs. Professor Jeffrey Pfeffer from the Stanford Graduate School of Business estimated that the company saves $60–$80 million annually in expenses related to employee turnover. It provides on-site, subsidized or free healthcare, gyms, daycare and life counseling services.SAS Institute started as a project at North Carolina State University to create a "statistical analysis software" that was originally used primarily by agricultural departments at universities in the late 1960s. It became an independent, private business led by current CEO James Goodnight and three other project leaders from the university in 1976. SAS grew from $10 million in revenues in 1980 to $1.1 billion by 2000. A larger proportion of these revenues are spent on research and development than at most other software companies, at one point more than double the industry average. Wikipedia.
SAS Institute | Date: 2016-09-06
Exemplary embodiments are generally directed to methods, mediums, and systems for correcting censored or constrained historical data with various possible types of computing devices, including cloud-based devices, personal computing devices, and edge-based devices. The corrected data may be used in forecasting, for example to forecast demand for a limited resource. In some embodiments, the data is modeled at a higher level of granularity than an individual record. The aggregated demand may then be pro-rated over a group of categories or users where a given category of users that might be small or nonexistent over a certain time frame may be better accommodated. Moreover, it may be easier or more efficient to make assumptions and employ computing resources at the aggregate level.
An apparatus includes a processor component caused to: retrieve metadata of organization of data within a data set, and map data of organization of data blocks within a data file; receive indications of which node devices are available to perform a processing task with a data set portion; and in response to the data set including partitioned data, compare the quantities of available node devices and of the node devices last involved in storing the data set. In response to a match, for each map data map entry: retrieve a hashed identifier for a data sub-block, and a size for each of the data sub-blocks within the corresponding data block; divide the hashed identifier by the quantity of available node devices; compare the modulo value to a designation assigned to each of the available node devices; and provide a pointer to the available node device assigned the matching designation.
SAS Institute | Date: 2016-10-25
In a system automatically processing data from a first computing device for use on a second computing device, a registry file including a plurality of filename parameters is read. Each filename parameter identifies a matching filename pattern, an extract script indicator, and a read file indicator. The extract script indicator indicates an extract script for a file having a filename that matches the matching filename pattern. The read file indicator indicates how to read the file having the filename that matches the matching filename pattern. One parameter of the plurality of filename parameters is selected by matching a filename of a source file to the matching filename pattern of the one parameter. The associated extract script is selected and used to read data from the source file using the associated read file indicator and the read data is output to a different file and in a different format.
SAS Institute | Date: 2016-11-02
Various embodiments are generally directed to an apparatus, method and other techniques for receiving a request to generate a bootable image in a cloud-based computing environment, creating a block storage volume in the cloud-based computing environment in response to receiving the request, the block storage volume having one or more partitions. Further, an apparatus, method and so forth may include installing software comprising one or more files in a file system on the block storage volume in the cloud-based computing environment, creating a snapshot of the file system including the software in the cloud-based computing environment, and creating a bootable image from the snapshot of the file system in the cloud-based computing environment.
An apparatus comprising a processor component to: provide, to a control device, an indication of availability to perform a processing task with one or more data set portions as a node device; perform a processing task specified by the control device with the one or more data set portions; and request a pointer to a location at which to store the one or more data set portions as a data block within a data file. In response to the data set including partitioned data, for each data set portion, include a data sub-block size of the data set portion and a hashed identifier derived from a partition label of a partition in the request; receive, from the control device, the requested pointer to the location; and store each data set portion as a data sub-block within the data block starting at the location within the data file.
An apparatus comprising a processor component to: receive metadata of data organization within a data set; receive indications of which node devices will be storing the data set as multiple data blocks within a data file; and receive, from each node device, a pointer request to a location within the data file for storing a data set portion as a data block. In response to the data set including partitioned data, for each request for a pointer: determine the location within the data file; generate a map data map entry for the data block; generate therein a sub-block count of data sub-blocks within the data block; generate therein a sub-entry for each data sub-block including size and a hashed identifier derived from a partition label; and provide a pointer to the node device. In response to successful storage of all data blocks, store the map data in the data file.
SAS Institute | Date: 2014-05-06
Techniques to perform data reduction for statistical tests are described. An apparatus may comprise an evaluation component to receive a computational representation arranged to generate an approximate probability distribution for statistics of a statistical test, the computational representation to include a simulated data structure with information for estimated cumulative distribution function (CDF) curves for one or more parameter vectors of the statistical test, each parameter vector represented with a single point in a grid of points, the evaluation component to evaluate the simulated data structure to determine whether any points in the grid of points are removable from the simulated data structure with a target level of precision, and a data reduction generator to reduce the simulated data structure in accordance with the evaluation to produce a reduced simulated data structure having a smaller data storage size relative to the simulated data structure. Other embodiments are described and claimed.
SAS Institute | Date: 2014-05-06
Techniques to perform interpolation for statistical tests are described. An apparatus may comprise processor circuitry and a simulated data component for execution by the processor circuitry to generate simulated data for a statistical test, statistics of the statistical test based on parameter vectors to follow a probability distribution. The apparatus may further comprise a statistic simulator component for execution by the processor circuitry to simulate statistics for the parameter vectors from the simulated data, each parameter vector represented with a single point in a grid of points. The apparatus may further comprise a code generator component for execution by the processor circuitry to remove selective points from the grid of points to form a subset of points, and generate interpolation code to interpolate a statistic of the statistical test on any point. Other embodiments are described and claimed.
SAS Institute | Date: 2017-01-18
In a computing device supporting a failover in an event stream processing (ESP) system, an event block object is received. A first status of the computing device as active or standby is determined. When the first status is active, a second status of the computing device as newly active or not newly active is determined. Newly active is determined when the computing device is switched from a standby to an active status. When the second status is newly active, a last published event block object identifier that uniquely identifies a last published event block object is determined. A next event block object is selected from a non-transitory computer-readable medium accessible by the computing device. The next event block object has an event block object identifier that is greater than the determined last published event block object identifier. The selected next event block object is published to an out-messaging network device.
Agency: European Commission | Branch: H2020 | Program: MSCA-ITN-ETN | Phase: MSCA-ITN-2016 | Award Amount: 3.51M | Year: 2017
SOLUTION will provide research and training program for 14 early stage researchers (ESR) pursuing their PhD in various disciplines covering the broadly defined area of solid lubricant coatings. The project combines theoretical approaches represented by advanced nanoscale simulations, laboratory design and fabrication of novel solid lubricants supported by simulations, and the up-scaling of promising solutions and their application in selected emerging engineering applications. SOLUTION will link industries from various areas dealing with similar issues through intensive training and knowledge sharing. Three topics driven by industrial partners have been selected to demonstrate the added value of simultaneous development and training. The use of modern solid lubricants underlines the transformation of industry towards smart design, which is based on predictive models and cross-communication throughout the entire production chain. Fellows supported by the project will have a unique opportunity to gain competence ranging from simulation, characterization and processing, to industrial processes and entrepreneurship. Highly individualized multidisciplinary training reflecting actual market needs, together with scientific excellence, will generate an open-mind generation able to harvest multidisciplinary knowledge and to successfully face challenges represented by the design of competitive solid lubricants.