Burlington, MA, United States

Nuance Communications is an American multinational computer software technology corporation, headquartered in Burlington, Massachusetts, United States, a suburb of Boston, that provides speech and imaging applications. Current business products focus on server & embedded speech recognition, telephone call steering systems, automated telephone directory services, medical transcription software & systems, optical character recognition software, and desktop imaging software. The company also maintains a small division which does software and system development for military and government agencies. In October 2011, unconfirmed research suggested that its servers power Apple's iPhone 4S Siri voice recognition application. As of 2008, the company is a result of organic growth, mergers, and acquisitions. ScanSoft and Nuance merged in October 2005; before the merger, the two companies competed in the commercial large-scale speech application business. The officially termed "merger" was a de facto acquisition of Nuance by ScanSoft, though the combined company changed its name to Nuance following the transaction. Before 1999, ScanSoft was known as Visioneer, a hardware and software scanner company. In 1999, Visioneer bought ScanSoft, a Xerox spin-off, and adopted ScanSoft as the company name. The original ScanSoft had its roots in Kurzweil Computer Products, a software company that developed the first omni-font character recognition system. Wikipedia.


Patent
Nuance Communications | Date: 2015-07-31

In a contact center, agents can be distracted and delayed from communicating with customers by searching for articles relevant to the customer's communication. A system automatically performing searches based on the customer's communication can therefore aid the agent. In an embodiment, a method of improving agent interaction with a user at a contact center includes determining, at a contact center, an intent of a portion of a received input from a user based on an established context of an application domain of the contact center. The method further includes mapping the determined intent to an element of an unstructured knowledge base stored in a memory. The method further includes presenting, to an agent at the contact center via a display, automated search results having the element of the unstructured knowledge base. Therefore, the agent can quickly access search results relevant to the call and application domain.


Patent
Nuance Communications | Date: 2015-08-20

According to some aspects, a method of classifying speech recognition results is provided, using a neural network comprising a plurality of interconnected network units, each network unit having one or more weight values. The method comprises using at least one computer to perform acts of: providing a first vector as input to a first network layer comprising one or more network units of the neural network; transforming, by a first network unit of the one or more network units, the input vector to produce a plurality of values, the transformation being based at least in part on a plurality of weight values of the first network unit; sorting the plurality of values to produce a sorted plurality of values; and providing the sorted plurality of values as input to a second network layer of the neural network.
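
The sequence of acts in this abstract (transform, sort, pass on) can be illustrated with a minimal sketch. The abstract does not say what the transformation is, which sort order is used, or what the second layer computes, so the dot-product transform, the ascending sort, and all function names below are assumptions made only for illustration.

```python
def transform(unit_weights, vector):
    """Assumed transform: each weight row of the unit yields one value (a dot product)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in unit_weights]


def sorted_layer(unit_weights, vector):
    """Transform the input vector, then sort the resulting values (ascending is an assumption)."""
    return sorted(transform(unit_weights, vector))


def second_layer(weights, values):
    """Hypothetical second layer: a single weighted sum over the sorted values."""
    return sum(w * v for w, v in zip(weights, values))


# Illustrative weights and input, not taken from the source.
first_unit = [[1.0, 0.0], [0.0, 2.0], [-1.0, 1.0]]
x = [3.0, 1.0]

hidden = sorted_layer(first_unit, x)             # sorted values of the first unit
score = second_layer([0.5, 0.25, 0.25], hidden)  # input to the next layer is the sorted list
```

Because the values are sorted before the next layer sees them, the second layer's weights attach to ranks (smallest, middle, largest) rather than to fixed positions in the first unit's output.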


Systems and methods are provided for detecting numerical text strings within a text string and converting those numerical text strings into digit strings. The digit strings may be reflected in real time, such as when the user is typing a text message. If more than one possible format of the digit string is determined, the system may then provide the various formats for selection. Once the proper format for the digit string is determined, that digit string may replace the numerical string previously detected in the text string. The text-to-digit conversion and associated formatting expedite user text entry such that the user is not required to switch keyboard views (e.g., virtual keyboards). Additionally, converting to digit strings compresses message length and provides other benefits.
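
As a rough illustration of the conversion described above, the sketch below detects runs of English number words in a message, replaces them with digits, and lists alternative formats when more than one exists. The vocabulary, the function names, and the choice of comma grouping as the alternative format are all assumptions; the abstract does not describe the actual detection or formatting rules.

```python
UNITS = {w: i for i, w in enumerate(
    "zero one two three four five six seven eight nine ten eleven twelve "
    "thirteen fourteen fifteen sixteen seventeen eighteen nineteen".split())}
TENS = {w: 10 * (i + 2) for i, w in enumerate(
    "twenty thirty forty fifty sixty seventy eighty ninety".split())}
NUMBER_WORDS = set(UNITS) | set(TENS) | {"hundred", "thousand"}


def words_to_number(words):
    """Convert a run of number words (below one million) to an integer."""
    total, current = 0, 0
    for w in words:
        if w in UNITS:
            current += UNITS[w]
        elif w in TENS:
            current += TENS[w]
        elif w == "hundred":
            current = max(current, 1) * 100
        elif w == "thousand":
            total += max(current, 1) * 1000
            current = 0
    return total + current


def convert_number_words(text):
    """Replace each run of number words in the text with a digit string."""
    out, run = [], []
    for token in text.split() + [""]:  # the empty sentinel flushes a trailing run
        parts = token.lower().replace("-", " ").split()
        if parts and all(p in NUMBER_WORDS for p in parts):
            run.extend(parts)
            continue
        if run:
            out.append(str(words_to_number(run)))
            run = []
        if token:
            out.append(token)
    return " ".join(out)


def candidate_formats(number):
    """Offer a grouped alternative when it differs from the plain digit string."""
    plain, grouped = str(number), f"{number:,}"
    return [plain] if plain == grouped else [plain, grouped]


message = convert_number_words("Meet me at gate twenty-three in five minutes")
formats = candidate_formats(words_to_number(["one", "thousand", "five", "hundred"]))
```

This sketch treats adjacent number words as a single number, so a spoken digit sequence such as "one two three" would be summed rather than read as 123; a real system would need to handle that ambiguity, which is one place where offering several formats becomes useful.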


Patent
Nuance Communications | Date: 2015-08-25

Aspects described herein are directed towards methods, computing devices, systems, and computer-readable media that apply scattering operations to extracted visual features of audiovisual input to generate predictions regarding the speech status of a subject. Visual scattering coefficients generated according to one or more aspects described herein may be used as input to a neural network operative to generate the predictions regarding the speech status of the subject. Predictions generated based on the visual features may be combined with predictions based on audio input associated with the visual features. In some embodiments, the extracted visual features may be combined with the audio input to generate a combined feature vector for use in generating predictions.


Patent
Nuance Communications | Date: 2015-08-27

Techniques for automatically identifying a speaker in a conversation as a known person based on processing of audio of the speaker's voice to extract characteristics of that voice and on an automated comparison of those characteristics to known characteristics of the known person's voice. A speaker segmentation process may be performed on audio of the conversation to produce, for each speaker in the conversation, a segment that includes the audio of that speaker. Audio of each of the segments may then be processed to extract characteristics of that speaker's voice. The characteristics derived from each segment (and thus for multiple speakers) may then be compared to characteristics of the known person's voice to determine whether the speaker for that segment is the known person. For each segment, a degree of match between the voice characteristics of the speaker and the voice characteristics of the known person may be calculated.
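
The per-segment comparison can be sketched as follows. The abstract does not say how voice characteristics are represented or how the degree of match is computed, so this example assumes each segment has already been reduced to a numeric feature vector and uses cosine similarity as a stand-in measure; the threshold value and all names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Degree of match between two feature vectors, in [-1, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_segments(segment_features, known_features, threshold=0.9):
    """Score every speaker segment against the known person's features.

    Returns {segment_id: (score, is_known_person)}; the threshold is an
    assumed decision rule, not something stated in the source.
    """
    results = {}
    for segment_id, features in segment_features.items():
        score = cosine_similarity(features, known_features)
        results[segment_id] = (score, score >= threshold)
    return results


# Illustrative feature vectors, one per segmented speaker.
segments = {"speaker_a": [1.0, 0.0], "speaker_b": [0.0, 1.0]}
known_person = [2.0, 0.0]
matches = match_segments(segments, known_person)
```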


Patent
Nuance Communications | Date: 2017-03-08

Techniques for training a natural language understanding (NLU) engine may include generating a first annotation of free-form text documenting a healthcare patient encounter and a link between the first annotation and a corresponding portion of the text, using the NLU engine. A second annotation of the text and a link between the second annotation and a corresponding portion of the text may be received from a human user. The first annotation and its corresponding link may be merged with the second annotation and its corresponding link. Training data may be provided to the engine in the form of the text and the merged annotations and links.
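
A minimal sketch of the merge step, assuming each annotation is a label linked to a character span of the text. The abstract does not state how conflicts between engine and human annotations are resolved; letting the human annotation win on an identical span is an assumption, as are the field names.

```python
def merge_annotations(engine_annotations, human_annotations):
    """Merge two annotation lists keyed by their linked text span.

    Assumption: when both sources annotate the same span, the human
    annotation replaces the engine's.
    """
    merged = {(a["start"], a["end"]): a for a in engine_annotations}
    for a in human_annotations:
        merged[(a["start"], a["end"])] = a
    return [merged[span] for span in sorted(merged)]


text = "Patient reports chest pain and nausea."
engine = [{"label": "symptom", "start": 16, "end": 26}]   # "chest pain"
human = [
    {"label": "problem", "start": 16, "end": 26},          # corrects the engine's label
    {"label": "symptom", "start": 31, "end": 37},          # "nausea", missed by the engine
]

# Training data in the form of the text plus the merged annotations and links.
training_example = {"text": text, "annotations": merge_annotations(engine, human)}
```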


Patent
Nuance Communications | Date: 2017-03-08

Techniques are provided whereby a clarification request may be generated with a clinical documentation improvement (CDI) system for resolution by a clinician, and notification of the clarification request may be transmitted to a medical coding system. At a medical coding system, notification may be received of a clarification request generated at a CDI system for resolution by a clinician. In some embodiments, the medical coding system may be a computer-assisted coding (CAC) system.


Technology is described for developing a language model for a language recognition system from scratch based on aggregating and analyzing text input from multiple users of the language. The technology allows a user to select a language, and if no existing language model is available for the selected language, provides a new language model for the selected language, monitors and collects information about the use of words in the selected language, combines information collected from multiple users of the selected language, and updates the user's language model based on the combined information from multiple users of the selected language.
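
The collect, combine, and update steps can be sketched with simple word counts. The abstract does not specify what usage information is collected or what form the language model takes; a unigram relative-frequency model and the function names below are assumptions chosen to keep the example small.

```python
from collections import Counter


def collect_usage(text):
    """Count word usage in one user's text input."""
    return Counter(text.lower().split())


def combine_usage(per_user_counts):
    """Combine usage information collected from multiple users."""
    combined = Counter()
    for counts in per_user_counts:
        combined.update(counts)
    return combined


def update_language_model(model, combined):
    """Return a copy of the model with word probabilities set from combined counts."""
    total = sum(combined.values())
    updated = dict(model)
    for word, count in combined.items():
        updated[word] = count / total
    return updated


# A new, empty model for a language with no existing model.
new_model = {}
inputs = ["habari yako", "habari za asubuhi"]  # illustrative text from two users
combined = combine_usage(collect_usage(t) for t in inputs)
model = update_language_model(new_model, combined)
```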


Patent
Nuance Communications | Date: 2016-08-16

According to some aspects, a method of monitoring an acoustic environment of a mobile device, at least one computer readable medium encoded with instructions that, when executed, perform such a method, and/or a mobile device configured to perform such a method are provided. The method comprises receiving acoustic input from the environment of the mobile device while the mobile device is operating in a low power mode, detecting whether the acoustic input includes a voice command based on performing a plurality of processing stages on the acoustic input, wherein at least one of the plurality of processing stages is performed while the mobile device is operating in the low power mode, and using at least one contextual cue to assist in detecting whether the acoustic input includes a voice command.


Patent
Nuance Communications | Date: 2016-08-16

A method of utilizing a speech assistant, the speech assistant designed to provide a voice input and speech output capability, the method comprising enabling the use of the speech assistant for communication with a user, and terminating the speech assistant when the communication is complete. The method further comprises receiving a notification from a native application associated with the communication, and activating a sub-portion of the speech assistant to enable outputting of the notification using speech output, thereby enabling the use of speech output for periodic announcements without enabling the speech assistant.
