• Home
  • Practice Focus
    • Facial Plastic/Reconstructive
    • Head and Neck
    • Laryngology
    • Otology/Neurotology
    • Pediatric
    • Rhinology
    • Sleep Medicine
    • How I Do It
    • TRIO Best Practices
  • Business of Medicine
    • Health Policy
    • Legal Matters
    • Practice Management
    • Tech Talk
    • AI
  • Literature Reviews
    • Facial Plastic/Reconstructive
    • Head and Neck
    • Laryngology
    • Otology/Neurotology
    • Pediatric
    • Rhinology
    • Sleep Medicine
  • Career
    • Medical Education
    • Professional Development
    • Resident Focus
  • ENT Perspectives
    • ENT Expressions
    • Everyday Ethics
    • From TRIO
    • The Great Debate
    • Letter From the Editor
    • Rx: Wellness
    • The Voice
    • Viewpoint
  • TRIO Resources
    • Triological Society
    • The Laryngoscope
    • Laryngoscope Investigative Otolaryngology
    • TRIO Combined Sections Meetings
    • COSM
    • Related Otolaryngology Events
  • Search

Listeners Could Readily Identify Speech Synthesized from Speech Neuroprosthetic Technology

by Amy E. Hamaker • November 11, 2019

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
Print-Friendly Version

What is the clinical viability of using speech neuroprosthetic technology to restore spoken communication?

Bottom Line: In closed vocabulary tests, listeners could readily identify and transcribe speech synthesized from cortical activity.

You Might Also Like

  • Neurofibromatosis Patients See Different Benefits from CI and ABI Technology
  • Babbling Is Necessary for Speech and Language Development
  • Asymmetric Hearing Loss Can Lead to Speech Recognition Difficulties
  • SM13: Pediatric Speech Delays Can Be Managed if Caught Early
Explore This Issue
November 2019

Background: Technology that translates neural activity into speech would be transformative for people who are unable to communicate as a result of neurological impairments. Decoding speech from neural activity is challenging because speaking requires very precise and rapid multi-dimensional control of vocal tract articulators. Although these systems can enhance a patient’s quality of life, most users struggle to transmit more than 10 words per minute, a rate far slower than the average of 150 words per minutes of natural speech. The study authors designed a neural decoder that explicitly leverages kinematic and sound representations encoded in human cortical activity to synthesize audible speech.

Comment: Investigators in this compelling study used signals from a cortical sensor to produce intelligible speech using a neural decoder. —Andy Murr, MD

Study design: Recording of high-density electrocorticography (ECoG) signals from five participants who underwent intracranial monitoring for epilepsy treatment as they spoke several hundreds of sentences aloud. Tests were run on Amazon Mechanical Turk.

Setting: Department of Neurological Surgery, University of California San Francisco.

Synopsis: Recurrent neural networks first decoded directly recorded cortical activity into representations of articulatory movement, and then transformed these representations into speech acoustics. In closed vocabulary tests, listeners could readily identify and transcribe speech synthesized from cortical activity. Intermediate articulatory dynamics enhanced performance even with limited data. Decoded articulatory representations were highly conserved across speakers, enabling a component of the decoder to be transferrable across participants. Furthermore, the decoder could synthesize speech when a participant silently mimed sentences.

Citation: Anumanchipalli GK, Chartier J, Chang EF. Speech synthesis from neural decoding of spoken sentences. Nature. 2019;568:493-498.

Filed Under: Literature Reviews, Otology/Neurotology Tagged With: clinical best practices, clinical researchIssue: November 2019

You Might Also Like:

  • Neurofibromatosis Patients See Different Benefits from CI and ABI Technology
  • Babbling Is Necessary for Speech and Language Development
  • Asymmetric Hearing Loss Can Lead to Speech Recognition Difficulties
  • SM13: Pediatric Speech Delays Can Be Managed if Caught Early

The Triological SocietyENTtoday is a publication of The Triological Society.

Polls

Have you successfully navigated a mid-career change?

View Results

Loading ... Loading ...
  • Polls Archive

Top Articles for Residents

  • ENTtoday Welcomes Resident Editorial Board Members
  • Applications Open for Resident Members of ENTtoday Edit Board
  • How To Provide Helpful Feedback To Residents
  • Call for Resident Bowl Questions
  • New Standardized Otolaryngology Curriculum Launching July 1 Should Be Valuable Resource For Physicians Around The World
  • Popular this Week
  • Most Popular
  • Most Recent
    • The Dramatic Rise in Tongue Tie and Lip Tie Treatment

    • Changing Perspectives: Why ENT Surgeons Should Consider Nerve Reconstruction

    • Otolaryngologists Are Still Debating the Effectiveness of Tongue Tie Treatment

    • Rating Laryngopharyngeal Reflux Severity: How Do Two Common Instruments Compare?

    • Keeping Watch for Skin Cancers on the Head and Neck

    • The Dramatic Rise in Tongue Tie and Lip Tie Treatment

    • Rating Laryngopharyngeal Reflux Severity: How Do Two Common Instruments Compare?

    • Is Middle Ear Pressure Affected by Continuous Positive Airway Pressure Use?

    • Otolaryngologists Are Still Debating the Effectiveness of Tongue Tie Treatment

    • Complications for When Physicians Change a Maiden Name

    • ENTtoday Welcomes Resident Editorial Board Members
    • Journal Publishing Format Suggestion: A Greener Future for Medical Journals
    • Physician, Know Thyself! Tips for Navigating Mid-Career Transitions in Otolaryngology
    • PA Reform: Is the Administrative War of Attrition Ending?
    • How To: Anatomic-Based Technique for Sensing Lead Placement in Hypoglossal Stimulator Implantation

Follow Us

  • Contact Us
  • About Us
  • Advertise
  • The Triological Society
  • The Laryngoscope
  • Laryngoscope Investigative Otolaryngology
  • Privacy Policy
  • Terms of Use
  • Cookies

Wiley

Copyright © 2025 by John Wiley & Sons, Inc. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies. ISSN 1559-4939