Sony Improving Player Audio In Multiplayer Games In Real Time

Sony Interactive Entertainment may be working on a real-time system, as outlined in a recently published patent, to address player audio issues in multiplayer video games, including hissing sounds, unclear pronunciations, and other undesirable traits, aiming to create a more immersive gameplay experience.

Source: logo king/Vecteezy


  • Sony Interactive Entertainment has recently released a patent centred around audio processing technology aimed at improving voice samples in multiplayer video games.
  • The technology tackles challenges such as sibilance, unclear pronunciations, and other undesirable traits in the gaming audio experience.
  • The technology involves an audio processing system that analyses voice samples, identifies specific sound types related to character pronunciation, and generates a filtered voice sample.
  • A key innovation is the use of personalised filters fine-tuned to individual players’ voices based on additional voice samples.
  • The personalised filters are created using machine learning models trained on additional voice samples, allowing the system to adapt to various voice characteristics such as gender, age, accent, dialect, or ethnic background.
  • The implications for gaming are significant, as the technology could elevate the gaming experience by providing clearer and more refined voice communication within multiplayer video games.

Earlier today, we encountered a recently published patent filed in May 2022, titled “SYSTEMS AND METHODS FOR AUTOMATED CUSTOMIZED VOICE FILTERING,” which, published last week, tackles a prevalent issue in audio processing by concentrating on refining voice samples to provide an improved and personalised listening experience in multiplayer video games.

Conceptual diagram illustrating an example an audio processing system, in accordance with some examples. | Source: Patent Public Search

“Systems and methods for audio processing are described. An audio processing system receives audio content that includes a voice sample. The audio processing system analyzes the voice sample to identify a sound type in the voice sample,” reads the patent’s abstract.

“The sound type corresponds to pronunciation of at least one specified character in the voice sample. The audio processing system generates a filtered voice sample at least in part by filtering the voice sample to modify the sound type. The audio processing system outputs the filtered voice sample.”

Central to the patent is a device for audio processing designed to receive audio content containing a voice sample, usually involving a player uttering a word with multiple characters. The breakthrough lies in the system’s ability to analyse the voice sample, identify specific sound types related to character pronunciation, and generate a filtered voice sample.

What sets Sony Interactive Entertainment’s technology apart is the use of a personalised filter, fine-tuned to the player’s voice based on additional voice samples. The patent recognises that different voices may exhibit varying audio characteristics, leading to challenges such as sibilance (hissing sounds), unclear pronunciations, or other undesirable traits.

Flow diagram illustrating an audio processing process, in accordance with some examples. | Source: Patent Public Search

The technology steps in to address these issues by pinpointing problematic sound types and applying a tailored solution to enhance the overall quality of the voice sample. The patent introduces the concept of personalised filters, emphasising the use of machine learning models trained on additional voice samples.

This ensures that the filter is specifically tuned to the individual characteristics of the player’s voice. The technology is not limited to addressing sibilance but extends to various voice characteristics. It can identify voice types related to gender, age, accent, and other factors, allowing for a comprehensive approach to audio enhancement.

The voice type may encompass attributes such as age, accent, dialect, or ethnic background, with the patent proposing that the technology can adapt dynamically to factors like microphone positioning during recording, thereby enhancing the precision of the filtered output.

The sound type could also correspond to speech dysfluency, and the filtering process corrects this issue in the voice sample. The filtering process involves using specific filters like a de-esser or a compressor targeting a specified frequency range related to the identified sound type.

While the patent’s applications are broad, its implications for gaming are particularly noteworthy. With Sony Interactive Entertainment being a key player in the video game industry, the technology could significantly elevate the gaming experience. Clearer and more refined voice communication within multiplayer video games, for instance, could enhance teamwork and overall immersion.

Beyond the gaming realm, this innovation has the potential to impact various industries. In communication platforms, video conferencing, and virtual assistants, the technology could ensure clearer and more intelligible voice interactions. Users with diverse accents or speech patterns may benefit from a more personalized audio experience, fostering inclusivity.

Although the patent offers a comprehensive blueprint for the technology, its actual implementation remains to be witnessed; nevertheless, Sony Interactive Entertainment has a history of translating patents into groundbreaking products.

What do you think about this? Do tell us your opinions in the comments below!

Similar Reads: Sony Working On AI Assistance For Struggling Players In Games

Was this helpful? 🎮

Good job! Please give your positive feedback ☺️

How could we improve this post? Please Help us. 🤔