ONLY AVAILABLE IN PAID PLANS.
Back to Beauty PageArtificial intelligence (AI) already has significant presence in professional audio. It’s automating and streamlining such tasks as editing, mixing, mastering, and, increasingly, music creation and generation. It allows producers and engineers to work faster and more efficiently and offers new creative possibilities. Even so, there are concerns regarding the potential for loss of human creative input and the ethical implications of AI-generated content.SVG asked some industry thought leaders for their views on this inflective moment in pro audio’s evolution.[caption id="attachment_274081" align="alignright" width="300"] NBC Sports’ Karl Malone: “The beauty of AI lies in its ability to analyze a scenario and evaluate far more parameters of an audio signal than we ever could as humans.”[/caption]Karl Malone, senior director, audio engineering, NBC Sports:I’m excited to have the ability to see what can be achieved with new AI tools in broadcast audio. We already look to Cedars, Isotopes, and upmix engines to look after complex tasks, and the use of Kick for mixing ball effects in European football should be enough for us to take automated and intelligent technology seriously.The beauty of AI lies in its ability to analyze a scenario and evaluate far more parameters of an audio signal than we ever could as humans. Although A1s excel at brain-to-hand-to-eye coordination, we simply cannot match the computational foresight of AI. It can process information in real time, anticipate outcomes, and make decisions or offer suggestions based on both its analysis and the specific training it has received for a given situation.However, the artistic nuances and creative expertise that define an A1’s work are irreplaceable, making AI unsuitable for mixing a major broadcast show on its own. That said, AI could be highly effective in handling secondary outputs, such as creating a dedicated mix for second-screen feeds — focusing, for instance, on mixing close ball effects and radio commentary as a separate audio feed.AI can also be helpful in the QC of large numbers of program feeds to be able to check audio and video for all sorts of visual- and audio-mix issues: out-of-sync audio and video, artifacts in video-resolution–quality fluctuations, missing audio, clipping, metadata timing, phase, etc. It can alert the MCR/BOC operator to take a closer look or listen.Ultimately, we decide if we want to use it in these early stages or not, so it’s not being forced upon anyone to implement.[caption id="attachment_185715" align="alignright" width="240"] Audio-Technica’s Gary Dixon: “AI is a tool for humans to better react to unpredictable events caused by interesting human nature.”[/caption]Gary Dixon, director, broadcast and production business development, Audio-Technica:Audio is dynamic, and the moments worth hearing are typically unpredictable: such as a crash in racing, the eruption of the crowd at the 18th hole, or holding a music note just a little longer at a concert. AI in professional audio and in microphones specifically will be used as a tool for quickly adapting hardware in unpredictable audio situations. Hardware can have limitations in gain structure, dynamics, and general EQ, whereas AI can assist the human in reacting to these situations.However, for audio to appeal to humans, the final monitoring stage and final adjustments will need to be done by humans. AI is a tool for humans to better react to unpredictable events caused by interesting human nature.Christian Scheck, head of marketing content, Lawo:From a content-creation perspective, generative AI is proving extremely powerful. As far as video is concerned, it is already possible to feed a generative AI engine with some information to get usable footage.Similarly, on the audio side, music written and performed by AI engines is beginning to scare songwriters and performers alike, while artificially generated voiceovers for videos and live commentary for broadcasts manage to fool a growing number of listeners into believing that they are listening to a human.AI has been used to good effect for the generation of closed captioning, which used to be a time-consuming task and can now be prepared within minutes. Results, however, still need to be checked by a human for consistency, tone of voice, and, critically, accuracy.[caption id="attachment_274084" align="alignright" width="300"] Lawo’s Christian Scheck: “AI has been used to good effect for the generation of closed captioning, which used to be a time-consuming task. Results, however, still need to be checked by a human.”[/caption]In the broadcast industry, more-advanced algorithms [can] help audio engineers, for instance, cope with a rapidly growing workload, not least in immersive-audio–mixing scenarios that require the supervision and delivery of several presentations and downmix formats — all from one console and by a single A1.Ultimately, the success of AI in live-production scenarios will depend on how well it responds to unexpected situations. It may very well become a powerful assistant that complements media production based on the use of Lawo solutions, but whether it will be able to replace DSP audio or high-quality video processing remains to be seen.AI can add value in other ways. On a modern, software-based platform like HOME Apps, for instance, AI can streamline process monitoring, vastly improve debug times, and shorten downtimes, as well as assist with data analysis and help predict failure conditions.Other applications could include advanced auto-mixing algorithms or the intelligent deployment of apps and services to max out hardware and software utilization in scenarios with a limited number of computing resources.However, AI needs to be applied with a layer of business governance, as it also presents a wealth of challenges.[caption id="attachment_274087" align="alignright" width="225"] Q5X’s Paul Johnson: “AI is the enabling technology that will drive the expanded use of live audio at sporting events.”[/caption]Paul Johnson, CEO, Q5X:There are many implications for AI that will expand the scale of audio capture from athletes and officials during sporting events. The dramatic increase in the speed and quality of speech-to-text technology will facilitate real-time audio processing, which enables filtering and correction of game audio for profanity and other undesired language and real-time translation to multiple languages. This will result in increased live game audio, which is always popular with fans. Once [the audio/video is] captured as text transcripts, the indexing of archived audio/video becomes much easier and the archives much more useful for postproduction purposes. AI will also be critical in associating the appropriate audio with automated digital zoom from within a wide-format video feed. Ultimately, AI will be able to mix multiple audio inputs so that the sound focuses on and tracks the target of a digital zoom.From a Q5X perspective, AI is the enabling technology that will drive the expanded use of live audio at sporting events, and it will continue to grow as processing capacity and speeds increase. We are focused on safely capturing very high-quality audio from players and officials during the game. This high-quality audio is the input required before AI can do its magic.[caption id="attachment_274088" align="alignright" width="300"] Q5X’s Paul Johnson: “AI is the enabling technology that will drive the expanded use of live audio at sporting events.”[/caption]Ben Escobedo, market development manager, Shure:Although the term AI often faces criticism as a mere “fun toy” for generating images or text, the future promises its transformation into a valuable partner for the audio industry. AI will assist in automating repetitive tasks and addressing complex challenges, improving audio workflows, and saving operators significant amounts of time. AI assistants, such as Microsoft Copilot, require top-notch audio quality to effectively capture and process voices. Shure is committed to delivering that.There is a common concern that AI could replace careers in the audio industry. However, AI should be viewed as an assistant, not a replacement. Although AI continues to advance, the human ability to understand and communicate effectively remains vastly superior. Live broadcasts and sound productions demand quick, on-the-spot thinking to resolve critical issues that can determine the success or failure of a show — something AI cannot replicate at this time.