Researchers have developed a new artificial intelligence system that can produce accurate images of a place based on a sound recording made there. In this research, sounds recorded on the streets of different cities around the world were given to the artificial intelligence model, which then produced accurate images of those streets.
According to published reports, a team of researchers from the University of Texas sought to answer the question of whether artificial intelligence can understand the visual characteristics of an environment from audio clips alone, a skill that was once thought to be unique to humans.
The ability of artificial intelligence to understand the environment from recorded sound
The researchers explain in their paper that they first collected 100 YouTube video and audio clips from cities in North America, Asia, and Europe. They then used these clips to train an artificial intelligence model that can produce high-resolution images of different environments based on audio inputs.

Next, the AI was fed 10-second audio clips and asked to generate high-resolution images of what the environment looked like.
To determine the accuracy of the images, a group of people took part in the research as judges. The judges were shown the images produced by the artificial intelligence and played the sound on which those images were based, then asked to identify which image corresponded to the sound. On average, the judges chose correctly 80% of the time.
According to a statement published by the University of Texas, the accuracy of the images created by this artificial intelligence model shows that machines can closely simulate the human connection between audio and visual perception of environments.
Yuhao Kang, one of the authors of this study, says:
“Our research shows that acoustic environments contain enough visual cues to produce recognizable images of streetscapes in which different locations are accurately represented. That means you can transform acoustic environments into vivid visual displays, and more effectively transform sounds into sights.”