Phase 1 Report

Group of six people in meeting room around large table with laptops and microphones; one person standing, others seated; guide dog present.

Phase 1 Summary

The initiative transforms conventional screen readers into experiential soundscapes using spatial audio technology. The prototype employs "layered voices and spatial audio to help users discern the text's location and context within the browser," enhancing digital accessibility. The project explores how AI-generated voices and descriptive text can create personalized sound environments, particularly benefiting blind and visually impaired users.

Phase 1 Methodology

Co-Creation Approach

The project engaged six co-creators through structured sessions at Constant vzw, a Brussels-based arts organization. Three screen reader users (Chris, Bruno, and Raphael) took part in experimental design exercises and technology demonstrations. The sessions focused on how differently people interact with digital content and on the challenges of accessing complex visual information.

Prototype Development

Initial version: Built using A-Frame framework, hosted on Glitch
Features: Keyboard navigation, audio triggers at spatial points
Feedback findings: Users appreciated spatial layout but needed stronger auditory boundary indicators
Refinement: Second prototype added enhanced audio boundaries, refined sound parameters, and adjusted distance modeling
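
To illustrate the two refinements named above (distance modeling and auditory boundary indicators), here is a minimal sketch. It is not the project's code. The gain formula is the inverse distance model that the Web Audio API defines for spatialized sources. All names (`inverseGain`, `step`, `Bounds`) and all numeric values are assumptions made for illustration.

```typescript
// Hypothetical types and names; none of these come from the prototypes.
interface Point { x: number; z: number; }

interface DistanceModel {
  refDistance: number;   // distance at which the source plays at full gain
  rolloffFactor: number; // how quickly gain falls off beyond refDistance
}

interface Bounds { minX: number; maxX: number; minZ: number; maxZ: number; }

// Inverse distance model as specified for the Web Audio API PannerNode.
function inverseGain(distance: number, m: DistanceModel): number {
  const d = Math.max(distance, m.refDistance);
  return m.refDistance / (m.refDistance + m.rolloffFactor * (d - m.refDistance));
}

// Straight-line distance between listener and source on the ground plane.
function distanceBetween(a: Point, b: Point): number {
  return Math.hypot(a.x - b.x, a.z - b.z);
}

// Moves the listener by one keyboard step and clamps the result to the scene.
// hitBoundary tells the caller to play a boundary cue instead of moving further.
function step(pos: Point, dx: number, dz: number, b: Bounds): { pos: Point; hitBoundary: boolean } {
  const wantedX = pos.x + dx;
  const wantedZ = pos.z + dz;
  const x = Math.min(b.maxX, Math.max(b.minX, wantedX));
  const z = Math.min(b.maxZ, Math.max(b.minZ, wantedZ));
  return { pos: { x, z }, hitBoundary: x !== wantedX || z !== wantedZ };
}
```

In this sketch, an arrow-key handler would call `step` on each key press, play a distinct cue whenever `hitBoundary` is true, and otherwise update each source's volume with `inverseGain(distanceBetween(listener, source), model)`.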

Phase 1 Results

Key Findings

1. Spatial Awareness: "Traditional screen readers often 'flatten' web experiences by reducing content to linear lists," eliminating spatial context crucial for understanding complex information like maps or images.

2. Customization Importance: Users valued control over voice characteristics, sound localization, and movement within soundscapes. The team used ElevenLabs and ChatGPT for demonstration purposes.

3. Navigation Challenges: Exploration appealed to participants, but they highlighted how difficult it was to navigate without clear auditory cues and how hard multiple layered voices were to follow.

4. Sound Method Development: Audio layers were designed to coexist without conflict. Informed by Daniel R. Montello's spatial theories, the design aligns sounds with four environmental scales: immediate, vista, environmental, and geographical.
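
One way to picture the layering in finding 4 is to assign each sound source to a scale according to its distance from the listener, so that each scale can be mixed as its own layer. The sketch below does only that. The four scale names come from the report. The distance thresholds, the `BANDS` table, and the function names are assumptions made for illustration, not values used by the project.

```typescript
type Scale = "immediate" | "vista" | "environmental" | "geographical";

interface ScaleBand { scale: Scale; maxDistance: number; }

// Hypothetical thresholds in scene units, ordered from nearest to farthest.
const BANDS: ScaleBand[] = [
  { scale: "immediate", maxDistance: 1 },
  { scale: "vista", maxDistance: 10 },
  { scale: "environmental", maxDistance: 100 },
  { scale: "geographical", maxDistance: Infinity },
];

// Returns the first band whose upper limit covers the given distance.
function scaleFor(distance: number): Scale {
  for (const band of BANDS) {
    if (distance <= band.maxDistance) return band.scale;
  }
  return "geographical";
}

interface Source { id: string; distance: number; }

// Groups source ids by scale so each group can be treated as one audio layer.
function groupByScale(sources: Source[]): Record<Scale, string[]> {
  const layers: Record<Scale, string[]> = {
    immediate: [], vista: [], environmental: [], geographical: [],
  };
  for (const s of sources) layers[scaleFor(s.distance)].push(s.id);
  return layers;
}
```

Keeping the layers separate would let a listener adjust or mute one scale at a time, which is one possible response to the layered-voice complexity reported in finding 3.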

Interactive Demonstrations

Prototype 1: Arrow-key navigation in full-screen Chrome; headphones recommended. Try Prototype 1

Prototype 2: Enhanced version with improved controls and audio cues. Try Prototype 2

Hieronymus Bosch, The Garden of Earthly Delights, oil on oak panels, Museo del Prado, Madrid

Alt-Text Generation Demo

Garden of Earthly Delights (Hieronymus Bosch painting) example featuring:

Four audio files demonstrating customized descriptions
Art curator version
Child-friendly versions (neutral and upbeat tones)
Soundtrack integration using Imaginary Soundscapes
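
The variants listed above differ in audience and tone. A minimal sketch of how such a request could be parameterized before it is sent to a text-generation service is shown below. The report does not describe how the demo prompts were written, so the `DescriptionRequest` shape, the `buildPrompt` function, and the prompt wording are all assumptions, and no real service API is called.

```typescript
type Audience = "art curator" | "child";
type Tone = "neutral" | "upbeat";

// Hypothetical request shape; not taken from the project.
interface DescriptionRequest {
  subject: string;
  audience: Audience;
  tone: Tone;
  maxSentences: number;
}

// Assembles a plain-text prompt from the listener's preferences.
function buildPrompt(req: DescriptionRequest): string {
  const style = req.audience === "art curator"
    ? "Use precise art-historical vocabulary."
    : "Use short sentences and everyday words.";
  return [
    `Describe "${req.subject}" for a listener who cannot see the image.`,
    `Audience: ${req.audience}. Tone: ${req.tone}.`,
    style,
    `Limit the description to ${req.maxSentences} sentences.`,
  ].join("\n");
}

// Example: the child-friendly, upbeat variant from the list above.
const upbeatChildPrompt = buildPrompt({
  subject: "The Garden of Earthly Delights by Hieronymus Bosch",
  audience: "child",
  tone: "upbeat",
  maxSentences: 5,
});
```

The resulting string would then be passed to whichever text and voice services a deployment uses; that step is left out because it depends on the chosen provider.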

Garden of Earthly Delights - Art Curator Perspective

Detailed art historical description tailored for an art curator

Garden of Earthly Delights - Child-Friendly Description

Simplified description suitable for children

Garden of Earthly Delights - Child-Friendly (Upbeat Tone)

Child-friendly description with enthusiastic, upbeat tone

Garden of Earthly Delights - Child-Friendly (Upbeat Tone with Soundscape)

Child-friendly description with upbeat tone and immersive soundscape background

Phase 2 Goals

Co-Creation Objectives

Expanded sessions with diverse visual content (charts, infographics, complex materials)
Open-source release that encourages participation from developers, sound designers, and accessibility advocates
Documented co-creation guidelines for future inclusive design projects

Technical Objectives

Sonification of images, maps, and web elements into spatial soundscapes
Open-source naturalistic voice customization with real-time processing
Customizable alt-text generation system
Real-time responsiveness to user navigation inputs
Platform expansion beginning with image/map services (Google Maps, OpenStreetMap)
Comprehensive accessibility tutorials and documentation

Acknowledgments

Funding organizations: Constant, The Processing Foundation, Stimuleringsfonds
Mentor: Luis Morales-Navarro
Co-creators: Bruno, Chris, Raphael, Joris