A newer version of the Gradio SDK is available:
6.1.0
title: Autism_QA
app_file: multi_page_gradio_demo.py
sdk: gradio
sdk_version: 5.40.0
Wisal: Autism AI Assistant
Wisal is an AI-powered assistant specifically designed to help individuals with autism with their communication needs. The application provides both text and audio responses to make communication more accessible.
Features
- Text and voice input/output
- Document Q&A capabilities
- Live audio chat
- Customizable voice responses
- User-specific document handling
Prerequisites
Before you begin, ensure you have the following installed:
- Python 3.8 or higher
- pip (Python package installer)
- Conda (optional, but recommended for environment management)
Setup Instructions
1. Clone the Repository
git clone <repository-url>
cd autim-qa-mine
2. Environment Configuration
Copy the example environment file and fill in your API keys:
cp .env.example .env
Then edit the .env file and add your API keys:
- Google Gemini API Key
- SiliconFlow API Key
- Other optional API keys as needed
3. Running the Application
We provide a convenient script to run the application with different environment options:
./run.sh
The script will present you with three options:
- Use an existing conda environment
- Create a new conda environment
- Install requirements with the current Python environment
Manual Installation (Alternative)
If you prefer to set up the environment manually:
Create a virtual environment (recommended):
python -m venv wisal-env source wisal-env/bin/activate # On Windows: wisal-env\Scripts\activateInstall dependencies:
pip install -r requirements.txtRun the application:
python main.py
Usage
Once the application is running, it will be accessible at http://localhost:8080 in your web browser.
Basic Usage
- Type your question in the text box or record audio using the microphone
- Select your preferred voice for the response
- Click "Send Message" or press Enter
- The AI will respond with both text and audio
Advanced Features
- Document Q&A: Upload documents (PDF, DOCX, TXT) to ask questions about their content
- Live Chat: Use the WebRTC feature for real-time audio conversation
- User-Specific Documents: Store and query personal documents
Configuration
The application can be configured through the config.yaml file for model settings and other parameters.
Troubleshooting
If you encounter issues:
- Ensure all API keys in
.envare correct and active - Check that all dependencies are installed:
pip install -r requirements.txt - Verify the application is using the correct Python environment
For further assistance, please check the logs in the logs/ directory.