ebonivon/call-genie at main

Build a fully functional AI-powered web application named "CallGenie" that enables users to create, customize, and deploy their own AI calling agents (similar to OmniDim). The app should feature an immersive 3D user interface for an engaging experience and a robust backend integrating the Gemini AI API key for advanced conversational AI capabilities.

---

Requirements:

Frontend:

- Use React.js with Three.js or React Three Fiber for creating a smooth, interactive 3D interface (like a rotating agent avatar or 3D dashboard)
- Responsive design for desktop and mobile
- Agent creation wizard UI allowing users to:
  - Choose voice type, language, and personality traits
  - Script or customize dialogue flows with a visual flowchart or form inputs
  - Test calls in real-time with AI-generated responses
- Real-time call simulation with audio playback using text-to-speech powered by Gemini API
- Dashboard for managing multiple agents, viewing call logs, and analytics
- User authentication (email/password or OAuth) for saving agents and data securely

Backend:

- Build with Node.js and Express.js (or your preferred backend stack)
- Securely handle Gemini API key to communicate with Gemini AI models for:
  - Natural language understanding & response generation
  - Text-to-speech synthesis for voice calls
- API endpoints for:
  - Creating/updating/deleting AI agents and their scripts
  - Handling real-time call simulation and streaming responses
  - User authentication and data persistence
- Use a database (MongoDB/PostgreSQL) for storing user data, agent configs, and call logs
- Implement websocket or server-sent events for real-time interaction during calls

Additional:

- Error handling and loading states for smooth UX
- Include environment variable setup instructions for Gemini API key
- Deployable on platforms like Vercel (frontend) and Railway/Heroku (backend)
- Clear README with setup, usage, and deployment steps

At the bottom of the app, include: "© 2025 OrionixLabs. All rights reserved."

---

Bonus:

- Animate the 3D agent with subtle facial expressions synced to AI responses
- Include multilingual support in UI and AI conversations
- Add voice input option to simulate full duplex calls

---

- Initial Deployment
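
Two of the backend requirements above, keeping the Gemini API key in an environment variable and streaming call responses over server-sent events, can be sketched roughly as follows. This is a minimal illustration and not code from the repository: the variable name `GEMINI_API_KEY`, the helper names `requireGeminiKey`, `formatSseEvent`, and `framesForReply`, and the `chunk`/`done` event names are all assumptions made for the example. It uses no third-party packages and does not call the Gemini API.

```typescript
// All names below are hypothetical; the repository may structure this differently.

/** A map of environment variables, e.g. process.env in Node.js. */
type Env = { [name: string]: string | undefined };

/**
 * Read the Gemini API key from the environment and fail fast when it is
 * missing, so the key is never hard-coded or shipped to the frontend.
 * GEMINI_API_KEY is an assumed variable name.
 */
function requireGeminiKey(env: Env): string {
  const raw = env["GEMINI_API_KEY"];
  const key = raw === undefined ? "" : raw.trim();
  if (key === "") {
    throw new Error("GEMINI_API_KEY is not set; define it in .env or in the host's settings");
  }
  return key;
}

/**
 * Serialize one server-sent event frame. JSON.stringify escapes newlines
 * inside strings, so the payload always fits on a single "data:" line.
 */
function formatSseEvent(event: string, data: { [field: string]: unknown }): string {
  return "event: " + event + "\ndata: " + JSON.stringify(data) + "\n\n";
}

/**
 * Turn the text chunks of one AI reply into SSE frames: one "chunk" event
 * per piece of text, followed by a final "done" event with the chunk count.
 */
function framesForReply(chunks: string[]): string[] {
  const frames = chunks.map((text, index) => formatSseEvent("chunk", { index, text }));
  frames.push(formatSseEvent("done", { chunks: chunks.length }));
  return frames;
}
```

In an Express route, one plausible use is to call `requireGeminiKey(process.env)` at startup, respond with the `Content-Type: text/event-stream` header, and write each frame to the response as the model's text arrives. The frontend would then read the stream with an `EventSource` and listen for the assumed `chunk` and `done` events.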

c310226 verified 4 months ago