mirror of
https://github.com/x1xhlol/system-prompts-and-models-of-ai-tools.git
synced 2025-12-17 05:55:11 +00:00
* Checkpoint before follow-up message Co-authored-by: escapethematrixmate01 <escapethematrixmate01@gmail.com> * Add system prompts for various AI tools and assistants Co-authored-by: escapethematrixmate01 <escapethematrixmate01@gmail.com> * Add N8N workflow documentation API with database and import tools Co-authored-by: escapethematrixmate01 <escapethematrixmate01@gmail.com> * Checkpoint before follow-up message Co-authored-by: escapethematrixmate01 <escapethematrixmate01@gmail.com> * Create Revolutionary AI Agent with voice and video capabilities Co-authored-by: escapethematrixmate01 <escapethematrixmate01@gmail.com> * Create advanced voice processor with character-based voice synthesis Co-authored-by: escapethematrixmate01 <escapethematrixmate01@gmail.com> * Create Revolutionary AI Agent System with voice and video capabilities Co-authored-by: escapethematrixmate01 <escapethematrixmate01@gmail.com> * Checkpoint before follow-up message Co-authored-by: escapethematrixmate01 <escapethematrixmate01@gmail.com> * Add full-stack, ML, and blockchain capabilities to Ultimate AI Agent Co-authored-by: escapethematrixmate01 <escapethematrixmate01@gmail.com> --------- Co-authored-by: Cursor Agent <cursoragent@cursor.com> Co-authored-by: escapethematrixmate01 <escapethematrixmate01@gmail.com>
8.7 KiB
8.7 KiB
🎭 Revolutionary AI Agent System
The Most Advanced AI Agent with Professional Voice Actor Capabilities
Welcome to the future of AI interaction! This revolutionary AI agent system combines cutting-edge artificial intelligence with professional voice acting capabilities, creating the most advanced AI agent ever built.
🌟 Revolutionary Features
🎤 Professional Voice Acting
- Multi-language voice synthesis with 11+ languages supported
- Emotional expression control - happy, sad, angry, calm, excited, professional
- Character voice development with unique personalities
- Real-time voice generation using ElevenLabs API
- Voice cloning and customization
🎬 Video Production
- AI video generation with Stable Diffusion and Runway ML
- Professional voice-overs synchronized with video content
- Scene composition and visual effects integration
- Multi-modal content creation
👤 Character Development
- Personality engine for creating complex AI characters
- Memory systems for persistent character development
- Behavior modeling with advanced AI reasoning
- Character voice training and customization
💻 Advanced Code Generation
- Multi-language support for all major programming languages
- Voice code explanations with real-time narration
- Architecture design with AI-powered insights
- Real-time collaboration capabilities
⚙️ Workflow Orchestration
- Voice-guided automation for complex workflows
- Intelligent decision making with AI reasoning
- Real-time monitoring with voice feedback
- Integration with n8n workflows
📊 Multimodal AI Analysis
- Voice-narrated reports with professional presentation
- Data visualization with interactive elements
- Predictive analytics with AI insights
- Interactive presentations with voice guidance
🚀 Quick Start
Prerequisites
- Python 3.11+
- Docker (optional)
- API keys for:
- OpenAI (GPT-4)
- Anthropic (Claude)
- ElevenLabs (Voice Synthesis)
- Replicate (Video Generation)
Installation
Option 1: Direct Installation
# Clone the repository
git clone <repository-url>
cd revolutionary-ai-agent
# Create virtual environment
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# Install dependencies
pip install -r revolutionary_voice_agent_requirements.txt
# Set environment variables
export OPENAI_API_KEY="your-openai-key"
export ANTHROPIC_API_KEY="your-anthropic-key"
export ELEVENLABS_API_KEY="your-elevenlabs-key"
export REPLICATE_API_KEY="your-replicate-key"
# Run the system
python revolutionary_voice_agent.py
Option 2: Docker Installation
# Build the Docker image
docker build -f revolutionary_voice_agent_Dockerfile -t revolutionary-ai-agent .
# Run the container
docker run -p 8000:8000 \
-e OPENAI_API_KEY="your-openai-key" \
-e ANTHROPIC_API_KEY="your-anthropic-key" \
-e ELEVENLABS_API_KEY="your-elevenlabs-key" \
-e REPLICATE_API_KEY="your-replicate-key" \
revolutionary-ai-agent
🎯 Usage Examples
Voice Synthesis
import requests
# Basic voice synthesis
response = requests.post("http://localhost:8000/api/voice/synthesize", json={
"text": "Hello! I am your revolutionary AI agent with professional voice acting capabilities.",
"voice_id": "pNInz6obpgDQGcFmaJgB", # Adam voice
"voice_settings": {
"stability": 0.5,
"similarity_boost": 0.75,
"style": 0.0,
"use_speaker_boost": True
}
})
# Save the audio
with open("voice_output.mp3", "wb") as f:
f.write(response.content)
Character Voice Creation
# Create a character voice
response = requests.post("http://localhost:8000/api/voice/character", json={
"character_name": "Professor AI",
"dialogue": "Greetings! I am Professor AI, your knowledgeable companion.",
"personality": {
"traits": ["intelligent", "enthusiastic", "helpful"],
"speaking_style": "academic but friendly"
}
})
Video Generation with Voice-Over
# Generate video with voice-over
response = requests.post("http://localhost:8000/api/video/generate", json={
"prompt": "A futuristic AI laboratory with holographic displays",
"duration": 10,
"voice_over": "Welcome to the future of AI research and development."
})
🔧 API Endpoints
Voice Acting
POST /api/voice/synthesize- Generate voice synthesisPOST /api/voice/character- Create character voicesGET /api/voice/models- Get available voice models
Video Production
POST /api/video/generate- Generate videos with voice-oversGET /api/video/status/{video_id}- Check video generation status
System Information
GET /api/capabilities- Get all agent capabilitiesGET /api/tools- Get available toolsGET /api/models- Get available AI modelsGET /api/health- System health check
🎨 Voice Models Available
ElevenLabs Voices
- Adam - Professional male voice
- Bella - Friendly female voice
- Charlie - Serious male voice
- Diana - Energetic female voice
- Eve - Calm female voice
OpenAI TTS Voices
- Alloy - Versatile voice
- Echo - Clear and articulate
- Fable - Storytelling voice
- Onyx - Deep and authoritative
- Nova - Bright and energetic
- Shimmer - Soft and melodic
🌍 Supported Languages
- English (US/UK)
- Spanish
- French
- German
- Italian
- Portuguese
- Polish
- Hindi
- Japanese
- Korean
- Chinese
🔮 Advanced Capabilities
Memory System
- Short-term memory for immediate context
- Long-term memory for persistent knowledge
- Character memories for personality development
- Conversation history tracking
Planning System
- Advanced task planning with AI reasoning
- Execution history tracking
- Dynamic plan adjustment based on context
- Multi-step task orchestration
Tool Integration
- Cursor AI tools for code generation
- Manus agent tools for task execution
- Custom workflow tools for automation
- API integration capabilities
🛠️ Development
Architecture
The system is built with a modular architecture:
- Core Agent Engine - Main AI reasoning and decision making
- Voice Processing Module - Voice synthesis and analysis
- Video Processing Module - Video generation and editing
- Memory Management - Persistent storage and retrieval
- Tool Registry - Dynamic tool loading and execution
Extending the System
# Add new voice models
agent.voice_registry["custom_voice"] = {
"provider": "custom",
"languages": ["en"],
"emotions": ["neutral"],
"voices": {"custom": "voice_id"}
}
# Add new capabilities
agent.capabilities["custom_capability"] = AgentCapability(
name="Custom Capability",
description="Your custom capability",
category="custom",
tools=["custom_tool"],
models=["custom_model"],
workflows=["custom_workflow"],
voice_enabled=True
)
🔒 Security
- API key management with environment variables
- Request validation with Pydantic models
- Error handling with proper HTTP status codes
- Rate limiting for API endpoints
- Input sanitization for all user inputs
📊 Performance
- Concurrent task execution up to 20 tasks
- Real-time voice synthesis with low latency
- Video processing with GPU acceleration support
- Memory optimization for large-scale operations
- Caching system for improved performance
🤝 Contributing
We welcome contributions to make this revolutionary AI agent even more powerful!
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests
- Submit a pull request
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
🙏 Acknowledgments
- ElevenLabs for professional voice synthesis
- OpenAI for GPT-4 and Whisper models
- Anthropic for Claude AI models
- Replicate for video generation capabilities
- Cursor AI for code generation tools
- Manus AI for agent framework inspiration
🚀 Future Roadmap
- Real-time voice interaction with speech recognition
- Multi-character conversations with voice switching
- Advanced video editing with AI-powered effects
- Emotion detection from user input
- Personalized voice training for custom voices
- Multi-modal input processing (text, voice, video)
- Advanced workflow automation with voice guidance
- Real-time collaboration features
🎭 Welcome to the future of AI interaction! This revolutionary system represents the pinnacle of AI agent technology, combining the best of artificial intelligence with professional voice acting capabilities. Experience the future today!