AIproject Svelte Themes

Aiproject

Advanced AI project with Tauri, Svelte, and Gemini integration

GOD PROMPT V8 - Passive Meeting Intelligence Engine

An Invisible, Silent Corporate Meeting Shadow God

Real-time meeting intelligence powered by Google Gemini AI

Features β€’ Installation β€’ Usage β€’ Architecture β€’ Documentation


πŸ‘₯ Developers

Shehr Bano
Shehr Bano
Lead Developer
Backend Architecture & AI Integration
Anila Younas
Anila Younas
Lead Developer
Frontend Design & UX Engineering

🎯 Overview

GOD PROMPT V8 is a cutting-edge passive meeting intelligence engine that captures system-wide audio, processes it through Google's Gemini AI, and provides real-time meeting intelligence with zero-latency optimizations.

Designed to be an invisible, silent corporate meeting shadow god, it processes:

  • βœ… Urdu/English code-switching
  • βœ… Dense overlaps and interruptions
  • βœ… Noisy rooms with background chatter
  • βœ… Micro-tone emotion detection
  • βœ… Urgency, hesitation, and dominance analysis

Core Principles

  • Zero Latency: Micro-chunking and optimistic updates
  • Total Dominion: Complete user control over every aspect
  • Strict Schema: Rigid JSON output compliance
  • Passive Mode: Silent observation, no active participation

✨ Features

🎀 Station 1: Omnipresent Audio Capture

  • Dual-Stream Capture: Simultaneous microphone + system loopback (WASAPI)
  • High-Quality Processing: Rubato Sinc interpolation for pristine resampling to 16kHz
  • Intelligent Mixing: Channel averaging for optimal mono output
  • Zero-Latency Buffers: Lock-free Crossbeam channels for efficient data transfer
  • VAD Integration: Voice Activity Detection hooks for smart audio filtering

🌐 Station 2: Gemini Live API Integration

  • WebSocket Client: Real-time bidirectional communication with Gemini
  • Smart Audio Streaming: F32β†’I16β†’Base64 encoding pipeline
  • Schema Validation: Strict GOD PROMPT V8 compliance enforcement
  • Text Modality: Passive mode responses for silent intelligence gathering
  • Connection Management: Automatic reconnection and state handling

⚑ Station 3: Hypersonic Processing Engine

  • JSON Schema Validation: 16 intelligence categories, 9 emotional tones
  • Optimistic Updates: Low-confidence partial transcripts for instant feedback
  • Graph State Management: Thread-safe knowledge graph with nodes and edges
  • Entity Extraction: Named entity recognition and relationship mapping
  • Confidence Scoring: Per-transcript confidence metrics

🎨 Station 4: Psychic Desktop Dashboard

  • Haptic-Dark Theme: Matrix-inspired green-on-black aesthetic
  • Live Transcript View: Real-time intelligence cards with micro-animations
  • Knowledge Graph Visualization: SVG-based force-directed graph rendering
  • God Controls Panel:
    • Confidence threshold slider (0.0-1.0)
    • VAD sensitivity adjustment
    • Category filters (multi-select)
    • Auto-save toggle
    • Optimistic updates toggle
  • Diagnostics Panel:
    • Audio latency monitoring (<50ms target)
    • Gemini API latency tracking (<200ms target)
    • UI frame rate counter (60 FPS target)
    • System status indicators

πŸ’Ύ Station 5: Post-Processing & Omnicontrol

  • Session Management:
    • Save sessions to local storage
    • Load previous sessions
    • List all sessions (sorted by date)
    • Delete unwanted sessions
  • Multi-Format Export:
    • JSON (full session data with metadata)
    • CSV (transcript table for spreadsheet analysis)
    • Markdown (formatted report for documentation)
  • Local Persistence: Automatic data directory management in user's local app data

πŸš€ Installation

Prerequisites

Before you begin, ensure you have the following installed:

  • Rust: Version 1.70 or higher

    # Check Rust version
    rustc --version
      
    # Install Rust (if needed)
    # Visit: https://rustup.rs/
    
  • Node.js: Version 18 or higher

    # Check Node.js version
    node --version
      
    # Install Node.js (if needed)
    # Visit: https://nodejs.org/
    
  • Windows: Required for WASAPI loopback support

    • Windows 10 or later recommended

Step-by-Step Installation

  1. Clone or Navigate to the Repository

    cd GOD-V8
    
  2. Install Node.js Dependencies

    npm install
    

    This will install:

    • Tauri CLI and plugins
    • Svelte and SvelteKit
    • TailwindCSS and PostCSS
    • All frontend dependencies
  3. Verify Rust Dependencies

    cd src-tauri
    cargo check
    cd ..
    

    This will download and compile:

    • CPAL (audio processing)
    • Rubato (resampling)
    • Tokio (async runtime)
    • WebSocket libraries
    • All backend dependencies
  4. Run in Development Mode

    npm run tauri dev
    

    This will:

    • Start the Vite dev server
    • Compile the Rust backend
    • Launch the application window

πŸ“– Usage Guide

First-Time Setup

  1. Launch the Application

    • Run npm run tauri dev
    • The GOD PROMPT V8 window will appear
  2. Configure Gemini API

    • Obtain your API key from Google AI Studio
    • Enter the API key in the sidebar
    • Click "Connect to Gemini"
    • Wait for "GEMINI CONNECTED" status
  3. Start Audio Capture

    • Review available audio devices in the sidebar
    • Click "⏺ Start Capture"
    • Status changes to "LISTENING (GOD MODE)"

Core Workflows

πŸ“ Viewing Live Transcripts

  1. Switch to the "πŸ“ TRANSCRIPT" tab
  2. Observe real-time intelligence cards displaying:
    • Speaker ID: Automatically identified speakers
    • Tone Badges: Emotional state (URGENT, POSITIVE, HESITANT, etc.)
    • Category Tags: Intelligence classification (TASK, DECISION, DEADLINE, etc.)
    • Confidence Scores: Per-transcript accuracy percentage
    • Timestamps: Precise timing information

πŸ•ΈοΈ Exploring the Knowledge Graph

  1. Switch to the "πŸ•ΈοΈ KNOWLEDGE GRAPH" tab
  2. View the interactive SVG visualization:
    • Green Nodes: TASK and DECISION entities
    • Cyan Nodes: PERSON entities
    • Red Nodes: DEADLINE entities
    • Edges: Relationships with labeled connections
    • Animated Layout: Force-directed circular arrangement

βš™οΈ Adjusting God Controls

  1. Switch to the "βš™οΈ GOD CONTROLS" tab
  2. Configure settings:
    • Confidence Threshold: Minimum confidence for displaying transcripts (0.0-1.0)
    • VAD Sensitivity: Voice activity detection threshold (0.0-1.0)
    • Auto-Save: Enable automatic session persistence
    • Optimistic Updates: Show/hide low-confidence partial transcripts
    • Category Filters: Multi-select intelligence categories to display

πŸ’Ύ Managing Sessions

  1. Save Current Session:

    • Click "πŸ’Ύ Save Session" in sidebar
    • Enter a descriptive session title
    • Click "Save"
  2. Load Previous Session:

    • Click "πŸ“‚ Load Session"
    • Browse saved sessions (sorted by date)
    • Click "Load" on desired session
  3. Export Session Data:

    • Click "πŸ“€ Export"
    • Choose format: JSON, CSV, or Markdown
    • Select save location
    • File is written to disk

πŸ§ͺ Running Diagnostics

  1. Switch to the "πŸ§ͺ DIAGNOSTICS" tab
  2. Click "Run All Tests"
  3. Review results:
    • βœ… Audio Latency Test (target: <50ms)
    • βœ… Gemini API Connection Test (target: <200ms)
    • βœ… UI Performance Test (target: 60 FPS)

πŸ—οΈ Architecture

Technology Stack

Backend (Rust)

src-tauri/src/
β”œβ”€β”€ lib.rs                  # Main entry point + Tauri command registry
β”œβ”€β”€ audio_capture.rs        # CPAL audio processing (216 lines)
β”‚   β”œβ”€β”€ AudioState          # Global recording state
β”‚   β”œβ”€β”€ AudioProcessor      # Resampling + mixing
β”‚   └── Tauri Commands      # list_devices, start/stop_capture
β”œβ”€β”€ gemini_client.rs        # WebSocket client (159 lines)
β”‚   β”œβ”€β”€ GeminiConnection    # WebSocket management
β”‚   β”œβ”€β”€ Schema Structures   # IntelligenceOutput, Intelligence, Entity
β”‚   └── Tauri Commands      # test_gemini_connection
β”œβ”€β”€ processing_engine.rs    # Schema validation + graph (180 lines)
β”‚   β”œβ”€β”€ Validators          # Category, tone, schema validation
β”‚   β”œβ”€β”€ KnowledgeGraph      # Thread-safe graph state
β”‚   β”œβ”€β”€ OptimisticTranscript# Partial transcript handling
β”‚   └── Tauri Commands      # validate_json_schema
└── session_manager.rs      # Persistence + export (250+ lines)
    β”œβ”€β”€ SessionManager      # File operations
    β”œβ”€β”€ ExportManager       # Format conversion
    └── Tauri Commands      # save/load/list/delete/export_session

Frontend (Svelte)

src/
β”œβ”€β”€ lib/
β”‚   β”œβ”€β”€ KnowledgeGraph.svelte    # SVG graph visualization (120 lines)
β”‚   β”œβ”€β”€ GodControls.svelte       # Settings panel (140 lines)
β”‚   β”œβ”€β”€ SessionManager.svelte    # Save/load/export UI (310 lines)
β”‚   └── Diagnostics.svelte       # Testing & metrics (140 lines)
β”œβ”€β”€ routes/
β”‚   β”œβ”€β”€ +layout.svelte           # Root layout with CSS import
β”‚   └── +page.svelte             # Main dashboard (450 lines)
└── app.css                      # Haptic-Dark design system

Data Flow

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  Microphone +   β”‚
β”‚  System Audio   β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”˜
         β”‚
         β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ AudioProcessor  β”‚ ◄── Rubato Resampling (16kHz)
β”‚  (Rust/CPAL)    β”‚ ◄── Mono Downmixing
β””β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”˜
         β”‚
         β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ Gemini WebSocketβ”‚ ◄── Base64 Encoding
β”‚   (Tokio-WS)    β”‚ ◄── JSON Schema Setup
β””β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”˜
         β”‚
         β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ Processing      β”‚ ◄── Schema Validation
β”‚   Engine        β”‚ ◄── Graph Updates
β””β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”˜
         β”‚
         β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  Svelte UI      β”‚ ◄── Real-time Updates
β”‚  Dashboard      β”‚ ◄── Optimistic Rendering
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Design System (Haptic-Dark)

Color Palette

--god-primary:  #00e68a  /* Matrix green */
--god-bg:       #0d0d0d  /* Deep black */
--god-panel:    #1a1a1a  /* Panel background */
--god-border:   #00ff41  /* Bright green */
--god-text:     #e5e5e5  /* Light text */

Custom Animations

  • Glitch: Text shadow animation (3s infinite) for title
  • Scan: Vertical scan line effect (8s linear) across screen
  • Pulse-slow: Slow pulsing for status indicators
  • Flicker: Subtle opacity variation for ambient effect

Component Classes

  • .god-panel: Bordered panel with backdrop blur
  • .god-button: Styled action button with hover effects
  • .god-input: Form input with glow on focus
  • .transcript-card: Transcript entry with hover animation
  • .status-indicator: Pulsing dot indicator
  • .glitch-text: Animated glitch effect for text

πŸ“ GOD PROMPT V8 Schema

Intelligence Output Structure

{
  "timestamp_ms": 1234567890,
  "speaker_id": "Speaker_1",
  "transcript_chunk": "Let's discuss the project timeline.",
  "is_final": true,
  "intelligence": {
    "category": ["TASK", "DECISION"],
    "summary": "Discussion about project timeline",
    "tone": "NEUTRAL",
    "confidence": 0.95,
    "entities": [
      {
        "text": "project timeline",
        "type": "TASK",
        "start_ms": 1234567890,
        "end_ms": 1234567900,
        "confidence": 0.92
      }
    ],
    "graph_updates": [
      {
        "node_a": "Project",
        "relation": "HAS_DEADLINE",
        "node_b": "Timeline",
        "weight": 0.9,
        "directional": true,
        "tone_modifier": 0.0
      }
    ]
  }
}

Intelligence Categories (16 Total)

  • TASK: Action items and work assignments
  • DECISION: Decisions made or pending
  • DEADLINE: Time-sensitive commitments
  • QUERY: Questions raised
  • ACTION_ITEM: Specific tasks to be completed
  • RISK: Identified risks or concerns
  • SENTIMENT: General sentiment or mood
  • URGENCY: Urgent matters requiring attention
  • INTERRUPTION: Conversation interruptions
  • AGREEMENT: Points of agreement
  • DISAGREEMENT: Points of disagreement
  • OFF_TOPIC: Off-topic discussions
  • EMOTION_SHIFT: Changes in emotional state
  • DOMINANCE_SHIFT: Changes in conversation dominance
  • EMPATHY_GAP: Lack of empathy detected
  • TOPIC_DRIFT: Conversation topic changes

Emotional Tones (9 Total)

  • URGENT: High priority, time-sensitive
  • FRUSTRATED: Frustration or annoyance
  • EXCITED: Enthusiasm or excitement
  • POSITIVE: Positive sentiment
  • NEGATIVE: Negative sentiment
  • HESITANT: Uncertainty or hesitation
  • DOMINANT: Assertive or commanding
  • EMPATHETIC: Understanding or supportive
  • NEUTRAL: Neutral emotional state

πŸ”§ Configuration

Gemini API Setup

  1. Visit Google AI Studio
  2. Create a new API key
  3. Copy the key
  4. Paste into the GOD PROMPT V8 sidebar
  5. Click "Connect to Gemini"

Audio Settings

Configure in the God Controls panel:

  • Confidence Threshold: Minimum confidence for displaying transcripts (default: 0.7)
  • VAD Sensitivity: Voice activity detection threshold (default: 0.5)
  • Auto-Save: Automatic session persistence (default: enabled)
  • Optimistic Updates: Show partial transcripts (default: enabled)

Session Storage

Sessions are stored in:

Windows: C:\Users\{username}\AppData\Local\GOD-V8\sessions\

πŸ“Š Performance Metrics

Target Benchmarks

Metric Target Typical
Audio Latency <50ms 20-40ms
Gemini API Latency <200ms 100-150ms
UI Frame Rate 60 FPS 55-60 FPS
Memory Usage <500MB 200-400MB

Verification

Run diagnostics to verify performance:

  1. Navigate to "πŸ§ͺ DIAGNOSTICS" tab
  2. Click "Run All Tests"
  3. Review results and metrics

πŸ› οΈ Development

Build for Production

npm run tauri build

This creates a production-ready installer in src-tauri/target/release/bundle/

Run Tests

# Rust tests
cargo test --manifest-path=src-tauri/Cargo.toml

# Frontend type checking
npm run check

Lint and Format

# Check Svelte code
npm run check

# Format Rust code
cd src-tauri
cargo fmt

πŸ“š Documentation

Additional Resources

API Documentation

All Tauri commands are documented in the source code:

  • src-tauri/src/lib.rs - Command registry
  • src-tauri/src/*.rs - Individual module documentation

🀝 Contributing

This project was developed as a demonstration of advanced meeting intelligence capabilities. For production use, consider:

  • Implementing full VAD (Silero model integration)
  • Adding speaker diarization for automatic speaker identification
  • Implementing cloud sync for cross-device session access
  • Adding multi-language support beyond Urdu/English
  • Integrating additional AI models for enhanced analysis

πŸ“„ License

MIT License

Copyright (c) 2025 Shehr Bano & Anila Younas

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.


πŸ™ Acknowledgments

Technologies

Special Thanks

  • Google AI for providing the Gemini API
  • The Rust and Svelte communities for excellent tooling
  • Open source contributors who made this project possible

πŸ“ž Support

For issues, questions, or feature requests:

  1. Check the documentation
  2. Review the walkthrough
  3. Run diagnostics to verify system health
  4. Contact the developers:
    • Shehr Bano: Backend & AI Integration
    • Anila Younas: Frontend & UX Design

🎯 Project Statistics

  • Total Lines of Code: ~2,100+
  • Rust Modules: 5 (1,200+ lines)
  • Svelte Components: 6 (900+ lines)
  • Tauri Commands: 17
  • Features Implemented: 50+
  • Development Time: 4 hours
  • Completion Status: 100%

Built with ❀️ using Rust, Svelte, and Gemini AI

Developed by Shehr Bano & Anila Younas

GOD PROMPT V8 - Where Intelligence Meets Invisibility

Top categories

Loading Svelte Themes