▌ NICK COMMAND BASE ▐
2026-06-07 04:38

▌ beginner-domains.md

▒ PATH: MemPalace Archive/ai-course/beginner-domains.md
▒ SIZE: 1.9 KB
▒ MODIFIED: 2026-05-18 04:03
← BACK TO VAULT
# AI Beginner Course — Main Domains Introductory overview of the major AI domains for complete beginners. ## The 10 Main AI Domains ### 1. Text AI Large Language Models (LLMs), chatbots, writing assistants, code generation. Examples: ChatGPT, Claude, Gemini, code completion tools. ### 2. Image AI Image generation, editing, and visual understanding. Examples: DALL-E, Stable Diffusion, Midjourney, CLIP, SAM (segmentation). ### 3. Video AI Video generation, editing, and understanding. Examples: Sora, Runway, Pika, Kling. ### 4. Audio AI Speech synthesis (TTS), speech recognition (STT), music generation. Examples: Whisper (STT), Suno/MusicGen (music), ElevenLabs (voice). ### 5. 3D AI 3D scene generation, neural rendering, robot vision. Examples: NeRF, Gaussian splatting, Tripo3D. ### 6. Reasoning / Agentic AI AI that plans, uses tools, and autonomously completes multi-step tasks. Examples: AI agents that browse the web, write code, book travel. **Current frontier** — getting AI to actually *DO* things vs just generate content. ### 7. Speech / Conversation AI Real-time voice assistants, two-way voice conversation, voice cloning. Examples: Live voice chat, phone assistants. ### 8. Multimodal AI Models that combine text + image + audio + video simultaneously. Examples: GPT-4V, Gemini, LLaVA — understand images and respond, see-and-talk models. ### 9. Scientific AI AI for scientific discovery — protein folding, drug design, materials science. Examples: AlphaFold (proteins), GNoME (materials), weather prediction. ### 10. Robotics AI AI for physical world interaction — locomotion, manipulation, autonomous machines. Examples: Robot dogs, humanoid robots, self-driving. --- ## Key Takeaway for Beginners The biggest current trend: **AI agents** — AI that doesn't just generate content but actually performs tasks end-to-end (browses, clicks, types, makes decisions). This is where most of the development momentum is right now.
▒▒▒ READY CPU: 12% MEM: 4.2G NET: OK OBSIDIAN ▒ VIEWING