Recent Articles

Andrej Karpathy's in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

Introduction, pretraining data (internet), tokenization, neural network I/O, neural network internals, inference

France's LLM representative team Mistral launches le Chat: watching the daily battles of the gods

The all new le Chat: Your AI assistant for life and work

Apple AI Desk Lamp ELEGNT: Making robots move more naturally and expressively

ELEGNT: Expressive and Functional Movement Design for Non-anthropomorphic Robot

Andrew Ng's latest release on agent-based object detection: Agentic Object Detection

Reasoning-driven object detection: human-like precision via text prompts without the overhead of custom training

ByteDance's OmniHuman-1: Generating realistic human videos from a single human image

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Researcher AI Tools List compiled by Texas Tech University

AI-Based Literature Review Resources

Alibaba's EMO2: Audio-Driven Talking Head Generation

EMO2: End-Effector Guided Audio-Driven Avatar Video Generation

Snap's Wonderland: Generating 3D Scenes from a Single Image

Wonderland: Navigating 3D Scenes from a Single Image

OpenAI Deep Research: Intelligent research assistant launched

An agent that uses reasoning to synthesize large amounts of online information and complete multi-step research tasks.