Skip to content
View wuwangzhang1216's full-sized avatar
🎯
Focusing
🎯
Focusing
  • San Francisco

Organizations

@Lightning-Goods

Block or report wuwangzhang1216

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
wuwangzhang1216/README.md

Hey, I'm Steve Wu 👋

AI Engineer & LLM Researcher

Building AI-native tools that actually ship. Obsessed with making large language models more useful, more controllable, and more open.


🚀 What I'm Working On

  • 🔬 Abliterix — Fully automated LLM abliteration framework. LoRA + Optuna TPE optimization, 135+ model configs, 9 peer-reviewed techniques (NeurIPS/ACL/ICLR). 0–1.5% refusal rate with 0.01 KL divergence.
  • 🗄️ OpenDB — AI-native database & long-term memory for AI agents. 93.6% on LongMemEval (#3 on leaderboard), zero embeddings, zero vector DBs — just SQLite FTS5. 12 MCP tools, works with every major agent framework.
  • 🦬 OpenYak — Open-source local-first AI desktop app supporting 100+ models
  • 🧠 LLM alignment research — abliteration, representation engineering & steering vectors
  • 🔍 Vector search engines & RAG pipelines at scale
  • 🤖 Multi-agent orchestration — built from scratch, no heavy frameworks
  • 📝 AI-native writing & knowledge systems
  • 🔬 Publishing fine-tuned & abliterated models on HuggingFace

🛠 Tech Stack

Languages
Python C++ Rust Go CUDA TypeScript

AI/ML & Deep Learning
PyTorch Transformers HuggingFace PEFT DeepSpeed vLLM

Research & Techniques
RepEng Abliteration RLHF Quantization FlashAttention KnowledgeDistillation MoE

LLM APIs & Inference
OpenAI Anthropic Ollama

Vector Databases & Retrieval
FAISS pgvector Milvus

System Design & Distributed Systems
Microservices DistributedSystems EventDriven APIDesign Kafka RabbitMQ LoadBalancing Caching Sharding HA Scalability Observability

Full Stack
FastAPI Next.js React PostgreSQL Redis

Infra
Docker Kubernetes AWS Linux


🎓 Background

  • 🎓 Honours BSc in Computer Science & Mathematics, University of Toronto (3.95/4.0)
  • 💼 10+ years in Databases, LLMs & AI Agent Systems

HuggingFace

Pinned Loading

  1. openyak/openyak openyak/openyak Public

    Open-source local-first AI agent for desktop work. No account, no telemetry: use local models with Ollama/Rapid-MLX or bring your own provider key.

    Python 777 56

  2. abliterix abliterix Public

    Automated alignment adjustment for LLMs — direct steering, LoRA, and MoE expert-granular abliteration, optimized via multi-objective Optuna TPE.

    Python 213 39

  3. claude-code-source-all-in-one claude-code-source-all-in-one Public

    Always up-to-date open-source mirror of Claude Code (currently v2.1.123). Run from source with Claude subscription/API, ChatGPT subscription (GPT-5.5 / GPT-5.4), OpenAI-compatible providers, or loc…

    TypeScript 78 93

  4. openDB openDB Public

    AI-native local database for files, search, and long-term agent memory.

    Python 82 20

  5. ora ora Public

    Real-time on-device speech translation for macOS. Silero VAD + Qwen3-ASR-1.7B + Qwen3.5 (MLX) on Apple Silicon. No cloud, no API keys, no telemetry.

    Swift 36 4