Skip to content
View manishklach's full-sized avatar

Block or report manishklach

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
manishklach/README.md

Hi, I'm Manish

I build kernel-leaning systems for AI infrastructure, KV-cache orchestration, memory hierarchy control, and low-latency runtime behavior.

Most of my work sits close to the machine: Linux control planes, kernel-facing memory and I/O experiments, CPU scheduling and latency behavior, KV-state movement, and research prototypes that make systems ideas concrete.

About Me

  • I like working on the hard parts of systems for AI: memory placement, I/O paths, scheduler behavior, latency control, and observability.
  • I build across the stack, but I naturally gravitate toward Linux, kernel-adjacent interfaces, CPU and memory behavior, and runtime control planes.
  • I care about making low-level work legible, so I pair code with diagrams, RFC-style docs, and architecture-driven writeups.

Current Focus

  • KV-cache orchestration and memory residency control
  • Linux kernel control planes for inference workloads
  • CPU, IRQ, scheduler, and latency-path tuning
  • Storage and I/O behavior for decode-critical serving paths
  • Systems observability for real AI infrastructure

Selected Projects

  • kairo-io
    AI KV-cache-aware Linux block I/O work focused on decode-priority scheduling, placement metadata, NVMe backend mapping, and kernel tracepoint visibility.

  • kv-cpu-driver
    A Linux control-plane and hardware-interface scaffold for semantic KV-cache orchestration, with CPU-facing coordination, RTL structure, and FPGA emulation paths.

  • linux-hbf-control-plane
    An RFC-style Linux exploration of runtime-guided memory placement, prefetch, promotion, and demotion for future CXL/HBF-era inference systems.

  • kernel-dvfs-agentic-latency
    A kernel latency control-plane project spanning DVFS, cpuidle, IRQs, scheduler behavior, MM, VFS, I/O, and cgroup budget control for latency-sensitive AI execution.

  • ai-host-observability
    Linux-first host observability for GPU and RDMA systems, built to surface memory pressure, reclaim, PCIe, NUMA, IRQ, and host-side failure signals before they become incidents.

Writing And Portfolio

I regularly publish architecture-driven essays and technical companion material around Linux systems, AI infrastructure, memory-centric design, and runtime control.

What You’ll Find Here

  • Kernel and kernel-adjacent experiments
  • KV-cache, memory, CPU, and I/O control-plane ideas
  • Systems observability and performance tooling
  • RFC-style writeups, architecture notes, and technical docs
  • Research prototypes tied to real implementation artifacts

If you're into Linux internals, memory systems, KV-cache control, CPU latency paths, or kernel-facing AI infrastructure work, you'll probably find something interesting here.

Popular repositories Loading

  1. manishklach.github.io manishklach.github.io Public

    Portfolio hub for systems architecture, AI infrastructure, patents, and technical microsites.

    HTML 2

  2. chromelens chromelens Public

    Full-site performance X-ray powered by Chrome DevTools Protocol traces

    Python 2

  3. linux-hbf-control-plane linux-hbf-control-plane Public

    Experimental Linux RFC for an HBF/CXL-era AI memory control plane: runtime hints, prefetch, placement, and tiering.

    2

  4. kernel-dvfs-agentic-latency kernel-dvfs-agentic-latency Public

    Kernel latency control plane for agentic AI: DVFS, cpuidle, IRQ, scheduler, workqueue, I/O, MM, VFS, and cgroup budgets.

    C 2

  5. sram-inference-kernel-fastpath sram-inference-kernel-fastpath Public

    Experimental Linux kernel fast-path patches for SRAM-based AI inference servers, targeting io_uring submission, registered buffers, CQ polling, wakeup attribution, and completion latency.

    C 2

  6. kv-cpu-driver kv-cpu-driver Public

    Reference Linux control plane, RTL, and FPGA emulation scaffold for KV-CPU semantic KV-cache orchestration. Patent pending in India (App No. 202641056309).

    C 2