Provide observability metrics, kernel events, tracing, and profiling for containers, physical machines, and controllers.
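RDMA Key Technologies: Memory Window
Part of the RDMA Key Technologies series, this article on memory windows (MW) covers their role, the user-space interface, the kernel implementation, the underlying hardware principles, and some closing thoughts.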
Low Overhead
Zero Instruction
Intelligent Adaptive Backoff
Leveraging BPF to deliver full-stack observability and deep performance insights into memory, scheduling, networking, and block I/O, with 1% overhead.
Event-driven context snapshot with instrumentation across kernel slow paths. Automatic reports for system-wide events, including scheduling, networking, and interrupts.
Pinpoint performance degradation in cloud-native environments with tracing-based intelligence. Automated tracing for CPU idle drops, I/O latency, and Loadavg spikes.
Continuous profiling of the OS kernel and multi-language applications (e.g., Java, Python, Go, C/C++) across CPU, memory, I/O, and locks. Driving business innovation.
Network-centric, service-oriented request tracing across distributed systems. Delivers end-to-end visibility into microservices, ensuring system stability in large-scale environments.
Integrates with open-source observability stacks. Supports bare-metal and cloud-native deployments. Aware of Kubernetes containers, labels, and annotations. Supports mainstream Linux distributions.
Kubernetes, Distributed Computing
Provide observability metrics, kernel events, tracing, and profiling for containers, physical machines, and controllers.
Large model training and inference
Extend GPU fault detection, delivering metrics and events for CPU, Memory, PCIe, HCA, and other components.
Integrated Linux Operating Systems
Support for Huawei EulerOS, Alibaba Anolis OS, Tencent TencentOS, and other mainstream operating systems.
Data Center, Bare Metal
Applicable to infrastructure services, including storage, big data, message queues, microservices, and other systems.
Disaster Backup and Recovery
Support chaos engineering, fault injection, and data center disaster recovery, enabling users to understand application behavior.
By HAO022 on 2026-04-09
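RDMA Key Technologies: Memory Window (MW). This article analyzes the role of the memory window, the user-space interface, the kernel implementation, the underlying hardware principles, and some closing thoughts.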
By HAO022 on 2026-04-08
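RDMA Key Technologies: QPs. This article analyzes the roles of QPs, SQs, and RQs, the user-space interface, the kernel implementation, the underlying hardware principles, and some closing thoughts.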
By HAO022 on 2026-04-07
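RDMA Key Technologies: Memory Region. This article analyzes the role of the memory region, the kernel implementation, the underlying hardware principles, and some closing thoughts.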
By 王洪磊 on 2026-03-26
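This article introduces how hardware fault detection works in the Linux kernel, covering RAS, MCE, AER, and related mechanisms. The HUATUO (华佗) project relies on these detection mechanisms to implement general-purpose hardware fault monitoring.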
(C) 2025-2026, HUATUO Open Source Community.