Private Deployment Stack: China Enterprise Architecture
Analysis
This synthesis examines the complete technology stack for private agent deployment in Chinese enterprises, integrating insights from infrastructure, security, compliance, and platform integration.
The Four-Layer Stack
Chinese enterprises deploying private agents follow a standardized four-layer architecture:
Layer 1 - Infrastructure (Hardware + Hosting)
- Compute: Huawei Ascend NPU (domestic) or NVIDIA GPU (if available)
- Hosting: Alibaba Cloud VPC, Tencent Cloud, Huawei Cloud, or on-premise data centers
- Storage: Distributed file systems (Ceph) for model weights and training data
- Network: Isolated VPC with controlled egress, no direct internet access
Layer 2 - Model + Serving
- LLM: Qianwen (Alibaba), ERNIE (Baidu), Hunyuan (Tencent), Pangu (Huawei)
- Inference: vLLM, TensorRT-LLM for optimization
- Fine-tuning: LoRA adapters for domain-specific customization
- Registry: Internal model catalog with version control
Layer 3 - Agent Runtime + Security
- Framework: OpenClaw (open-source) or proprietary
- Harness: Permission enforcement, audit logging, rate limiting
- Sandbox: Firecracker microVMs for isolated execution
- Skill registry: Internal catalog of approved skills
Layer 4 - Integration + Compliance
- Enterprise systems: ERP, CRM, HRIS connectors
- Platform integration: DingTalk, Feishu, WeChat Work
- Monitoring: OpenTelemetry, Prometheus, Grafana
- Compliance: MLPS 2.0 audit logs, PIPL data handling
Three Deployment Patterns
Pattern A: Full On-Premise (Government, Finance, Defense)
- All four layers in enterprise data center
- Zero external dependencies
- Highest security, highest cost (10-100x vs. cloud)
- Example: Ministry of Finance using Huawei stack
Pattern B: Hybrid Cloud (Manufacturing, Retail, Healthcare)
- Layers 1-2 on domestic cloud (Alibaba/Tencent)
- Layers 3-4 on-premise for sensitive data
- Balance of cost and control
- Example: Manufacturing firm using Alibaba Cloud + on-premise harness
Pattern C: Private Cloud (Tech Companies, Startups)
- All layers on dedicated VPC (Alibaba/Tencent/Huawei)
- Isolated from other tenants
- Compliance with data localization
- Example: SaaS startup using Alibaba Cloud VPC
Key Differentiators vs. Western Deployments
1. Domestic Technology Preference
- 80% of large enterprises prefer Chinese vendors
- Driven by: Regulations, security concerns, government incentives
- Huawei Ascend NPU adoption growing despite NVIDIA superiority
2. Platform Integration Strategy
- Agents embedded in DingTalk, Feishu, WeChat (not standalone apps)
- “Agent-as-feature” vs. Western “agent-as-product”
- Instant distribution to hundreds of millions of users
3. Compliance-First Architecture
- MLPS 2.0 and PIPL requirements baked into Layer 4
- Audit logging mandatory, not optional
- Data localization enforced at network layer
4. Government Support
- Subsidies up to 10M yuan for agent projects
- Rent-free office space in tech zones
- Mandates driving adoption (67% of industrial firms)
Cost-Benefit Analysis
Full On-Premise
- Upfront: ¥5-50M for hardware (GPUs, servers, storage)
- Annual: ¥2-10M for power, cooling, maintenance, staff
- Break-even: 3-5 years vs. cloud at high utilization
- Justification: Regulatory compliance, data sovereignty
Hybrid Cloud
- Upfront: ¥1-5M for on-premise harness
- Annual: ¥500K-5M for cloud + on-premise
- Break-even: 1-2 years vs. full on-premise
- Justification: Balance cost and control
Private Cloud
- Upfront: ¥100K-1M for setup
- Annual: ¥200K-2M for cloud services
- Break-even: Immediate vs. on-premise
- Justification: Compliance without infrastructure burden
Critical Success Factors
1. Talent
- Need hybrid skills: AI/ML + enterprise architecture + domain expertise
- Training takes 6-12 months minimum
- Shortage of qualified engineers drives up costs
2. Vendor Selection
- Alibaba: Best for e-commerce, retail, logistics
- Tencent: Best for consumer services, gaming, social
- Huawei: Best for government, SOEs, manufacturing
- ByteDance: Best for content, marketing, youth market
3. Phased Rollout
- Start with single department, single use case
- Prove ROI before scaling
- Budget 2-3x initial estimates
- Expect 50% of experiments to fail
4. Compliance Planning
- MLPS 2.0 certification takes 6-12 months
- Audit logs must be retained for 6+ months
- Data localization enforced at network layer
- Regular security audits required
Supporting Evidence
- From China Enterprise Agent Landscape: 80% of large enterprises require on-premise options, 67% of industrial firms have integrated AI
- From High-Privilege Agent Infrastructure: Four-layer harness architecture, Firecracker sandboxing, OWASP Agentic Top 10
- From AI Infrastructure Industry Report: Alibaba/Tencent/Huawei Cloud comparison, Ascend NPU vs. NVIDIA
- From Institutional AI vs Individual AI: Institutional intelligence requirements align with Chinese enterprise culture
- From Skill Factory Risk Analysis: Gartner 40% failure prediction, cost-benefit analysis