Moore Threads Tech Deep Dive
From Silicon Provider to Turnkey AI Factory
Moore Threads operates at the intersection of domestic substitution and the explosive demand for artificial intelligence compute infrastructure in China. Unlike many of its domestic peers that focus exclusively on artificial intelligence accelerators, Moore Threads designs general-purpose graphics processing units capable of executing a wide spectrum of workloads, including 3D graphics rendering, scientific simulation, and large language model training. The company generates revenue through the sale of enterprise and consumer graphics cards, but its primary growth engine has rapidly shifted toward integrated infrastructure solutions. Historically, the business model relied on selling individual accelerator cards like the MTT S4000 and MTT S5000 series to data centers. However, the company is now executing a systemic pivot toward delivering comprehensive artificial intelligence factories. This transition is anchored by the KUAE intelligent computing cluster, a bundled hardware and software solution that networks tens of thousands of proprietary graphics processing units into a single computing fabric. By supplying the entire stack, spanning the underlying silicon to the high-speed interconnects and rack-level integration, Moore Threads captures significantly higher contract values and dramatically lowers the adoption barrier for enterprise clients attempting to train frontier artificial intelligence models.
The financial impact of this strategic pivot is increasingly visible. For fiscal year 2025, Moore Threads reported revenue of 1.5 billion yuan, representing a 243% year-over-year expansion, supported by a highly robust gross margin of 65.6%. The momentum carried into the first quarter of 2026, with revenue reaching 738 million yuan, an increase of 155% year-over-year. Crucially, the scaling of contract sizes through cluster deployments allowed the company to swing to a net profit of 29.4 million yuan in Q1 2026, marking its first profitable quarter and signaling a structural transition from early-stage technological validation to scaled commercialization. A recently secured 660 million yuan order for a single KUAE cluster effectively validates this turnkey business model, demonstrating that domestic hyperscalers are willing to commit significant capital expenditure to Moore Threads hardware.
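The headline percentages imply base-period figures that are worth sanity-checking. The short script below back-solves them from the numbers reported above; the implied prior-period revenues are derived, not disclosed figures.

```python
# Sanity-check the implied base figures behind the reported growth rates.
# All figures in millions of yuan; growth rates as reported in the text.

fy2025_revenue = 1500.0  # reported FY2025 revenue (1.5 billion yuan)
fy2025_growth = 2.43     # +243% year over year
fy2024_implied = fy2025_revenue / (1 + fy2025_growth)  # ~437M yuan

q1_2026_revenue = 738.0  # reported Q1 2026 revenue
q1_growth = 1.55         # +155% year over year
q1_2025_implied = q1_2026_revenue / (1 + q1_growth)    # ~289M yuan

gross_margin = 0.656     # reported FY2025 gross margin
fy2025_gross_profit = fy2025_revenue * gross_margin    # ~984M yuan

print(f"Implied FY2024 revenue:  {fy2024_implied:.0f}M yuan")
print(f"Implied Q1 2025 revenue: {q1_2025_implied:.0f}M yuan")
print(f"FY2025 gross profit:     {fy2025_gross_profit:.0f}M yuan")
```

The derived base of roughly 437 million yuan for fiscal 2024 underscores how small the business was before the cluster pivot took hold.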
A Bifurcated Supply Chain: Sovereign Demand Meets Foundry Bottlenecks
The customer base for Moore Threads is expanding rapidly, driven largely by geopolitical imperatives and Chinese central government directives mandating the localization of artificial intelligence infrastructure. End customers primarily consist of state-owned telecommunications operators, including China Mobile and China Telecom, alongside domestic internet giants such as Kuaishou and ByteDance. These entities are utilizing Moore Threads hardware to construct massive data centers capable of handling the computing intensity required by domestic large language models. Furthermore, hardware integrators and server original equipment manufacturers like Lenovo act as direct customers, packaging Moore Threads chips into enterprise-grade servers for broader distribution. The urgency among these buyers has intensified following the United States government restrictions on exporting advanced silicon, leaving domestic artificial intelligence developers with limited alternatives to power their computational needs.
However, the supplier dynamics present the most critical risk vector for the company. Operating as a fabless semiconductor designer, Moore Threads historically relied on Taiwan Semiconductor Manufacturing Company for its advanced 7-nanometer production. Following the company's placement on the United States Bureau of Industry and Security Entity List in late 2023, access to foreign foundries and advanced high-bandwidth memory from global suppliers like SK Hynix and Samsung was abruptly severed. Consequently, Moore Threads has been forced to completely restructure its supply chain, relying on domestic foundries, predominantly Semiconductor Manufacturing International Corporation, to manufacture its advanced nodes. While Semiconductor Manufacturing International Corporation has demonstrated the capability to produce 7-nanometer equivalent chips using deep ultraviolet lithography, the process suffers from severe capacity constraints and suboptimal yields. Similarly, the company must now source memory components from domestic integrated device manufacturers such as Yangtze Memory Technologies and ChangXin Memory Technologies. The ultimate ceiling on Moore Threads' growth is not customer demand, but rather the volume of functional wafers its domestic manufacturing partners can reliably supply.
The Domestic Battlefield and Market Share Dynamics
The competitive landscape in the Chinese artificial intelligence accelerator market is undergoing a structural realignment. Historically, Nvidia held a near-monopoly, capturing upwards of 90% of the market. However, export controls have restricted Nvidia to selling degraded silicon, such as the H20 chip, fundamentally altering the value proposition for Chinese buyers. Industry data suggests Nvidia's market share in China fell to roughly 50% by late 2025 and is projected by industry specialists to decline into the single digits in the coming years. Into this vacuum, a fiercely competitive domestic ecosystem has emerged. Huawei remains the dominant domestic incumbent, utilizing its Ascend 910B and newly deployed 950 series chips alongside its proprietary CANN software ecosystem to secure the majority of government and telecom contracts. Huawei holds the clear lead in volume, manufacturing allocation, and institutional adoption.
Moore Threads, which held an estimated 1% of the Chinese artificial intelligence accelerator market in 2024, is aggressively contesting the remaining market share against a cohort of well-capitalized domestic pure-play integrated circuit designers. Key competitors include Cambricon, which recently achieved a 500,000-unit volume target for its Siyuan series chips, as well as Hygon, Biren Technology, and MetaX. The emergence of heavily funded new entrants like MetaX, which recently went public and is deploying massive capital into research and development to narrow the performance gap, highlights the intense rivalry within the sector. While Huawei represents an entrenched ecosystem competitor, Moore Threads differentiates itself from domestic startups by focusing on general-purpose graphics architectures rather than application-specific integrated circuits tailored solely for artificial intelligence matrix workloads. This structural flexibility allows Moore Threads to compete simultaneously in the consumer graphics and enterprise artificial intelligence markets, amortizing research costs across a much larger total addressable market.
Software Compatibility and the Universal Architecture
The primary competitive moat for any semiconductor company is its software ecosystem, and this is where Moore Threads demonstrates structural advantages over its domestic peers. Breaking the monopoly of Nvidia's Compute Unified Device Architecture is the defining challenge for alternative silicon providers. Chinese developers have spent years building applications within the Nvidia ecosystem, making the switching costs prohibitively high. To address this, Moore Threads developed the Moore Threads Unified System Architecture, known as MUSA, a software ecosystem designed for high compatibility with existing international standards. Through its proprietary MUSIFY translation tools, developers can port existing code with minimal friction. This platform natively supports mainstream frameworks like PyTorch and major domestic large language models, significantly reducing the friction of adoption.
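MUSIFY itself is proprietary and its internals are not public. As a conceptual illustration only, the toy sketch below shows what source-level rewriting of CUDA runtime calls into MUSA-prefixed equivalents could look like; the `musa*` names and rewrite rules here are assumptions for illustration, not the real tool's behavior or a confirmed API surface.

```python
import re

# Toy sketch of source-to-source porting in the spirit of a CUDA-to-MUSA
# translation tool. The musa* target names are illustrative assumptions;
# the real MUSIFY tool is proprietary and far more sophisticated.
RULES = [
    (r"\bcudaMalloc\b", "musaMalloc"),
    (r"\bcudaMemcpy\b", "musaMemcpy"),
    (r"\bcudaFree\b", "musaFree"),
    (r"\bcudaDeviceSynchronize\b", "musaDeviceSynchronize"),
    (r"\bcuda_runtime\.h\b", "musa_runtime.h"),
]

def port_source(cuda_src: str) -> str:
    """Apply each rewrite rule in order to the source text."""
    for pattern, replacement in RULES:
        cuda_src = re.sub(pattern, replacement, cuda_src)
    return cuda_src

cuda_code = "#include <cuda_runtime.h>\ncudaMalloc(&ptr, n); cudaFree(ptr);"
print(port_source(cuda_code))
```

A production translator operates on the abstract syntax tree rather than raw text, but the economic point is the same: if the application programming interface surfaces map closely, porting cost collapses, which is precisely the switching-cost argument made above.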
This software agility was empirically validated in April 2026 during the release of the DeepSeek-V4 trillion-parameter model. Moore Threads was among a select group of domestic providers to achieve synchronous Day-0 adaptation on the day of the release. This milestone proved that the domestic computing stack could function effectively independently of foreign software ecosystems. By maintaining an architecture that natively handles complex vector math for 3D rendering alongside matrix multiplication for artificial intelligence inference, Moore Threads captures hardware efficiencies that pure-play artificial intelligence accelerator startups struggle to replicate. This full-stack ecosystem capability is reflected in the 450,000 developers currently utilizing the company's platform.
Scaling Up: Next-Generation Silicon and the 100,000-Card Cluster
The sheer computing power required to train the next generation of multimodal frontier models presents a profound technological opportunity. Market demand is shifting from isolated server racks to unified computing clusters housing tens of thousands of chips. Recognizing this, Moore Threads is heavily investing in technologies to scale its KUAE infrastructure. The company is advancing supernode architectures designed to interlink up to 100,000 graphics processing units, tackling immense engineering challenges related to high-speed optical interconnects, thermal management, and systemic fault tolerance. Achieving reliable uptime in a cluster of this magnitude will dictate the company's ability to service the most advanced tier of artificial intelligence laboratories.
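The fault-tolerance challenge becomes concrete with a back-of-the-envelope reliability model. The per-device mean time between failures below is an illustrative assumption, not a Moore Threads specification; the point is the scaling behavior, not the exact figure.

```python
# Back-of-the-envelope failure-rate model for a very large GPU cluster.
# The per-device MTBF is an illustrative assumption, not vendor data.

num_gpus = 100_000
mtbf_hours = 200_000  # assumed mean time between failures per device

# Treating failures as independent, the expected cluster-wide failure
# rate scales linearly with device count:
failures_per_hour = num_gpus / mtbf_hours            # 0.5 per hour
mean_hours_between_failures = mtbf_hours / num_gpus  # 2.0 hours

print(f"Expected cluster-wide failures per hour: {failures_per_hour}")
print(f"Mean time between any failure: {mean_hours_between_failures} h")
```

Even with a generous assumed device reliability, some component in a 100,000-card cluster fails roughly every two hours, which is why checkpointing, automatic fault isolation, and rapid job restart dominate the engineering agenda at this scale.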
To power these ultra-large clusters, Moore Threads is finalizing its next-generation silicon, slated for mass production later in 2026. The new product matrix includes the Huagang architecture, a dedicated artificial intelligence training and inference platform expected to deliver a 50% improvement in compute density and a massive leap in energy efficiency. Concurrently, the company is developing the Huashan architecture for professional visualization and the Lushan system-on-chip to capture edge computing demand. The primary threat to this technological roadmap remains the severe physical limitations imposed by the United States Entity List. While Moore Threads continues to design highly sophisticated logic, the inability to access gate-all-around transistor technology or extreme ultraviolet lithography means physical performance gains must be increasingly extracted through architectural ingenuity and advanced packaging rather than pure node shrinkage. If domestic foundries fail to commercialize competitive advanced packaging techniques or hit an absolute wall in deep ultraviolet multipatterning yields, the performance gap between Moore Threads and global leaders could widen irrevocably.
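The multipatterning yield risk compounds geometrically, which a simple model makes visible. Every number below is an assumption chosen for illustration; real foundry yield data are closely guarded.

```python
# Illustrative compound-yield model for multipatterned lithography.
# All per-step yields and layer counts are assumptions for illustration,
# not foundry data.

per_step_yield = 0.99  # assumed yield of one patterning exposure step
steps_single = 1       # EUV: one exposure for a critical layer
steps_multi = 4        # DUV quadruple patterning of the same layer
critical_layers = 10   # assumed number of critical layers on the die

# Yield compounds multiplicatively across every exposure step:
yield_euv = per_step_yield ** (steps_single * critical_layers)
yield_duv = per_step_yield ** (steps_multi * critical_layers)

print(f"Single-exposure yield across critical layers: {yield_euv:.3f}")
print(f"DUV multipatterning yield:                    {yield_duv:.3f}")
```

Under these assumed inputs the multipatterned flow loses roughly a quarter of its dice relative to the single-exposure flow before any other defect source is counted, which is the arithmetic behind the "absolute wall" concern noted above.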
A Clinical Approach to Execution
The operational velocity of Moore Threads is a direct reflection of its management team, led by founder and chief executive officer James Zhang. As a former global vice president and the general manager of Nvidia China for over 15 years, Zhang possesses an unparalleled understanding of both graphics architecture and the intricacies of the domestic procurement market. His tenure at the global market leader allowed him to recruit top-tier engineering talent and instill an aggressive execution culture. Under his leadership, the company delivered its first operational graphics processing unit within 300 days of inception, a pace rarely observed in the semiconductor industry.
Management's track record over the past few years is characterized by highly effective capital allocation and strategic foresight. Anticipating the tightening of export controls, the leadership team preemptively stockpiled intellectual property and successfully navigated the transition to domestic foundries without stalling product iteration. Financially, management demonstrated discipline by aggressively pushing for commercial deployment of the KUAE clusters rather than remaining reliant on piecemeal chip sales. This strategic pivot culminated in a highly successful public listing on the STAR Market in late 2025, providing the company with over $1 billion in crucial working capital to fund the 1.3 billion yuan in annual research and development expenditures. The ability of the executive team to transition the company from a pre-revenue startup to a profitable, publicly traded enterprise integrated into the sovereign artificial intelligence supply chain within six years is a testament to their operational rigor.
The Scorecard
Moore Threads represents a high-beta vehicle for structural artificial intelligence compute localization in China. The core thesis rests on the company's successful transition from a component supplier to an integrated infrastructure provider, evidenced by the high-value commercialization of its KUAE cluster systems. Surging triple-digit revenue growth, expanding gross margins north of 65%, and a definitive pivot to profitability in the first quarter of 2026 demonstrate real traction against an expanding addressable market. Its general-purpose graphics architecture and deep software compatibility layers provide a tangible competitive advantage over fragmented domestic application-specific integrated circuit developers, positioning the company as the most credible architectural alternative to Nvidia within the domestic ecosystem.
Conversely, the investment case is heavily constrained by profound supply chain vulnerabilities and intense domestic competition. Placement on the United States Entity List permanently severs access to leading-edge foreign foundries and high-bandwidth memory, creating a structural ceiling dictated entirely by the capacity and yield maturation of domestic manufacturing partners. Furthermore, Huawei maintains a dominant market share and an entrenched institutional ecosystem, while aggressively funded domestic entrants like MetaX threaten to compress margins. The ultimate trajectory of Moore Threads relies on its ability to circumvent hardware manufacturing bottlenecks through architectural brilliance, a high-stakes engineering challenge carrying immense execution risk.