Tencent Cloud, the cloud business of global technology company Tencent, announced the upgrade of its AIoT 2.0 product solutions, which will deliver seamless hardware-software integration with ready-to-use multimodal capabilities.
The enhanced offering features TWeTalk (voice intelligence system) and TWeSee (video intelligence system), both deeply integrated into Tencent Cloud's foundational AIoT platform for device management, messaging, and end-to-end audio & video communications. Together, these solutions provide users with a comprehensive, intuitive one-stop service, while also supporting the global expansion of smart hardware.
As artificial intelligence continues to advance, AIoT products are transitioning from simply being "AI-assisted" to becoming truly "AI-driven." By addressing challenges such as complex system integrations, model adaptations, and high development and deployment costs, AIoT 2.0 harnesses Tencent Cloud's deep expertise in AI, IoT, and audio & video technologies. These solutions facilitate seamless integration between hardware and software, making interactions among people, devices, and systems smarter, more efficient, and more user-friendly.
AIoT 2.0: A New Standard for Intelligent Device Development
The upgraded AIoT 2.0 product suite—featuring TWeTalk, TWeSee, and the foundational AIoT platform—sets a new benchmark for intelligent device development.
TWeTalk delivers a comprehensive AI conversational solution for smart hardware, with a focus on advanced Voice Agent functionality and real-time audio-video communication capabilities (TRTC/WebSocket). On the device side, TWeTalk partners with leading embedded chip and module manufacturers to integrate core technologies such as noise reduction, wake word detection, and sound source localization. On the cloud side, it provides essential services and distinctive features including speech-to-text (STT) with emotion recognition, voice broadcasting, text-to-speech (TTS), voice customization, WeChat call integration, and conversation highlights. TWeTalk also offers robust AI platform capabilities such as model fine-tuning, agent customization, knowledge graphs, retrieval-augmented generation (RAG), and model context protocol (MCP).
Additionally, TWeTalk integrates seamlessly with Tencent's extensive ecosystem, broadening its range of applications. For communications, TWeTalk enables two-way audio & video calls (VoIP) between devices and WeChat, making it suitable for a wide array of use cases. In terms of content, it supports integration with QQ Music, allowing users to easily access music through voice commands for a more convenient listening experience. Looking ahead, TWeTalk will also enable wearable devices such as smartwatches and AR glasses to support WeChat Pay, enabling smooth and secure payments directly from these devices.
Users can customize these features to suit their specific business needs, assembling them in a modular, building block–style approach for highly tailored solutions. Through close collaboration with leading chip and module partners, TWeTalk ensures optimized compatibility, allowing users to deploy finely tuned large AI models with minimal chip resources. This approach delivers cost-effective, full-scenario edge-cloud collaboration and significantly lowers the development barrier for smart hardware.
Currently, TWeTalk is widely used across a range of applications, including companion toys, robotics, smart wearables, live translation headphones, smart ordering systems, intelligent tours, and AI interviews. It enables natural conversations between users and smart devices with end-to-end latency below 800 ms. For example, Dahua's intelligent surveillance solutions leverage Tencent Cloud's technology to allow IPC (Internet Protocol Camera) devices with screens to make native WeChat calls. This ensures users receive WeChat notifications directly on their devices without the need to install additional applications, thus important calls are never missed.
TWeSee is dedicated to advanced visual solutions, offering video condensation, summarization, search, and object detection—powered by visual agents with perception, reasoning, and autonomous decision-making capabilities. With TWeSee, visual capture devices progress from merely "seeing" to truly "understanding", transforming into intelligent systems able to perform multimodal perception, multi-dimensional analysis, and scenario-based decision making. On the integration side, TWeSee provides plug-and-play AI visual analytics for audio and video equipment connected to the Tencent Cloud AIoT platform, while also offering seamless access for third-party cloud storage users. This greatly shortens the time-to-market for smart applications.
Powered by self-developed multimodal large models, TWeSee is specifically optimized for use cases such as IPCs, smart locks, and security surveillance, continuously enhancing video understanding across diverse scenarios. For instance, with IPC devices, TWeSee can automatically generate concise video highlights of important events and deliver real-time alerts. Additionally, TWeSee supports natural language search, enabling precise and intelligent indexing across large video datasets.
In pet monitoring scenarios, TWeSee can condense video footage of pets into engaging video highlights. For playback retrieval, users can search using text or images, and when paired with TWeTalk, voice search is also supported—for example, "retrieve footage of yesterday's delivery worker". In industrial settings, TWeSee delivers advanced visual analytics for applications such as supermarket foot traffic analysis, shelf stockout detection, theft identification, and monitoring employee behavior, providing valuable customer insights and timely theft alerts. With its robust and adaptable visual understanding capabilities, TWeSee quickly adapts to a wide range of customer needs and industry requirements, enabling smart devices to fully realize the vision of an intelligent and connected world.
From "General Intelligence" to "Scenario Intelligence": Global Implementation Redefines the AIoT Interaction Experience
With the rapid advancement of AIoT technology, the boundaries of hardware devices are constantly expanding, and industry focus has shifted from pure technical competition to ecosystem building. The ability to achieve real-world implementation across various application scenarios has become a key measure of enterprise competitiveness. Tencent Cloud's AIoT 2.0 product solutions are deeply integrated into industries and use cases worldwide, driving intelligent transformation across a diverse range of sectors.
With AIoT 2.0 product solutions, Tencent Cloud remains committed to advancing its technologies and solutions—delivering high-quality, efficient, and cost-effective intelligent services. This empowers industries around the world and helps customers unlock new opportunities for growth.
Jianing Lv, Product Head of Alpha Group's Intelligent Toy Division
With AI technology, we hope Weslie can evolve from a 2D cartoon character into a true smart companion—capable of making WeChat calls, singing, and genuinely accompanying children. This collaboration with Tencent Cloud and Yancheng AI Core Star Technology Co., Ltd, marks an exciting new journey into 'IP + AI'
Han Chengyan, Co-founder of Talking Tom's AI business
The localized deployment and language technology offered by Tencent Cloud's TWeTalk global solution were critical to our large-scale launch in the North American market. Our collaboration—from R&D to commercial rollout—was seamless. We are highly optimistic about our ongoing partnership and look forward to expanding the global AI companion market together
Wang Qiang, General Manager of Philips AVA China Region
The distinction between traditional and AI headphones has been eliminated. Through our strategic collaboration with Tencent Cloud, Philips has redefined the standards for effective communication and productivity—shifting from simple 'sound transmission' to truly 'intelligent communication'