
ai company deepseek recently announced that it is conducting a grayscale beta test of its brand-new “image recognition mode.” this mode will sit alongside the existing “quick mode” and “expert mode,” but its capabilities go far beyond simple ocr text recognition—it boasts more sophisticated multimodal recognition functionality. this means users can upload images and have deepseek perform in-depth analysis and generate intelligent descriptions of the image content, greatly enhancing the convenience of processing image information.
according to feedback from users participating in the grayscale beta test, the current “image recognition mode” responds extremely quickly; one user even described it as lightning-fast. however, some users have encountered a system prompt stating, “image recognition mode is currently unavailable. please try again later,” indicating that the mode is still undergoing continuous refinement and optimization and has not yet reached full stability.
for the vast majority of users, deepseek’s newly launched image recognition feature means they will enjoy a more efficient and intelligent interactive experience when handling various types of image data, such as photos, screenshots, and charts. the enhancement of multimodal recognition capabilities not only further strengthens deepseek’s technological competitiveness in the ai field but also enables users to obtain the information they need more intuitively in their daily work, studies, and personal life. as the grayscale beta test progresses, the feature is expected to be rolled out to more users once it has been fully refined, becoming another practical tool within the deepseek ecosystem.