AI INFO FORGE
Model Releases arXiv cs.AI Jun 5, 2026

Interfaze: The Future of AI is built on Task-Specific Small Models

Interfaze is a hybrid AI model that combines task-specific neural networks (for OCR, object detection, speech recognition) with a transformer decoder through a shared embedding space. The architecture achieves competitive or superior performance on specialized benchmarks compared to larger generalist models while operating at lower computational cost by activating only relevant parameters per task.

Read on arXiv cs.AI →
Model Releases arXiv cs.AI Jun 5, 2026

SAM 3D: 3Dfy Anything in Images

SAM 3D is a generative model that reconstructs 3D objects from single images by predicting geometry, texture, and layout. The researchers created a large-scale annotated dataset through human-model collaboration and trained the system using synthetic pretraining with real-world alignment, achieving substantial improvements over existing methods on natural, cluttered scenes.

Read on arXiv cs.AI →
Model Releases AWS Machine Learning Jun 3, 2026

Fundamental’s Large Tabular Model NEXUS is now available on Amazon SageMaker JumpStart

Fundamental's NEXUS, a foundation model designed specifically for tabular data prediction, is now available on Amazon SageMaker JumpStart. The model leverages pre-training on billions of structured datasets to enable enterprises to generate accurate predictions from their data without extensive feature engineering, reducing deployment timelines from months to days.

Read on AWS Machine Learning →
Model Releases The Verge AI Jun 3, 2026

As AI gets better, it reveals an empty promise

Google's new Gemini AI agent Spark demonstrates advanced capabilities by accessing personal information about users without explicit disclosure, raising concerns about privacy and the broader focus on productivity gains rather than addressing substantive societal challenges.

Read on The Verge AI →
Model Releases OpenAI Jun 3, 2026

Introducing new capabilities to GPT-Rosalind

OpenAI has introduced GPT-Rosalind, a specialized AI model designed to support life sciences research through improved biological reasoning, drug chemistry analysis, genetic data interpretation, and automation of lab procedures.

Read on OpenAI →
Model Releases arXiv cs.AI Jun 3, 2026

Cosmos 3: Omnimodal World Models for Physical AI

NVIDIA introduced Cosmos 3, a multimodal world model that processes and generates language, images, video, audio, and actions in a single unified framework. The system achieved top performance on text-to-image, image-to-video, and robotic control benchmarks, with code and models released openly to support physical AI research.

Read on arXiv cs.AI →
Model Releases arXiv cs.AI Jun 3, 2026

Pretraining Language Models on Historical Text

Researchers introduced TypewriterLM, a 7.24B parameter language model trained exclusively on pre-1913 English text, along with TypewriterCorpus, a 54B-token historical dataset. The work addresses challenges specific to historical language modeling through specialized data cleaning, leakage prevention, and evaluation benchmarks designed to assess temporal consistency.

Read on arXiv cs.AI →
Model Releases The Verge AI Jun 2, 2026

Microsoft’s first advanced reasoning AI is here

Microsoft announced MAI-Thinking-1, a new advanced reasoning model developed in-house, at Build 2026. The medium-sized model performs comparably to leading models on software engineering benchmarks and was trained on clean data without relying on third-party model distillation, marking Microsoft's continued expansion of its own AI model capabilities.

Read on The Verge AI →
Model Releases The Verge AI Jun 2, 2026

Microsoft Scout is a new AI personal assistant built on OpenClaw

Microsoft launched Scout, a new AI personal assistant built on OpenClaw that integrates with Microsoft 365 applications including Outlook, Teams, and OneDrive. Unlike existing Copilot features, Scout functions as a dedicated assistant capable of handling calendar management, expense reporting, email composition, and other workplace tasks.

Read on The Verge AI →
Model Releases The Verge AI Jun 2, 2026

Microsoft’s Project Solara is an OS for AI agent gadgets

Microsoft unveiled Project Solara, a new operating system designed for AI agent-powered devices, built on Android rather than Windows. The company demonstrated two concept devices at Build 2026: a desk-mounted display with facial recognition and a wearable badge with camera and fingerprint scanner.

Read on The Verge AI →
Model Releases arXiv cs.AI Jun 2, 2026

Ryze: Evidence-Enriched Data Synthesis from Biomedical Papers

Researchers developed Ryze, an automated system that converts biomedical papers into evidence-enriched training data for specialized vision-language models. The resulting BioVLM-8B model significantly outperforms general-purpose models on biomedical benchmarks by preserving evidence connections across document elements, and both the system and model are being released open source.

Read on arXiv cs.AI →
Model Releases The Verge AI Jun 1, 2026

This could be Windows’ M1 moment — but expect it to cost a ton

Nvidia is entering the consumer laptop chip market with RTX Spark, positioning itself to deliver the performance breakthrough that Windows laptops have lacked compared to Apple's Arm-based chips. The move could replicate Apple's M1 success in the Windows ecosystem, though the technology is expected to come at a premium price point.

Read on The Verge AI →
Model Releases arXiv cs.AI Jun 1, 2026

AMix-2: Establishing Protein as a Native Modality in Large Language Models

Researchers introduced AMix-2, a foundation model that treats protein sequences as a native modality alongside text in large language models, enabling unified protein understanding and design tasks. The model uses block-wise diffusion language modeling and was evaluated on ProteinArena, a new benchmark with realistic generalization protocols, where it outperformed existing LLMs and matched specialized protein models.

Read on arXiv cs.AI →
Model Releases arXiv cs.AI May 29, 2026

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

Researchers introduced AgentDoG 1.5, a lightweight safety alignment framework for AI agents that addresses emerging risks from autonomous systems. The framework includes compact models (0.8B-8B parameters) trained on minimal data, an efficient deployment system reducing computational overhead, and real-time safety moderation capabilities for agent operations.

Read on arXiv cs.AI →
Model Releases AWS Machine Learning May 28, 2026

Claude Opus 4.8 is now available on AWS

Anthropic's Claude Opus 4.8 model is now available through Amazon Bedrock and Claude Platform on AWS. The update offers improved performance for coding, autonomous tasks, and long-running workflows while maintaining enterprise security and data residency options.

Read on AWS Machine Learning →
Model Releases arXiv cs.AI May 28, 2026

Soro: A Lightweight Foundation Model and Chatbot for Tajik

Researchers developed Soro, a lightweight language model optimized for Tajik language use cases, building on Gemma 3 and trained on 1.9 billion Tajik tokens. The model includes new Tajik benchmarks for evaluation and supports quantized versions for deployment in resource-constrained environments, with pilots planned for Tajikistan's education sector.

Read on arXiv cs.AI →
Model Releases arXiv cs.AI May 28, 2026

Laguna M.1/XS.2 Technical Report

Researchers introduced Laguna M.1 and Laguna XS.2, two mixture-of-experts models designed for coding tasks with 225.8B and 33.4B parameters respectively. Both models were developed using an integrated system called Model Factory and demonstrate competitive performance on software engineering benchmarks, with the smaller XS.2 model released as open source.

Read on arXiv cs.AI →
Model Releases arXiv cs.AI May 26, 2026

JT-SAFE-V2: Safety-by-Design Foundation Model with World-Context Data

Researchers introduced JT-Safe-V2, a language model emphasizing safety through design improvements including contextual pre-training data and enhanced post-training mechanisms. The team also proposed Safe-MoMA, a framework using multiple models to reduce inference costs by over 30% while maintaining performance, and released the 35B parameter model publicly.

Read on arXiv cs.AI →
Model Releases AWS Machine Learning May 21, 2026

Amazon Nova Act is now HIPAA eligible

Amazon Nova Act, AWS's browser-based AI agent service, now qualifies as HIPAA-eligible, enabling healthcare organizations to automate compliance-sensitive workflows involving protected health information. This certification removes regulatory barriers for deploying autonomous agents in healthcare settings for tasks like claims processing and referral coordination.

Read on AWS Machine Learning →
Model Releases Google AI Blog May 19, 2026

Gemini 3.5: frontier intelligence with action

Google announced Gemini 3.5, its latest AI model series designed to integrate advanced reasoning capabilities with the ability to take actions. The release occurred at Google I/O and represents the company's effort to advance beyond purely analytical AI.

Read on Google AI Blog →
Model Releases Hugging Face May 19, 2026

Introducing the Ettin Reranker Family

Hugging Face introduced the Ettin Reranker Family, a collection of reranking models designed to improve search and retrieval quality by rescoring documents or passages. These models help refine results from initial retrieval systems to provide more relevant outputs.

Read on Hugging Face →
Model Releases Google DeepMind May 17, 2026

Introducing Gemini Omni

Google DeepMind introduced Gemini Omni, a new AI model designed to process and understand multiple types of content including text, audio, and video in an integrated manner. This multimodal capability represents an advancement in the company's AI offerings.

Read on Google DeepMind →
Model Releases Hugging Face May 14, 2026

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

IBM released Granite Embedding Multilingual R2, an open-source embedding model under Apache 2.0 license that supports multiple languages and handles 32K token context windows. The model achieves competitive retrieval performance while maintaining under 100 million parameters, making it suitable for efficient multilingual semantic search applications.

Read on Hugging Face →
Model Releases OpenAI May 7, 2026

Advancing voice intelligence with new models in the API

OpenAI has introduced new voice models for its API that support reasoning, translation, and transcription capabilities, enabling more sophisticated voice-based interactions. These models aim to provide more natural and intelligent conversational experiences.

Read on OpenAI →
Model Releases Hugging Face May 6, 2026

vLLM V0 to V1: Correctness Before Corrections in RL

vLLM released a major update transitioning from v0 to v1, emphasizing correctness validation before applying corrections in reinforcement learning workflows. The upgrade focuses on improving the reliability and accuracy of the inference engine's handling of RL-based model refinement.

Read on Hugging Face →
Model Releases OpenAI May 5, 2026

GPT-5.5 Instant System Card

OpenAI released a system card for GPT-5.5 Instant, providing technical documentation and safety assessment details for this model variant. The document outlines the model's capabilities, limitations, and recommended usage guidelines.

Read on OpenAI →
Model Releases Hugging Face Apr 29, 2026

Granite 4.1 LLMs: How They’re Built

IBM's Granite 4.1 large language models have been released, with documentation detailing their architecture and training methodology. The models represent an update to IBM's open-source LLM line designed for enterprise applications.

Read on Hugging Face →
Model Releases OpenAI Apr 23, 2026

Introducing GPT-5.5

OpenAI released GPT-5.5, a new model positioned as their most advanced to date, offering improved performance and speed for specialized applications including software development, scientific research, and analytical work.

Read on OpenAI →
Model Releases OpenAI Apr 23, 2026

GPT-5.5 System Card

OpenAI released a system card for GPT-5.5, a document detailing the model's capabilities, limitations, and safety considerations. The system card provides technical specifications and responsible use guidelines for developers and researchers.

Read on OpenAI →
Model Releases OpenAI Apr 21, 2026

Introducing ChatGPT Images 2.0

OpenAI released ChatGPT Images 2.0, an upgraded image generation model featuring better text rendering within images, support for multiple languages, and enhanced visual understanding capabilities.

Read on OpenAI →
Model Releases OpenAI Apr 14, 2026

Trusted access for the next era of cyber defense

OpenAI has expanded its Trusted Access for Cyber program by introducing GPT-5.4-Cyber, a specialized AI model made available to approved cybersecurity professionals. The expansion aims to provide verified defenders with advanced AI capabilities while implementing enhanced safety measures as AI cybersecurity tools become more powerful.

Read on OpenAI →
Model Releases Hugging Face Apr 1, 2026

Falcon Perception

Falcon Perception appears to be a new model or capability release, though limited details are available. Without the full excerpt, the specific features, applications, or significance cannot be accurately summarized.

Read on Hugging Face →
Model Releases Google DeepMind Mar 25, 2026

Lyria 3 Pro: Create longer tracks in more

Google DeepMind released Lyria 3 Pro, an upgraded music generation model capable of creating longer tracks with improved structural understanding. The company is expanding Lyria's availability across additional Google products and platforms.

Read on Google DeepMind →
Model Releases OpenAI Mar 23, 2026

Creating with Sora Safely

OpenAI has designed Sora 2 and its associated app with built-in safety measures to address risks from advanced video generation and social sharing capabilities. The company emphasizes concrete protections as foundational to the platform's architecture.

Read on OpenAI →
Model Releases Hugging Face Mar 17, 2026

Holotron-12B - High Throughput Computer Use Agent

Holotron-12B is a new computer use agent model with 12 billion parameters designed for high-throughput automated interaction with computer interfaces. The model enables efficient processing of tasks that require understanding and acting on visual desktop environments.

Read on Hugging Face →
Model Releases OpenAI Mar 17, 2026

Introducing GPT-5.4 mini and nano

OpenAI released two smaller variants of GPT-5.4 designed for efficiency and speed. The mini and nano versions target coding tasks, tool integration, multimodal reasoning, and high-volume API applications.

Read on OpenAI →
Model Releases Hugging Face Mar 9, 2026

LeRobot v0.5.0: Scaling Every Dimension

Hugging Face released LeRobot v0.5.0, an update focused on expanding capabilities across multiple dimensions of the robotic learning platform. The release aims to improve scalability and functionality for robotics applications.

Read on Hugging Face →
Model Releases OpenAI Mar 5, 2026

GPT-5.4 Thinking System Card

OpenAI released a system card for GPT-5.4 Thinking, documenting the model's capabilities, limitations, and safety considerations. The document provides technical details about the system's reasoning processes and performance benchmarks.

Read on OpenAI →
Model Releases OpenAI Mar 5, 2026

Introducing GPT-5.4

OpenAI released GPT-5.4, a new frontier model designed for professional applications. The model features improved capabilities in coding, computer use, and tool integration, along with support for 1 million token context windows.

Read on OpenAI →
Model Releases OpenAI Mar 3, 2026

GPT-5.3 Instant System Card

OpenAI released a system card for GPT-5.3 Instant, documenting the model's capabilities, limitations, and safety considerations. The documentation provides transparency regarding the model's performance characteristics and potential risks.

Read on OpenAI →
Model Releases OpenAI Feb 13, 2026

GPT-5.2 derives a new result in theoretical physics

GPT-5.2 generated a previously unknown formula for gluon amplitudes in theoretical physics, which researchers subsequently proved and confirmed. This demonstrates the model's capability to contribute to scientific discovery in advanced physics domains.

Read on OpenAI →
Model Releases OpenAI Feb 12, 2026

Introducing GPT-5.3-Codex-Spark

OpenAI released GPT-5.3-Codex-Spark, a new coding model optimized for real-time performance with 15 times faster code generation and 128k token context window. The model is available as a research preview for ChatGPT Pro subscribers.

Read on OpenAI →
Model Releases OpenAI Feb 5, 2026

Introducing GPT-5.3-Codex

OpenAI released GPT-5.3-Codex, an agent designed specifically for coding tasks that combines advanced code generation capabilities with broad reasoning abilities for complex technical projects over extended workflows.

Read on OpenAI →
Model Releases OpenAI Feb 5, 2026

GPT-5.3-Codex System Card

OpenAI released GPT-5.3-Codex, a coding model that integrates advanced coding abilities from GPT-5.2-Codex with reasoning and professional knowledge capabilities. The company positions it as the most capable agentic coding model available.

Read on OpenAI →
Model Releases Hugging Face Jan 20, 2026

Differential Transformer V2

Hugging Face released Differential Transformer V2, an updated version of their differential transformer architecture. The advancement likely represents improvements to the model's efficiency or performance characteristics, though specific enhancements are not detailed.

Read on Hugging Face →
Model Releases VentureBeat AI Jan 13, 2026

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

Salesforce launched a rebuilt Slackbot AI agent that transforms the tool from a basic notification system into a fully functional agent capable of searching enterprise data, drafting documents, and executing tasks. The new version, available to Business+ and Enterprise+ customers, represents Salesforce's major push into agentic AI as it competes with Microsoft and Google in workplace AI solutions.

Read on VentureBeat AI →