Creating AI that matters

Oct 21, 2025 | AI

From the very inception of artificial intelligence, MIT and IBM played a pivotal role in its development. Both institutions were instrumental in establishing foundational research, creating some of the earliest programs that foreshadowed modern AI, and conceptualizing how machine intelligence might ultimately emerge.

Eight years on, collaborations such as the MIT-IBM Watson AI Lab continue to serve as crucial drivers for the evolution of future AI technology. This sustained expertise is paramount for industries and the global workforce, which are poised to reap significant benefits, particularly in the short term.

Forecasts predict a substantial global economic impact ranging from $3-4 trillion, alongside an impressive 80 percent productivity surge for knowledge workers and creative tasks. Moreover, generative AI is poised for rapid integration, with projections indicating its incorporation into 80 percent of business processes and 70 percent of software applications within the next three years.

While the industrial sector has recently witnessed a significant surge in prominent AI models, the fundamental engine of innovation largely remains within academia, consistently producing the most highly-cited research.

A prime example of this academic leadership is the MIT-IBM Watson AI Lab, which demonstrates a remarkable portfolio of achievements. The lab’s success is quantified by an impressive 54 patent disclosures, an accumulation of over 128,000 citations with a robust h-index of 162, and the practical application of more than 50 industry-driven use cases.

Among the lab’s many notable contributions are advancements that enhance medical procedures, such as utilizing AI imaging techniques for improved stent placement. Researchers have also focused on optimizing computational resources, successfully slashing overhead and developing more compact models that maintain high performance. Additionally, their work extends to fundamental scientific understanding, exemplified by their innovative modeling of interatomic potential for silicate chemistry.

Aude Oliva, who serves as both MIT lab director and director of strategic industry engagement at the MIT Schwarzman College of Computing, asserts the lab’s unique prowess in identifying crucial problems, a distinction that sets it apart from other entities. She further highlights that the practical experience students gain from tackling these enterprise AI challenges significantly enhances their competitiveness in the job market and drives the promotion of a robust, competitive industry.

Anantha Chandrakasan, MIT’s Provost and Vannevar Bush Professor of Electrical Engineering and Computer Science, who also co-chairs the MIT-IBM Watson AI Lab, has highlighted the lab’s profound influence. He attributes its “tremendous impact” to the extensive collaborations forged between IBM and MIT’s researchers and students. Chandrakasan emphasized that by championing interdisciplinary research at the nexus of artificial intelligence and diverse other fields, the lab is not only advancing foundational knowledge but also expediting the creation of transformative solutions beneficial to both the nation and the global community.

**Long-horizon work** refers to strategic initiatives and projects designed to yield their primary impact or full benefits over an extended future timeframe, rather than delivering immediate results. These endeavors typically demand sustained effort and patience, with their ultimate value unfolding years or even decades down the line.

Despite the burgeoning enthusiasm for artificial intelligence, many organizations are grappling with how to effectively translate this advanced technology into tangible, impactful results. A recent 2024 study by Gartner starkly illustrates this challenge, forecasting that a considerable 30% of generative AI projects will be abandoned post-proof of concept by the end of 2025. This projection, while indicative of widespread ambition and a strong desire to embrace AI, simultaneously highlights a significant deficit in the practical expertise needed to both develop and implement these solutions to deliver immediate, measurable value.

The laboratory distinguishes itself by effectively bridging the gap between advanced research and practical deployment. A substantial portion of its current research initiatives is dedicated to developing new features, capabilities, and products for IBM, its corporate members, and a diverse range of real-world applications. These include advanced large language models, AI hardware, and specialized foundation models across multi-modal, bio-medical, and geo-spatial domains.

Inquiry-driven students and interns are indispensable to this mission, contributing invaluable enthusiasm and fresh perspectives. Their involvement not only facilitates their acquisition of specialized domain knowledge but also directly aids in engineering field advancements and charting new frontiers for AI-driven exploration.

The AAAI 2025 Presidential panel on the Future of AI Research has underscored the critical importance of robust collaborations between academic institutions and the private sector. Its findings strongly advocate for joint ventures, such as the AI arena lab, emphasizing their indispensable role in shaping AI’s trajectory.

The panel highlighted that academics are uniquely positioned to offer independent analysis and interpretation of AI advancements originating from industry, alongside their broader societal consequences. This distinction is paramount, as the private sector typically prioritizes short-term objectives—a stark contrast to the longer-term vision cultivated by universities and society, which focuses on sustained development and enduring impact.

The convergence of diverse expertise, coupled with a resolute drive towards open sourcing and open science, promises to ignite a spark of innovation unattainable through solitary endeavors. History unequivocally demonstrates that embracing these principles—particularly the public sharing of code and accessible research—delivers profound, long-term benefits for both industry and society at large.

Embodying these very tenets and aligned with the core missions of IBM and MIT, the collaborative lab actively enriches the public sphere. It contributes crucial technologies, groundbreaking findings, robust governance models, and essential standards, thereby enhancing transparency, accelerating reproducibility, and ensuring the trustworthiness of future advancements.

The laboratory was established through a strategic partnership, bringing together MIT’s extensive research expertise and IBM’s substantial industrial R&D capabilities. This collaboration aims to achieve significant breakthroughs in foundational artificial intelligence methods and hardware. Furthermore, it seeks to develop innovative applications across vital sectors, including healthcare, chemistry, finance, cybersecurity, and advanced business planning and decision-making.

Superiority is not always a matter of size; larger dimensions do not automatically guarantee enhanced quality or performance.

The artificial intelligence landscape is rapidly shifting, as the industry moves away from large foundation models toward smaller, more specialized designs that deliver enhanced performance. This evolution is significantly bolstered by the contributions of researchers like Song Han, an associate professor in MIT’s Department of Electrical Engineering and Computer Science (EECS), and IBM Research’s Chuang Gan.

Their innovative work, which includes methodologies such as “once-for-all” and “AWQ,” is pivotal in boosting efficiency. These advancements are achieved through optimized architectures, refined algorithm shrinking techniques, and sophisticated activation-aware weight quantization. As a result, advanced AI applications, particularly in areas like language processing, can now operate on edge devices with notably faster speeds and reduced latency.

Advancements across foundation, vision, multimodal, and large language models have opened new avenues for research, empowering teams led by Oliva, MIT EECS Associate Professor Yoon Kim, and IBM Research members Rameswar Panda, Yang Zhang, and Rogerio Feris. Their collaborative efforts have yielded significant progress, including the development of novel techniques to integrate external knowledge into AI models. Additionally, the groups have pioneered linear attention transformer methods, which deliver superior processing throughput compared to existing state-of-the-art systems.

The understanding and reasoning capabilities of vision and multimodal AI systems have recently experienced a significant surge. This progress is notably demonstrated by projects such as “Task2Sim,” which highlights how pre-training vision models on synthetic data can lead to enhanced performance. Further exemplifying this trend, “AdaFuse” reveals how video action recognition can be substantially boosted through the intelligent fusion of channels from both past and current feature maps.

**MIT and IBM Researchers Unveil Breakthroughs for ‘Leaner AI’**

**CAMBRIDGE, MA/ARMONK, NY** – Pioneering new strategies for more resource-efficient artificial intelligence, research teams from MIT and IBM have successfully demonstrated that AI models can achieve both high adaptability and data efficiency simultaneously. This crucial development, led by MIT’s Gregory Wornell alongside IBM Research’s Chuang Gan and David Cox, marks a significant step towards developing “leaner AI” that thrives on limited computational power and data.

The breakthrough is driven by two innovative approaches: EvoScale and Chain-of-Action-Thought (COAT) reasoning. These cutting-edge techniques empower language models to substantially improve their output quality by iteratively refining initial generation attempts. This structured, self-correcting process allows AI systems to make the most of restricted data and computational resources, consistently narrowing in on superior responses.

Specifically, COAT reasoning employs a meta-action framework combined with reinforcement learning, enabling AI to tackle complex, reasoning-intensive tasks through continuous self-correction. In parallel, EvoScale applies a similar evolutionary philosophy to code generation, developing and iteratively enhancing high-quality candidate solutions.

Together, these advancements are poised to facilitate the deployment of AI systems that are not only mindful of resource consumption but also precisely targeted and effective for real-world applications.

Cox has underscored the “critical” impact of MIT-IBM research on their large language model development efforts. He specifically highlighted a significant trend: smaller, highly specialized models and tools are delivering “outsized impact,” especially when integrated. These advancements, stemming from the MIT-IBM Watson AI Lab, are instrumental in defining the company’s technical directions and influencing its market strategy, particularly through platforms like watsonx.

IBM’s Granite Vision, a cutting-edge computer vision solution, has significantly benefited from extensive laboratory research, which has equipped it with powerful features and capabilities for sophisticated document understanding. Despite its compact design, Granite Vision delivers impressive performance, addressing a critical and growing enterprise demand. This technology is particularly vital in an era where businesses urgently require reliable methods for extracting, interpreting, and accurately summarizing information and data from voluminous, long-format documents.

The 2025 AAAI panel has concluded that achievements extending beyond direct artificial intelligence research and encompassing various disciplines are not merely advantageous, but fundamentally indispensable. Such interdisciplinary contributions are deemed crucial for both propelling AI technology forward and fostering broader societal advancement.

Caroline Uhler and Devavrat Shah, both Andrew (1956) and Erna Viterbi Professors in EECS and the Institute for Data, Systems, and Society (IDSS), are spearheading interdisciplinary research alongside Kristjan Greenewald from IBM Research. Their collaborative efforts focus on advanced causal discovery methods, designed to illuminate precisely how various interventions influence outcomes and, critically, to pinpoint which actions yield specific desired results.

A key component of this research is the development of a robust framework capable of predicting the effects of “treatments” or interventions on distinct sub-populations. This innovative approach could model scenarios ranging from consumer behavior on an e-commerce platform to the health implications of public health measures like mobility restrictions. The potential applications of these groundbreaking findings are far-reaching, promising to revolutionize diverse sectors from marketing and medicine to education and risk management.

The accelerating advancements in artificial intelligence and computing are fundamentally reshaping how challenges are conceptualized and addressed across nearly every discipline. This pervasive influence is keenly recognized at the MIT-IBM Watson AI Lab, where researchers adopt a multidisciplinary approach. They delve into complex issues from various perspectives, integrating practical, real-world problems sourced directly from industry to forge groundbreaking solutions, according to Dan Huttenlocher. Huttenlocher serves as the MIT lab co-chair, dean of the MIT Schwarzman College of Computing, and the Henry Ellis Warren (1894) Professor of Electrical Engineering and Computer Science.

Student talent is a cornerstone of this thriving research ecosystem, significantly enhanced by the contributions of participants in MIT’s Undergraduate Research Opportunities Program (UROP), the MIT EECS 6A Program, and the new MIT-IBM Watson AI Lab Internship Program.

Collectively, more than 70 young researchers have benefited from these initiatives, rapidly accelerating their technical skill development. Under the expert guidance and support of the lab’s mentors, these students have acquired specialized knowledge in artificial intelligence domains, successfully transitioning into emerging AI practitioners. This proven success underscores the lab’s continuous commitment to identifying and nurturing promising students at every stage of their exploration into AI’s transformative potential.

Sriram Raghavan, IBM Research Vice President for AI and IBM chair of the lab, asserts that unlocking the full economic and societal potential of artificial intelligence requires fostering “useful and efficient intelligence.” To translate AI’s considerable promise into tangible progress, Raghavan emphasizes the imperative of continuous innovation. This necessitates developing highly efficient, optimized, and purpose-built AI models that are easily adaptable to specific domains and varied use cases. He further notes that crucial breakthroughs enabling this vision are significantly advanced through academic-industry collaborations, citing initiatives like the MIT-IBM Watson AI Lab as vital drivers.

Related Articles