![NVIDIA-AWS]()
As we gather for the NVIDIA GTC conference, organizations of all sizes find themselves at a critical juncture in their AI journeys. The pressing question is no longer whether to adopt generative AI but how to transition from promising pilot projects to production-ready systems that deliver tangible business value. Those who successfully navigate this transition will gain a significant competitive edge, as evidenced by compelling examples emerging from various sectors.
Infrastructure and Trust in AI Success
Achieving production-ready AI requires more than advanced models or powerful GPUs. Through my decade of experience in customer data journeys, I've learned that an organization’s most valuable asset is its domain-specific data and expertise. Customers consistently emphasize the need for trustworthy infrastructure and services that deliver performance, cost-efficiency, security, and flexibility—all at scale. AWS has successfully addressed these challenges for numerous customers, and our partnership with NVIDIA's accelerated computing platform enhances this capability.
Adobe's Rapid Integration of Generative AI
Content creation serves as one of the most visible applications of generative AI today. Adobe has swiftly integrated generative AI across its flagship products, enabling millions of creators to work in innovative ways. Their VP of Generative AI, Alexandru Costin, describes their infrastructure as an “AI superhighway,” allowing rapid iteration of models and seamless integration into creative applications.
Adobe utilizes NVIDIA GPU-accelerated Amazon EC2 instances for their training and inference workloads, demonstrating the importance of a robust infrastructure that supports high-performance storage and container orchestration.
Perplexity: Redefining Search Technology
Perplexity exemplifies the spirit of startups tackling ambitious challenges. By processing 340 million queries monthly and serving over 1,500 organizations, they are transforming search technology. Their innovative approach earned them membership in both AWS Activate and NVIDIA Inception programs, providing essential resources for scaling.
Perplexity leverages Amazon SageMaker HyperPod for distributed training and employs an optimized inference stack with NVIDIA TensorRT-LLM to achieve significant performance improvements.
ServiceNow: Transforming Enterprise Workflows
ServiceNow is rapidly integrating AI to reimagine core business processes at scale. Their innovative solutions focus on deep integration with technology workflows and CRM systems, using NVIDIA DGX Cloud on AWS for training generative AI models. This architecture allows ServiceNow to prioritize domain-specific AI development while minimizing infrastructure management.
Cisco's Webex Team: Methodical Transformation with Generative AI
Cisco’s Webex team showcases how large organizations can transform applications while maintaining standards for reliability. By separating their models from applications, they have improved development velocity and resource utilization significantly. This architectural change enables them to scale independently without compromising performance.
Hippocratic AI: Rigorous Engineering for Healthcare Solutions
Hippocratic AI's success in reaching 100,000 patients during a crisis highlights its commitment to safety and reliability in healthcare. Their "constellation architecture" consists of over 20 specialized models focused on various safety aspects, allowing them to manage substantial computational resources effectively.
A Partnership Driving Innovation
As we continue our partnership with NVIDIA, our collaboration evolves to meet the demands of the generative AI era. Together, we offer the industry’s widest range of NVIDIA accelerated computing solutions and software services for optimizing AI deployments. The stories shared at this conference illustrate how organizations are leveraging these capabilities to transform industries and solve meaningful problems.
The true promise of our partnership with NVIDIA lies in enabling innovators to create positive change at scale. As we look ahead, I am excited about the possibilities that await us and eager to see how our mutual customers will continue to innovate with these powerful tools.