Accessibility Statement Skip Navigation
  • Why PRWeb
  • How It Works
  • Who Uses It
  • Pricing
  • Login
  • GDPR
  • Create a Free Account
Return to PRWeb homepage
  • News
  • Resources
  • Contact
When typing in this field, a list of search results will appear and be automatically updated as you type.

Searching for your content...

No results found. Please change your search terms and try again.
  • News in Focus
      • Browse News Releases

      • All News Releases
      • Multimedia Gallery

      • All Multimedia
      • All Photos
      • All Videos
  • Business & Money
      • Auto & Transportation

      • Aerospace, Defense
      • Air Freight
      • Airlines & Aviation
      • Automotive
      • Maritime & Shipbuilding
      • Railroads and Intermodal Transportation
      • Supply Chain/Logistics
      • Transportation, Trucking & Railroad
      • Travel
      • Trucking and Road Transportation
      • View All Auto & Transportation

      • Business Technology

      • Blockchain
      • Broadcast Tech
      • Computer & Electronics
      • Computer Hardware
      • Computer Software
      • Data Analytics
      • Electronic Commerce
      • Electronic Components
      • Electronic Design Automation
      • Financial Technology
      • High Tech Security
      • Internet Technology
      • Nanotechnology
      • Networks
      • Peripherals
      • Semiconductors
      • View All Business Technology

      • Entertain­ment & Media

      • Advertising
      • Art
      • Books
      • Entertainment
      • Film and Motion Picture
      • Magazines
      • Music
      • Publishing & Information Services
      • Radio & Podcast
      • Television
      • View All Entertain­ment & Media

      • Financial Services & Investing

      • Accounting News & Issues
      • Acquisitions, Mergers and Takeovers
      • Banking & Financial Services
      • Bankruptcy
      • Bond & Stock Ratings
      • Conference Call Announcements
      • Contracts
      • Cryptocurrency
      • Dividends
      • Earnings
      • Earnings Forecasts & Projections
      • Financing Agreements
      • Insurance
      • Investments Opinions
      • Joint Ventures
      • Mutual Funds
      • Private Placement
      • Real Estate
      • Restructuring & Recapitalization
      • Sales Reports
      • Shareholder Activism
      • Shareholder Meetings
      • Stock Offering
      • Stock Split
      • Venture Capital
      • View All Financial Services & Investing

      • General Business

      • Awards
      • Commercial Real Estate
      • Corporate Expansion
      • Earnings
      • Environmental, Social and Governance (ESG)
      • Human Resource & Workforce Management
      • Licensing
      • New Products & Services
      • Obituaries
      • Outsourcing Businesses
      • Overseas Real Estate (non-US)
      • Personnel Announcements
      • Real Estate Transactions
      • Residential Real Estate
      • Small Business Services
      • Socially Responsible Investing
      • Surveys, Polls and Research
      • Trade Show News
      • View All General Business

  • Science & Tech
      • Consumer Technology

      • Artificial Intelligence
      • Blockchain
      • Cloud Computing/Internet of Things
      • Computer Electronics
      • Computer Hardware
      • Computer Software
      • Consumer Electronics
      • Cryptocurrency
      • Data Analytics
      • Electronic Commerce
      • Electronic Gaming
      • Financial Technology
      • Mobile Entertainment
      • Multimedia & Internet
      • Peripherals
      • Social Media
      • STEM (Science, Tech, Engineering, Math)
      • Supply Chain/Logistics
      • Wireless Communications
      • View All Consumer Technology

      • Energy & Natural Resources

      • Alternative Energies
      • Chemical
      • Electrical Utilities
      • Gas
      • General Manufacturing
      • Mining
      • Mining & Metals
      • Oil & Energy
      • Oil and Gas Discoveries
      • Utilities
      • Water Utilities
      • View All Energy & Natural Resources

      • Environ­ment

      • Conservation & Recycling
      • Environmental Issues
      • Environmental Policy
      • Environmental Products & Services
      • Green Technology
      • Natural Disasters
      • View All Environ­ment

      • Heavy Industry & Manufacturing

      • Aerospace & Defense
      • Agriculture
      • Chemical
      • Construction & Building
      • General Manufacturing
      • HVAC (Heating, Ventilation and Air-Conditioning)
      • Machinery
      • Machine Tools, Metalworking and Metallurgy
      • Mining
      • Mining & Metals
      • Paper, Forest Products & Containers
      • Precious Metals
      • Textiles
      • Tobacco
      • View All Heavy Industry & Manufacturing

      • Telecomm­unications

      • Carriers and Services
      • Mobile Entertainment
      • Networks
      • Peripherals
      • Telecommunications Equipment
      • Telecommunications Industry
      • VoIP (Voice over Internet Protocol)
      • Wireless Communications
      • View All Telecomm­unications

  • Lifestyle & Health
      • Consumer Products & Retail

      • Animals & Pets
      • Beers, Wines and Spirits
      • Beverages
      • Bridal Services
      • Cannabis
      • Cosmetics and Personal Care
      • Fashion
      • Food & Beverages
      • Furniture and Furnishings
      • Home Improvement
      • Household, Consumer & Cosmetics
      • Household Products
      • Jewelry
      • Non-Alcoholic Beverages
      • Office Products
      • Organic Food
      • Product Recalls
      • Restaurants
      • Retail
      • Supermarkets
      • Toys
      • View All Consumer Products & Retail

      • Entertain­ment & Media

      • Advertising
      • Art
      • Books
      • Entertainment
      • Film and Motion Picture
      • Magazines
      • Music
      • Publishing & Information Services
      • Radio & Podcast
      • Television
      • View All Entertain­ment & Media

      • Health

      • Biometrics
      • Biotechnology
      • Clinical Trials & Medical Discoveries
      • Dentistry
      • FDA Approval
      • Fitness/Wellness
      • Health Care & Hospitals
      • Health Insurance
      • Infection Control
      • International Medical Approval
      • Medical Equipment
      • Medical Pharmaceuticals
      • Mental Health
      • Pharmaceuticals
      • Supplementary Medicine
      • View All Health

      • Sports

      • General Sports
      • Outdoors, Camping & Hiking
      • Sporting Events
      • Sports Equipment & Accessories
      • View All Sports

      • Travel

      • Amusement Parks and Tourist Attractions
      • Gambling & Casinos
      • Hotels and Resorts
      • Leisure & Tourism
      • Outdoors, Camping & Hiking
      • Passenger Aviation
      • Travel Industry
      • View All Travel

  • Policy & Public Interest
      • Policy & Public Interest

      • Advocacy Group Opinion
      • Animal Welfare
      • Congressional & Presidential Campaigns
      • Corporate Social Responsibility
      • Domestic Policy
      • Economic News, Trends, Analysis
      • Education
      • Environmental
      • European Government
      • FDA Approval
      • Federal and State Legislation
      • Federal Executive Branch & Agency
      • Foreign Policy & International Affairs
      • Homeland Security
      • Labor & Union
      • Legal Issues
      • Natural Disasters
      • Not For Profit
      • Patent Law
      • Public Safety
      • Trade Policy
      • U.S. State Policy
      • View All Policy & Public Interest

  • People & Culture
      • People & Culture

      • Aboriginal, First Nations & Native American
      • African American
      • Asian American
      • Children
      • Diversity, Equity & Inclusion
      • Hispanic
      • Lesbian, Gay & Bisexual
      • Men's Interest
      • People with Disabilities
      • Religion
      • Senior Citizens
      • Veterans
      • Women
      • View All People & Culture

  • Hamburger menu
  • Cision PRWeb provides efficient communication tools to continuously engage with target audiences across multiple online channels
  • Create a Free Account
    • ALL CONTACT INFO
    • Contact Us


      11AM ET Sunday – 8PM ET Friday

  • Send a Release
  • Sign up
  • Log in
  • Resources
  • RSS
  • GDPR
  • News in Focus
    • Browse All News
    • Multimedia Gallery
  • Business & Money
    • Auto & Transportation
    • Business Technology
    • Entertain­ment & Media
    • Financial Services & Investing
    • General Business
  • Science & Tech
    • Consumer Technology
    • Energy & Natural Resources
    • Environ­ment
    • Heavy Industry & Manufacturing
    • Telecomm­unications
  • Lifestyle & Health
    • Consumer Products & Retail
    • Entertain­ment & Media
    • Health
    • Sports
    • Travel
  • Policy & Public Interest
  • People & Culture
    • People & Culture
  • Send a Release
  • Sign up
  • Log in
  • Resources
  • RSS
  • GDPR
  • Send a Release
  • Sign up
  • Log in
  • Resources
  • RSS
  • GDPR
  • Send a Release
  • Sign up
  • Log in
  • Resources
  • RSS
  • GDPR

Avesha Announces Breakthrough AI Scaling with Smart Scaler, Delivering Up to 3x Performance Gains


News provided by

Avesha, Inc

Mar 19, 2025, 13:46 ET

Share this article

Share toX

Share this article

Share toX


Avesha's Smart Scaler introduces a Reinforcement Learning-based intelligent scaling solution for AI workloads, delivering unprecedented performance gains and cost efficiencies.

BOSTON and SAN FRANCISCO, March 19, 2025 /PRNewswire-PRWeb/ -- Avesha's Smart Scaler, part of its Elastic AI Services Suite, for Inference Endpoint scaling and GPU/CPU resource optimization, delivering up to 3x performance gains and reducing inference latency by 75%.

Avesha, a Gartner Cool Vendor and a leader in AI-driven GPU/CPU orchestration, today announced groundbreaking results from its latest benchmarking of Smart Scaler, which dynamically scales GPU resources in proportion to traffic, delivering up to 3x improvement in processing efficiency, 85% larger batch sizes, and 70% higher token throughput per batch for the llambda3-8B model on the Huggingface/TGI framework.. In addition, Smart Scaler demonstrated 2x improvement for the same model on the VLLM framework over TGI and a further 1.5x boost coming from Smart Scaler alone. This enables enterprises to scale AI workloads seamlessly across multiple clusters and cloud environments without overprovisioning or wasted compute.

"With Avesha's Elastic AI Services we're able to optimize our GPU workloads dynamically, ensuring we maximize performance without overpaying for underutilized resources," said Tulasee Rao Chintha, CTO, InpharmD.

Post this

Smart Scaler, an advanced AI-powered predictive scaling mechanism, dynamically scales resources based on workload demand. The benchmarking results highlight key advantages for AI inferencing and training:

  • Higher Instantaneous Throughput: Processed 3X more tokens in a burst enabling faster AI inferencing using the HuggingFace/TGI framework..
  • Reduced Latency: AI model inference latency dropped from 8 seconds to 2 seconds.
  • Improved Throughput for Industry-Leading AI Models: Llama3-8B workloads had a 31% increase in token throughput, while DeepSeek 7B had a 13.5% boost.

Driving AI Innovation with EGS

For exciting research companies like InpharmD that combine pharmacist expertise with AI to provide state-of-the-art, evidence-based drug information, having the right tools to optimize research and reduce costs is essential.

"With Avesha's Elastic AI Services we're able to optimize our GPU workloads dynamically, ensuring we maximize performance without overpaying for underutilized resources," said Tulasee Rao Chintha, CTO, InpharmD. "This allows us to scale efficiently while keeping our research and operational costs predictable and manageable."

Benchmarking Results Validate EGS Performance

"The benchmarking results speak for themselves—Avesha is setting a new standard for AI workload efficiency for LLMs as well as scientific or specialized models ," said Raj Nair, Founder and CEO at Avesha. Avesha improves interactive performance by 85% and triples overall efficiency, making high-performance AI more accessible and cost-effective for enterprises and startups alike"

Pay-per-work-output pricing

Avesha's innovative high-performance scaling solution enables GPU Cloud Providers to offer pay-per-work-output pricing instead of traditional GPU time-based pricing, significantly reducing costs and making AI development more accessible. This incredible performance improvement creates the opportunity for very competitive pay-per-work-output made feasible by sharing higher performance GPUs – a higher throughput makes the price per work-output lower than a lower priced but slower GPU. .

"With Avesha, startups no longer need to pay for idle GPU hours," added Raj Nair, Founder and CEO at Avesha. "Now, they can only pay for actual AI workloads processed, making it a game-changer for companies creating innovative AI applications while maintaining cost efficiency. We are introducing a FREE Tier for our GPU services available through OCI."

Startups can sign up for the EGS Free Tier today at the OCI Marketplace.

A Hybrid Pricing Model That Maximizes Value

EGS introduces a flexible pricing strategy designed to optimize costs while maintaining high-performance AI scaling:

  • Value-Based Pricing – Customers pay for actual performance gains rather than static GPU time.
  • On Demand/Spot Pricing – Leverages unused GPU capacity for cost savings.
  • Tiered Commitments – Offers long-term cost reductions for enterprise-scale AI workloads.
  • Auto-Scaling Capabilities – Dynamically adjusts GPU allocation based on real-time demand.

With this approach, GPU cloud providers also benefit by optimizing resource allocation and monetizing idle capacity efficiently.

About Avesha

Avesha is a pioneer in AI-powered GPU and CPU orchestration & scaling solutions, utilizing Kubernetes to optimize performance across diverse cloud and edge environments. As a Gartner Cool Vendor and a CNCF Sandbox project, Avesha is committed to delivering scalable, high-performance solutions that empower businesses across industries, including finance, retail, media, and healthcare.

For more information, visit www.avesha.io.

Media Contact

Olyvia Rakshit, Avesha, Inc, 1 5046122716, [email protected], www.avesha.io

SOURCE Avesha, Inc

Modal title

Contact PRWeb

  • 11AM ET Sunday – 8PM ET Friday
  • Contact Us

About PRWeb

  • About PRWeb
  • Partners
  • Partnership Programs
  • Editorial Guidelines
  • Resources

Why PRWeb

  • Why PRWeb
  • How It Works
  • Who Uses It
  • Pricing

Accounts

  • Create a Free Account
  • Log in
  • Contact Us

Do not sell or share my personal information:

  • Submit via [email protected] 
  • Call Privacy toll-free: 877-297-8921

Contact Cision

Products

About

My Services
  • All News Releases
  • Online Member Center
  • ProfNet
Cision Distribution Helpline
888-776-0942
  • Legal
  • Site Map
  • RSS
  • Cookie Settings
Copyright © 2025 Cision US Inc.