AI Blog | TensorOps

Aug 29, 20241 min read

TensorOps AI Driven Talent Management

Dive into the future of HR with TensorOps AI-Driven Talent Management! 🤖💼 Our latest video showcases how TensorOps is revolutionizing...

Analyzing the Costs of Large Language Models in Production

TensorOps

Aug 29, 20241 min read

Analyzing the Costs of Large Language Models in Production

This one-hour webinar offers a deep dive into the costs aspects of leveraging Large Language Models (LLMs) in production environments....

Panaya Implements TensorOps' AI Agent Solution to Accelerate Time-to-Market

Gad Benram

Aug 14, 20242 min read

Panaya Implements TensorOps' AI Agent Solution to Accelerate Time-to-Market

In an impressive collaboration, TensorOps and Panaya have successfully launched the "SeeMore" AI Agent into production within just three...

Floor Price Optimization in Programmatic Advertising Using Machine Learning

Gabriel Gonçalves

Aug 12, 20246 min read

Floor Price Optimization in Programmatic Advertising Using Machine Learning

Introduction As the industry of programmatic advertising becomes more competitive more companies want to build their smart bidding...

Lifetime Value Predictions for Google Ads and Meta Advertising

Gabriel Gonçalves

Aug 8, 20245 min read

Lifetime Value Predictions for Google Ads and Meta Advertising

Acquiring users who bring long-term value to your business is crucial in today's digital market. Platforms like Google Ads and Meta...

Optimizing User Acquisition and Retention with Machine Learning: Predicting Customer Lifetime Value and Churn in Freemium Mobile Games

Vasco Reid

Jul 18, 202410 min read

Optimizing User Acquisition and Retention with Machine Learning: Predicting Customer Lifetime Value and Churn in Freemium Mobile Games

Introduction In the highly competitive mobile gaming industry, acquiring and retaining profitable users is crucial for success. The...

LLM Proxy & LLM gateway: All You Need to Know

Gad Benram

Jun 15, 20245 min read

LLM Proxy & LLM gateway: All You Need to Know

Since OpenAI first introduced ChatGPT, the landscape of AI models has evolved significantly. While OpenAI now offers multiple versions of...

Modal: A Powerful Alternative to AWS Lambda for AI Workloads

Gad Benram

Jun 10, 20244 min read

Modal: A Powerful Alternative to AWS Lambda for AI Workloads

As the CTO of TensorOps, and previously as a consultant for one of the largest cloud MSPs, I've had the privilege of working with various...

Vertex AI Workbench vs Colab Enterprise: Which Notebook Solution is Right for You?

Gad Benram

Jun 8, 20244 min read

Vertex AI Workbench vs Colab Enterprise: Which Notebook Solution is Right for You?

When it comes to data science and machine learning, notebooks (based on Jupyter) are often the main tool for research and exploration as...

Reflections from the RAG Funeral: A Melting Pot of Minds and Misconceptions

Gad Benram

May 26, 20242 min read

Reflections from the RAG Funeral: A Melting Pot of Minds and Misconceptions

On a sad February day, I attended the professional event known as the “RAG Funeral,” which, contrary to its name, was a lively and...

No Clouds Allowed: Building an All Open Source Local RAG System

Diogo Azevedo

May 20, 20248 min read

No Clouds Allowed: Building an All Open Source Local RAG System

In today’s AI landscape, companies like Microsoft and Google offer sophisticated Retriever-Augmented Generation (RAG) solutions through...

Prompt Eng vs RAG vs Fine-Tuning - What do you need?

Miguel Carreira Neves

May 9, 20246 min read

Prompt Eng vs RAG vs Fine-Tuning - What do you need?

Do you know which method to focus on to improve your LLM App? What are the pros and cons? Here we give advice based on our past experience.

Cost of AI - What Would an Organization Pay in 2024?

Gad Benram

Apr 7, 20247 min read

Cost of AI - What Would an Organization Pay in 2024?

Generative AI has been at the forefront of the automation revolution, particularly since the emergence of ChatGPT. The emergence of...

RAG vs Large Context Models: How Gemini 1.5 changes the world

Clara Gadelho

Mar 26, 202411 min read

RAG vs Large Context Models: How Gemini 1.5 changes the world

Should you use GPT4 or other models with RAG or just send everything in the context to Gemini 1.5?

MDClone Revolutionizes Clinical Data Analysis with AI

Gad Benram

Mar 20, 20245 min read

MDClone Revolutionizes Clinical Data Analysis with AI

MDClone: Setting New Standards for clinical analysis with ADAM MDClone is a successful startup that provides the ADAMS Platform, an...

Understanding the cost of Large Language Models (LLMs)

Gad Benram

Feb 10, 202410 min read

Understanding the cost of Large Language Models (LLMs)

What stands behind the cost of LLMs? Do you need to pay for training an LLM and how much does it cost to host one on AWS? Read about it here

LLM-FinOps: The Key to Cost-Effective Gen AI Applications

Gad Benram

Feb 4, 20246 min read

LLM-FinOps: The Key to Cost-Effective Gen AI Applications

Discover LLM-FinOps: The art of balancing cost, performance, and scalability in AI, where strategic cost monitoring meets innovative perform

Miguel Carreira Neves

Jan 29, 20248 min read

LLM Mixture of Experts Explained

Explaining Mixture of Experts (MoE): GPT4 is just 8 smaller Expert models; Mixtral is just 8 Mistral models. Advantages and disadvantages.

Advanced Prompt Engineering - Practical Examples

Miguel Carreira Neves

Dec 8, 202313 min read

Advanced Prompt Engineering - Practical Examples

This blog post will cover more complex state-of-the-art methods in prompt engineering including Chains, Agents, and more.

Bruno Alho

Nov 4, 20237 min read

ML Model Deployment Strategies

As a data scientist, you may occasionally train a machine learning model to be part of a production system. Once you have completed the...