top of page
Vasco Reid
Aug 29, 20241 min read
TensorOps AI Driven Talent Management
Dive into the future of HR with TensorOps AI-Driven Talent Management! 🤖💼 Our latest video showcases how TensorOps is revolutionizing...
TensorOps
Aug 29, 20241 min read
Analyzing the Costs of Large Language Models in Production
This one-hour webinar offers a deep dive into the costs aspects of leveraging Large Language Models (LLMs) in production environments....
Gad Benram
Aug 14, 20242 min read
Panaya Implements TensorOps' AI Agent Solution to Accelerate Time-to-Market
In an impressive collaboration, TensorOps and Panaya have successfully launched the "SeeMore" AI Agent into production within just three...
Gabriel Gonçalves
Aug 12, 20246 min read
Floor Price Optimization in Programmatic Advertising Using Machine Learning
Introduction As the industry of programmatic advertising becomes more competitive more companies want to build their smart bidding...
Gabriel Gonçalves
Aug 8, 20245 min read
Lifetime Value Predictions for Google Ads and Meta Advertising
Acquiring users who bring long-term value to your business is crucial in today's digital market. Platforms like Google Ads and Meta...
Vasco Reid
Jul 18, 202410 min read
Optimizing User Acquisition and Retention with Machine Learning: Predicting Customer Lifetime Value and Churn in Freemium Mobile Games
Introduction In the highly competitive mobile gaming industry, acquiring and retaining profitable users is crucial for success. The...
Gad Benram
Jun 15, 20245 min read
LLM Proxy & LLM gateway: All You Need to Know
Since OpenAI first introduced ChatGPT, the landscape of AI models has evolved significantly. While OpenAI now offers multiple versions of...
Gad Benram
Jun 10, 20244 min read
Modal: A Powerful Alternative to AWS Lambda for AI Workloads
As the CTO of TensorOps, and previously as a consultant for one of the largest cloud MSPs, I've had the privilege of working with various...
Gad Benram
Jun 8, 20244 min read
Vertex AI Workbench vs Colab Enterprise: Which Notebook Solution is Right for You?
When it comes to data science and machine learning, notebooks (based on Jupyter) are often the main tool for research and exploration as...
Gad Benram
May 26, 20242 min read
Reflections from the RAG Funeral: A Melting Pot of Minds and Misconceptions
On a sad February day, I attended the professional event known as the “RAG Funeral,” which, contrary to its name, was a lively and...
Diogo Azevedo
May 20, 20248 min read
No Clouds Allowed: Building an All Open Source Local RAG System
In today’s AI landscape, companies like Microsoft and Google offer sophisticated Retriever-Augmented Generation (RAG) solutions through...
Miguel Carreira Neves
May 9, 20246 min read
Prompt Eng vs RAG vs Fine-Tuning - What do you need?
Do you know which method to focus on to improve your LLM App? What are the pros and cons? Here we give advice based on our past experience.
Gad Benram
Apr 7, 20247 min read
Cost of AI - What Would an Organization Pay in 2024?
Generative AI has been at the forefront of the automation revolution, particularly since the emergence of ChatGPT. The emergence of...
Clara Gadelho
Mar 26, 202411 min read
RAG vs Large Context Models: How Gemini 1.5 changes the world
Should you use GPT4 or other models with RAG or just send everything in the context to Gemini 1.5?
Gad Benram
Mar 20, 20245 min read
MDClone Revolutionizes Clinical Data Analysis with AI
MDClone: Setting New Standards for clinical analysis with ADAM MDClone is a successful startup that provides the ADAMS Platform, an...
Gad Benram
Feb 10, 202410 min read
Understanding the cost of Large Language Models (LLMs)
What stands behind the cost of LLMs? Do you need to pay for training an LLM and how much does it cost to host one on AWS? Read about it here
Gad Benram
Feb 4, 20246 min read
LLM-FinOps: The Key to Cost-Effective Gen AI Applications
Discover LLM-FinOps: The art of balancing cost, performance, and scalability in AI, where strategic cost monitoring meets innovative perform
Miguel Carreira Neves
Jan 29, 20248 min read
LLM Mixture of Experts Explained
Explaining Mixture of Experts (MoE): GPT4 is just 8 smaller Expert models; Mixtral is just 8 Mistral models. Advantages and disadvantages.
Miguel Carreira Neves
Dec 8, 202313 min read
Advanced Prompt Engineering - Practical Examples
This blog post will cover more complex state-of-the-art methods in prompt engineering including Chains, Agents, and more.
Bruno Alho
Nov 4, 20237 min read
ML Model Deployment Strategies
As a data scientist, you may occasionally train a machine learning model to be part of a production system. Once you have completed the...
bottom of page