top of page
Diogo Gonçalves
Jan 65 min read
Vector DBs Will Not Save Your RAG
The AI world has collectively fixated on Vector Databases as the holy grail for scalable, accurate information retrieval and synthesis to so
Gad Benram
Dec 25, 20246 min read
Agents Are Just Long-Running Jobs: A Pragmatic View of an Overhyped AI
The Routing Workflow. Source: Anthropic "Building effective agents" Building an “AI agent” sounds exciting —visions of an autonomous...
TensorOps
Dec 8, 20244 min read
Emerging Architectures of LLM Applications (2025 Update)
The world of AI applications is changing rapidly. Not too long ago, most AI systems were simple: a single model received input, made a...
Gad Benram
Dec 5, 20245 min read
Faster Than XGBoost: Using Catboost with C++
Integrating machine learning models into production environments often requires a balance between performance, compatibility, and ease of...
Miguel Carreira Neves
Nov 7, 20246 min read
Contextual Retrieval - Enhancing RAG Performance
Traditional RAG systems cannot maintain context in retrieved information. Contextual Retrieval addresses this by enriching data with context
Diogo Azevedo
Oct 18, 20243 min read
Deploying LLM Proxy on Google Kubernetes Engine: A Step-by-Step Guide
In our previous post , we explored the concept of an LLM Proxy and its importance in scalable LLM application architectures. In this...
Higor Ribeiro de Oliveira
Oct 17, 20244 min read
Cohort-Based Forecasting: A Technical Deep Dive
At TensorOps , we specialize in implementing AI solutions that drive business growth. One powerful application of AI in the business...
Clara Gadelho
Oct 14, 20249 min read
Building AI and LLM Agents from the Ground Up: A Step-by-Step Guide
OpenAI’s vision of creating artificial general intelligence (AGI) might still be futuristic, but today’s AI agents are already making a sign
Bruno Alho
Oct 14, 20244 min read
Comparing Context Caching in LLMs: OpenAI vs. Anthropic vs. Google Gemini
Compare context caching in LLMs—OpenAI, Anthropic, Google Gemini. Discover the best option for your project's cost, ease, and features.
Gad Benram
Oct 13, 20245 min read
10 Essential AI Technologies for Software Supply Chain Companies
Table of Contents Introduction The Software Supply Chain AI in Software Development: The Rise of Code Assistants...
Gad Benram
Oct 13, 20246 min read
Knowledge Graph RAG vs. Vector DB RAG: Is It Time for GraphDBs to Shine?
The emergence of AI has revolutionized the way we interact with data—or even knowledge itself. Among the buzzwords circulating in the...
Diogo Gonçalves
Oct 8, 20246 min read
Moving from Chatbots to Agents
While the terms “Chatbot” and “AI agent” are sometimes used interchangeably, there are notable differences between them:
Gad Benram
Sep 24, 20245 min read
Prompt Translation: The Way to Switch Between LLMs Without Losing Performance
Since the debut of ChatGPT in 2023, the landscape of Large Language Models (LLMs) has evolved dramatically. Back then, the primary...
Gad Benram
Sep 20, 20244 min read
UX in LLM Applications: Examples of 4 Companies Getting It Right and 1 That Missed the Mark
Over the past year, TensorOps has observed a recurring scenario: organizations invest significant time—often 5-7 months—fine-tuning...
Gad Benram
Sep 12, 20243 min read
OpenAI Unveils o1 Model: The Biggest Leap Towards AGI since ChatGPT
September 12, 2024 OpenAI has unveiled its latest breakthrough in artificial intelligence—the O1 model series—now available in Preview....
Gad Benram
Aug 31, 20245 min read
What can shift Nvidia's stock up or down?
NVIDIA's stock soared 2750% due to $10B in Q2 data center sales. Buyers like Google and Meta aim to leverage AI tech, but is it a bubble?
TensorOps
Aug 29, 20242 min read
Lessons Learned From Managing AI Innovation Projects
Watch the video here: In this session, Senior Engineering Managers will share the pains and successes of over 18 months into the GenAI...
TensorOps
Aug 29, 20241 min read
Reducing the costs of AI and LLM applications
Understand Your Costs: Discover why many companies struggle with scattered billing and multiple vendor payments
TensorOps
Aug 29, 20241 min read
Beyond PoC: Enterprise Chatbot Architectures
This one-hour webinar showcasing architectures for enterprise-grade chatbots, moving beyond the proof of concept stage. learn how to...
TensorOps
Aug 29, 20241 min read
A Survey of Advanced Prompt Engineering Techniques
This one-hour webinar exploring the Secrets of Prompt Engineering , we'll discuss how prompt engineering resembles programming and what...
bottom of page