AI Blog | TensorOps

Amazon Q vs Google Agentspace: What’s the Best Enterprise Agent Tool?

Updated in May 2025 Enterprise AI assistants are no longer science‑fiction sidekicks. In 2025, they sit at the centre of board‑room...

João Sousa

May 53 min read

Self-Hosting Large Language Models in China: Limitations and Possibilities

While cloud vendors are ramping up their supply of AI models and GPU hardware in China, companies that are required to self-host these...

Bernardo Pissarra

Apr 721 min read

Emerging Architectures of LLM Applications 2025

Watch the full webinar, "Building the Future of AI: Emerging Architectures of LLM Applications in 2025

TensorOps

Mar 31 min read

Wenrix’s AI-Driven Transformation with Amazon Q Business

Wenrix is an AI-driven company that helps travel agencies optimize airfare spending from start to finish. By leveraging advanced machine...

Gabriel Gonçalves

Feb 273 min read

Deepseek - Open Source Revolution in AI Models - (AI Lover podcast)

The conversation explores the emergence of DeepSeek as a significant player in the AI landscape, particularly in the context of open-source

TensorOps

Feb 191 min read

DeepSeek-V3 Technical Analysis - MoE, Fine-Grained Quantization, DualPipe, MLA

Analysis of Performance and Technical Innovations. Dive into Mixture of Experts (MoE), Fine-Grained Quantization, DualPipe, MLA and more.

Miguel Carreira Neves

Feb 1313 min read

Vector DBs Will Not Save Your RAG

The AI world has collectively fixated on Vector Databases as the holy grail for scalable, accurate information retrieval and synthesis to so

Diogo Gonçalves

Jan 65 min read

Agents Are Just Long-Running Jobs: A Pragmatic View of an Overhyped AI

The Routing Workflow. Source: Anthropic "Building effective agents" Building an “AI agent” sounds exciting —visions of an autonomous...

Gad Benram

Dec 25, 20246 min read

Emerging Architectures of LLM Applications (2025 Update)

The world of AI applications is changing rapidly. Not too long ago, most AI systems were simple: a single model received input, made a...

TensorOps

Dec 8, 20244 min read

Faster Than XGBoost: Using Catboost with C++

Integrating machine learning models into production environments often requires a balance between performance, compatibility, and ease of...

Gad Benram

Dec 5, 20245 min read

Contextual Retrieval - Enhancing RAG Performance

Traditional RAG systems cannot maintain context in retrieved information. Contextual Retrieval addresses this by enriching data with context

Miguel Carreira Neves

Nov 7, 20246 min read

Deploying LLM Proxy on Google Kubernetes Engine: A Step-by-Step Guide

In our previous post , we explored the concept of an LLM Proxy and its importance in scalable LLM application architectures. In this...

Diogo Azevedo

Oct 18, 20243 min read

Cohort-Based Forecasting: A Technical Deep Dive

At TensorOps , we specialize in implementing AI solutions that drive business growth. One powerful application of AI in the business...

Higor Ribeiro de Oliveira

Oct 17, 20244 min read

Building AI and LLM Agents from the Ground Up: A Step-by-Step Guide

OpenAI’s vision of creating artificial general intelligence (AGI) might still be futuristic, but today’s AI agents are already making a sign

Clara Gadelho

Oct 14, 20249 min read

Comparing Context Caching in LLMs: OpenAI vs. Anthropic vs. Google Gemini

Compare context caching in LLMs—OpenAI, Anthropic, Google Gemini. Discover the best option for your project's cost, ease, and features.

Bruno Alho

Oct 14, 20244 min read

10 Essential AI Technologies for Software Supply Chain Companies

Table of Contents Introduction The Software Supply Chain AI in Software Development: The Rise of Code Assistants...

Gad Benram

Oct 13, 20245 min read

Knowledge Graph RAG vs. Vector DB RAG: Is It Time for GraphDBs to Shine?

The emergence of AI has revolutionized the way we interact with data—or even knowledge itself. Among the buzzwords circulating in the...

Gad Benram

Oct 13, 20246 min read

Moving from Chatbots to Agents

While the terms “Chatbot” and “AI agent” are sometimes used interchangeably, there are notable differences between them:

Diogo Gonçalves

Oct 8, 20246 min read

Prompt Translation: The Way to Switch Between LLMs Without Losing Performance

Since the debut of ChatGPT in 2023, the landscape of Large Language Models (LLMs) has evolved dramatically. Back then, the primary...

Gad Benram

Sep 24, 20245 min read

UX in LLM Applications: Examples of 4 Companies Getting It Right and 1 That Missed the Mark

Over the past year, TensorOps has observed a recurring scenario: organizations invest significant time—often 5-7 months—fine-tuning...

Gad Benram

Sep 20, 20244 min read

Amazon Q vs Google Agentspace: What’s the Best Enterprise Agent Tool?

Self-Hosting Large Language Models in China: Limitations and Possibilities

Emerging Architectures of LLM Applications 2025

Wenrix’s AI-Driven Transformation with Amazon Q Business

Deepseek - Open Source Revolution in AI Models - (AI Lover podcast)

DeepSeek-V3 Technical Analysis - MoE, Fine-Grained Quantization, DualPipe, MLA

Vector DBs Will Not Save Your RAG

Agents Are Just Long-Running Jobs: A Pragmatic View of an Overhyped AI

Emerging Architectures of LLM Applications (2025 Update)

Faster Than XGBoost: Using Catboost with C++

Contextual Retrieval - Enhancing RAG Performance

Deploying LLM Proxy on Google Kubernetes Engine: A Step-by-Step Guide

Cohort-Based Forecasting: A Technical Deep Dive

Building AI and LLM Agents from the Ground Up: A Step-by-Step Guide

Comparing Context Caching in LLMs: OpenAI vs. Anthropic vs. Google Gemini

10 Essential AI Technologies for Software Supply Chain Companies

Knowledge Graph RAG vs. Vector DB RAG: Is It Time for GraphDBs to Shine?

Moving from Chatbots to Agents

Prompt Translation: The Way to Switch Between LLMs Without Losing Performance

UX in LLM Applications: Examples of 4 Companies Getting It Right and 1 That Missed the Mark

Amazon Q vs Google Agentspace: What’s the Best Enterprise Agent Tool?