All articles
The Integration Tax Is Where Enterprise Data Platforms Get Expensive
Christopher Wagner, Director of AI and Data at Baker Tilly, explains why the real cost of enterprise data is not the platform itself but the security, governance, and consumption layers that organizations have to stitch together around it.

Make CX Current News one of your go-to sources on Google
Use the right tool for the right job. When Snowflake is the right thing, use Snowflake. When it's Databricks, use Databricks.
Every data platform promises consolidation. Most deliver it for the data engineering team and leave everyone else to figure out how to connect. The security framework needs third-party tooling. The ingestion pipeline needs its own integration layer. Governance sits in a separate system. Consumption requires yet another set of connectors. The platform itself may be excellent, but the integration work around it is where time, budget, and complexity actually accumulate.
Christopher Wagner is Director of AI and Data at Baker Tilly, a top-ten US advisory firm, where he leads organizations through modern data architecture and AI-ready implementation. A multi-year Microsoft MVP who has built production solutions on Snowflake, Databricks, and the full Microsoft stack across financial services, insurance, and industrial automation, Wagner sees the integration tax as the defining challenge in enterprise data, not the capabilities of any single platform.
"Use the right tool for the right job," Wagner says. "When Snowflake is the right thing, use Snowflake. When it's Databricks, use Databricks."
The 99% problem
Wagner is careful to credit both platforms on their merits. Snowflake was built by engineers with decades of experience as a cloud-native MPP solution. Databricks pioneered scalable cloud-first architecture with true separation of compute from storage. Both remain strong for teams that live inside data engineering workflows.
The challenge surfaces when everyone else needs to interact with the data. "Any integration pattern you have for consumption has a security challenge associated with it that requires third-party tools, additional integrations, additional components," Wagner says. "Same thing on the input side." That integration overhead compounds across identity, operating systems, ERP connectivity, and governance infrastructure, creating friction that accumulates quietly until it dominates the total cost of the data environment.
Wagner points to unified platform architectures as the structural response. Microsoft Fabric, the platform he has gone deepest on, borrows from both competitors: its warehouses were rebuilt by engineers formerly at Snowflake, and its notebooks run on Spark engines parallel to Databricks.
The difference is that ingestion, storage, governance, and consumption sit inside a single environment with native integration across identity and security. "You almost have to adopt an anti-pattern to break away from those integrations," Wagner says. "So it becomes easier to deliver higher business value automatically."
The utilization gap
The economics reinforce the architecture argument. Wagner cites compute utilization across Snowflake and Databricks environments, averaging roughly 6% when measured across all provisioned tools and spin-up cycles. Environments that unify compute across ingestion, storage, and consumption push that number closer to 24% through automatic bursting and smoothing. The difference matters at scale: teams in fragmented environments spend significant energy optimizing uptime and spin-down cycles rather than building solutions.
Cost confusion compounds the problem. Wagner says teams estimating spend for unified platforms often apply consumption-based frameworks designed for standalone tools and produce inflated projections. "They blow up cost estimates. But when people actually go in to use it, they realize they can pause capacity and manage costs." He cites small companies running full data and analytics environments for as little as $40 per month.
Where AI forces the question
Wagner sees AI adoption accelerating the integration reckoning. As organizations bring AI into their data strategy, security defined only at the consumption layer is no longer sufficient. Platforms that push row-level, column-level, and object-level security down to the storage and file layer give AI systems access to governed data by default. "That is going to become inherently important as AI becomes more predominant," Wagner says. "At that lowest grain layer, the security is defined there."
The practical advice is not to rip out existing platforms. Organizations with years of assets in Snowflake or Databricks should keep them and adopt integration patterns that add unified governance, where they create value. Wagner describes a common data mesh pattern where central IT teams maintain one platform for enterprise assets while domain owners use another for domain-level ownership and governance.
The point is not which platform wins. It is that the integration tax has become a larger line item than the platform spend itself, and the organizations that recognize that are making different architectural decisions because of it.





