ABOUT ME

-

Today
-
Yesterday
-
Total
-
  • Command Query Responsibility Segregation (CQRS)
    Modeling/Architecture 2020. 3. 10. 08:58

    1. Overview

    The Command and Query Responsibility Segregation (CQRS) pattern separates read and update operations for a data store. Implementing CQRS in your application can maximize its performance, scalability, and security. The flexibility created by migrating to CQRS allows a system to better evolve over time and prevents update commands from causing merge conflicts at the domain level.

    1.1 Motivation

    In traditional architectures, the same data model is used to query and update a database. That's simple and works well for basic CRUD operations. In more complex applications, however, this approach can become unwieldy. For example, on the read side, the application may perform many different queries, returning data transfer objects (DTOs) with different shapes. Object mapping can become complicated. On the write side, the model may implement complex validation and business logic. As a result, you can end up with an overly complex model that does too much.

    Read and write workloads are often asymmetrical, with very different performance and scale requirements.

    • There is often a mismatch between the read and write representations of the data, such as additional columns or properties that must be updated correctly even though they aren't required as part of an operation.
    • Data contention can occur when operations are performed in parallel on the same set of data.
    • The traditional approach can have a negative effect on performance due to the load on the data store and data access layer, and the complexity of queries required to retrieve information.
    • Managing security and permissions can become complex because each entity is subject to both read and write operations, which might expose data in the wrong context.

    2. Command Query Responsibility Separation

    CQRS separates reads and writes into different models, using commands to update data, and queries to read data.

    • Commands should be task-based, rather than data-centric. ("Book hotel room", not "set ReservationStatus to Reserved").
    • Commands may be placed on a queue for asynchronous processing, rather than being processed synchronously.
    • Queries never modify the database. A query returns a DTO that does not encapsulate any domain knowledge.

    The models can then be isolated, as shown in the following diagram, although that's not an absolute requirement

    Having separate query and update models simplifies the design and implementation. However, one disadvantage is that CQRS code can't automatically be generated from a database schema using scaffolding mechanisms such as O/RM tools.

    For greater isolation, you can physically separate the read data from the write data. In that case, the read database can use its own data schema that is optimized for queries. For example, it can store a materialized view of the data, in order to avoid complex joins or complex O/RM mappings. It might even use a different type of data store. For example, the write database might be relational, while the read database is a document database.

    If separate read and write databases are used, they must be kept in sync. Typically this is accomplished by having the write model publish an event whenever it updates the database. Updating the database and publishing the event must occur in a single transaction.

    The read store can be a read-only replica of the write store, or the read and write stores can have a different structure altogether. Using multiple read-only replicas can increase query performance, especially in distributed scenarios where read-only replicas are located close to the application instances.

    Separation of the read and write stores also allows each to be scaled appropriately to match the load. For example, read stores typically encounter a much higher load than write stores.

    Some implementations of CQRS use the Event Sourcing pattern. With this pattern, the application state is stored as a sequence of events. Each event represents a set of changes to the data. The current state is constructed by replaying the events. In a CQRS context, one benefit of Event Sourcing is that the same events can be used to notify other components — in particular, to notify the read model. The read model uses events to create a snapshot of the current state, which is more efficient for queries. However, Event Sourcing adds complexity to the design.

    3. Benefits of CQRS

    3.1 Independent scaling

    CQRS allows the read and write workloads to scale independently, and may result in fewer lock contentions.

    3.2 Optimized data schemas

    The read side can use a schema that is optimized for queries, while the write side uses a schema that is optimized for updates.

    3.3 Security

    It's easier to ensure that only the right domain entities are performing writes on the data.

    3.4 Separation of concerns

    Segregating the read and write sides can result in models that are more maintainable and flexible. Most of the complex business logic goes into the write model. The read model can be relatively simple.

    3.5 Simpler queries

    By storing a materialized view in the read database, the application can avoid complex joins when querying.

    4. Implementation issues and considerations

    Some challenges in implementing this pattern include:

    4.1 Complexity

    The basic idea of CQRS is simple. But it can lead to more complex application design, especially if they include the Event Sourcing pattern.

    4.2 Messaging

    Although CQRS does not require messaging, it's common to use messaging to process commands and publish update events. In that case, the application must handle message failures or duplicate messages.

    4.3 Eventual consistency

    If you separate the read and write databases, the read data may be stale. The read model store must be updated to reflect changes to the write model store, and it can be difficult to detect when a user has issued a request based on stale read data.

    5. When to use

    Consider CQRS for the following scenarios:

    • Collaborative domains where many users access the same data in parallel. CQRS allows you to define commands with enough granularity to minimize merge conflicts at the domain level, and conflicts that do arise can be merged by the command.
    • Task-based user interfaces where users are guided through a complex process as a series of steps or with complex domain models. The write model has a full command-processing stack with business logic, input validation, and business validation. The write model may treat a set of associated objects as a single unit for data changes (an aggregate, in DDD terminology) and ensure that these objects are always in a consistent state. The read model has no business logic or validation stack and just returns a DTO for use in a view model. The read model is eventually consistent with the write model.
    • Scenarios, where performance of data reads must be fine-tuned separately from performance of data, write, especially when the number of reads is much greater than the number of writes. In this scenario, you can scale out the read model, but run the write model on just a few instances. A small number of write model instances also helps to minimize the occurrence of merge conflicts.
    • Scenarios where one team of developers can focus on the complex domain model that is part of the write model, and another team can focus on the read model and the user interfaces.
    • Scenarios where the system is expected to evolve over time and might contain multiple versions of the model, or where business rules change regularly.
    • Integration with other systems, especially in combination with event sourcing, where the temporal failure of one subsystem shouldn't affect the availability of the others.

    6. When not to use

    • The domain or business rules are simple.
    • A simple CRUD-style user interface and data access operations are sufficient.

    Consider applying CQRS to limited sections of your system where it will be most valuable.

    7. Event Sourcing and CQRS

    The CQRS pattern is often used along with the Event Sourcing pattern. CQRS-based systems use separate read and write data models, each tailored to relevant tasks and often located in physically separate stores. When used with the Event Sourcing pattern, the store of events is the write model and is the official source of information. The read model of a CQRS-based system provides materialized views of the data, typically as highly denormalized views. These views are tailored to the interfaces and display requirements of the application, which helps to maximize both display and query performance.

    Using the stream of events as the write store, rather than the actual data at a point in time, avoids update conflicts on a single aggregate and maximizes performance and scalability. The events can be used to asynchronously generate materialized views of the data that are used to populate the read store.

    Because the event store is the official source of information, it is possible to delete the materialized views and replay all past events to create a new representation of the current state when the system evolves, or when the read model must change. The materialized views are in effect a durable read-only cache of the data.

    When using CQRS combined with the Event Sourcing pattern, consider the following:

    • As with any system where the write and read stores are separate, systems based on this pattern are only eventually consistent. There will be some delay between the event being generated and the data store is updated.
    • The pattern adds complexity because code must be created to initiate and handle events, and assemble or update the appropriate views or objects required by queries or a read model. The complexity of the CQRS pattern when used with the Event Sourcing pattern can make a successful implementation more difficult, and requires a different approach to designing systems. However, event sourcing can make it easier to model the domain, and makes it easier to rebuild views or create new ones because the intent of the changes in the data is preserved.
    • Generating materialized views for use in the read model or projections of the data by replaying and handling the events for specific entities or collections of entities can require significant processing time and resource usage. This is especially true if it requires summation or analysis of values over long periods because all the associated events might need to be examined. Resolve this by implementing snapshots of the data at scheduled intervals, such as a total count of the number of a specific action that has occurred, or the current state of an entity.

    8. Example

    8.1 Read Model

    The model interfaces don't dictate any features of the underlying data stores, and they can evolve and be fine-tuned independently because these interfaces are separated.

    // Query interface
    namespace ReadModel
    {
      public interface ProductsDao
      {
        ProductDisplay FindById(int productId);
        ICollection<ProductDisplay> FindByName(string name);
        ICollection<ProductInventory> FindOutOfStockProducts();
        ICollection<ProductDisplay> FindRelatedProducts(int productId);
      }
    
      public class ProductDisplay
      {
        public int Id { get; set; }
        public string Name { get; set; }
        public string Description { get; set; }
        public decimal UnitPrice { get; set; }
        public bool IsOutOfStock { get; set; }
        public double UserRating { get; set; }
      }
    
      public class ProductInventory
      {
        public int Id { get; set; }
        public string Name { get; set; }
        public int CurrentStock { get; set; }
      }
    }

    8.2 Write Model

    The system allows users to rate products. The application code does this using the RateProduct command shown in the following code.

    public interface ICommand
    {
      Guid Id { get; }
    }
    
    public class RateProduct : ICommand
    {
      public RateProduct()
      {
        this.Id = Guid.NewGuid();
      }
      public Guid Id { get; set; }
      public int ProductId { get; set; }
      public int Rating { get; set; }
      public int UserId {get; set; }
    }

    8.3 Event Handler

    The system uses the ProductsCommandHandler class to handle commands sent by the application. Clients typically send commands to the domain through a messaging system such as a queue. The command handler accepts these commands and invokes methods of the domain interface. The granularity of each command is designed to reduce the chance of conflicting requests. The following code shows an outline of the ProductsCommandHandler class.

    public class ProductsCommandHandler :
        ICommandHandler<AddNewProduct>,
        ICommandHandler<RateProduct>,
        ICommandHandler<AddToInventory>,
        ICommandHandler<ConfirmItemShipped>,
        ICommandHandler<UpdateStockFromInventoryRecount>
    {
      private readonly IRepository<Product> repository;
    
      public ProductsCommandHandler (IRepository<Product> repository)
      {
        this.repository = repository;
      }
    
      void Handle (AddNewProduct command)
      {
        ...
      }
    
      void Handle (RateProduct command)
      {
        var product = repository.Find(command.ProductId);
        if (product != null)
        {
          product.RateProduct(command.UserId, command.Rating);
          repository.Save(product);
        }
      }
    
      void Handle (AddToInventory command)
      {
        ...
      }
    
      void Handle (ConfirmItemsShipped command)
      {
        ...
      }
    
      void Handle (UpdateStockFromInventoryRecount command)
      {
        ...
      }
    }

    9. Reference

    https://docs.microsoft.com/en-us/azure/architecture/patterns/cqrs

    https://www.jameseastham.co.uk/posts/cqrs-in-microservices-a-practical-example-2p86/

    https://dzone.com/articles/cqrs-by-example-introduction

    https://martinfowler.com/bliki/CQRS.html

    https://culttt.com/2015/01/14/command-query-responsibility-segregation-cqrs/

    댓글

Designed by Tistory.