
Tuesday, March 25, 2014

“Don’t Query From the View” - Unit of Work and AOP

Maintains a list of objects affected by a business transaction and coordinates the writing out of changes and the resolution of concurrency problems.


A colleague asked for my opinion on a warning from NHProfiler: http://hibernatingrhinos.com/products/NHProf/learn/alert/QueriesFromViews “Don’t Query From the View”.

The team had a session-per-controller-action lifetime managed using AOP (attribute filters), but needed to extend the session to remain active during the view rendering phase, after the controller completed.  They did this by instantiating the session in the request filter chain instead of the controller attribute filters.  When they did, they received the NHProfiler warning above.

The colleague was dismissive of the warning, partly because the implementation of the code was innocuous and partly because the reasons given for the warning were not particularly compelling.

There are, however, some serious implications to the pattern being followed, some of which I have covered on this blog in the past (here) and (here).

 
tl;dr:
·        Sessions can be open as long as you need them to be.  You do not need to arbitrarily close them if you know they will be needed again as soon as the next step in the pipeline is reached.
·        There are (arguably) valid reasons why the view rendering might need access to an active session.
·        The longer a session is open, the greater the chance of unexpected behaviour from working with entities in a connected state.  Care should be taken.


Pro1: Simplicity
One benefit of having a session opened and closed automatically using cross-cutting AOP is that your application doesn’t need to care about it: a session will be available when needed, and everything is committed when the scope ends.  This is often done as a controller/action filter attribute, or higher up in the request pipeline.  You don’t need to pass a session around, check whether it exists, and so on.
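
As a rough illustration, a minimal ASP.NET MVC action filter along these lines could manage the session.  This is a sketch only: the attribute name, the static session factory, and the CurrentSessionContext wiring are my assumptions, not the team’s actual implementation.

public class UnitOfWorkAttribute : ActionFilterAttribute
{
    public override void OnActionExecuting(ActionExecutingContext filterContext)
    {
        // Open a session and bind it so repositories can resolve it
        // via SessionFactory.GetCurrentSession().
        var session = MvcApplication.SessionFactory.OpenSession(); // assumed static factory
        session.BeginTransaction();
        NHibernate.Context.CurrentSessionContext.Bind(session);
    }

    public override void OnActionExecuted(ActionExecutedContext filterContext)
    {
        var session = NHibernate.Context.CurrentSessionContext.Unbind(MvcApplication.SessionFactory);
        try
        {
            // Commit everything when the scope ends, or roll back on failure.
            if (filterContext.Exception == null)
                session.Transaction.Commit();
            else
                session.Transaction.Rollback();
        }
        finally
        {
            session.Dispose();
        }
    }
}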

Con1: Deferring code to View Rendering adds complexity.
I have argued that having the session active outside the controller makes the code more difficult to maintain and debug, because the session is active during the view render phase.  The response was that forcing a controller action to pre-load everything the view *might* need, and pushing down a dumb model, prevents the view from optimising what it actually loads.  I don’t believe that view rendering should have such an impact on code execution, but it remains a valid point of contention.

Con2: Loss of control.
When using AOP to manage the session lifetime you have much less control over the failed state.  As the failure occurs outside of the business logic context, you can’t easily use business logic to handle it.  If the standard policy is simply to direct the user to an error page or similar, then this behaviour is completely valid.  However, if you need to perform actions based on the failure (such as attempting to resolve optimistic concurrency conflicts automatically), then you can’t.

Con3: Loss of traceability.
When using a long-running session there is no traceability between the persistence operations and the application logic that made the entity changes.  If you see an unexpected query when the unit of work commits, you can’t immediately identify which code caused it.

Con4: Unexpected change tracking.
Having a long-running (and potentially ‘invisible’) session exposes you to unexpected behaviour, because the session tracks all changes on the underlying entities.  If you load an entity from the data access layer and then make some changes for display purposes (perhaps changing a Name property from “Jason Leach” to “LEACH, Jason”) before passing that entity to the view, the unit of work cleanup will persist that change to the database, because all of your changes are being tracked.
A less obvious example: suppose you have a Parent entity with a list of Children, and in a particular controller action you want the parent and only a subset of the children.  You might select the parent and all children, then do something like Parent.Children = Parent.Children.Where(childFilter).ToList().  Depending on your configuration, this is likely to delete all of the children not matching the filter when the unit of work completes (see the sketch below).  Oops.
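
A minimal sketch of both pitfalls, assuming an open session, invented Person/Parent entities, and an all-delete-orphan cascade on the Children collection:

using (var session = sessionFactory.OpenSession())
using (var tx = session.BeginTransaction())
{
    // Pitfall 1: a "display only" tweak is still a tracked change.
    var person = session.Get<Person>(personId);
    person.Name = "LEACH, Jason"; // flushed to the database on commit

    // Pitfall 2: replacing the tracked collection.  With an
    // all-delete-orphan cascade, the children filtered out here
    // are deleted when the unit of work completes.
    var parent = session.Get<Parent>(parentId);
    parent.Children = parent.Children.Where(c => c.IsActive).ToList();

    tx.Commit(); // both "temporary" changes are now permanent
}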

Con3 and Con4 are direct side effects of leaving a session open longer.  In the Parent->Child example, you would likely only need the session active for the initial load of the Parent and Child entities; you could then close the session and start manipulating the entities.  Obviously you should be mapping entities to ViewModels and never manipulating the entities themselves, but it is a very easy mistake to make.  Coupled with Con3, it can be near impossible to identify in a complex unit of work.

Conclusion
As long as you are careful about what you are doing, and not arbitrarily altering domain model entities while a session is open, long-running sessions are not a problem.  However, there is significant potential for issues if your devs don't understand the risks.
As long as you are happy with standard error/failure handling, or UI-driven resolution workflows, then AOP for unit of work management is acceptable, again provided the risks and limitations are acknowledged.

 

Thursday, February 27, 2014

NHibernate Cross-database Hacks

Cross-Database Queries


Ok, so this is a bit of a hack, but it does work.  Thanks to this, which set me down the "right" path.

I work with a few legacy systems that have central databases for common information and individual databases for application-specific information, with joins across the databases to retrieve the data required.
A separate service model was introduced for the common data, but when performing filtered queries across both data sets the service model was not efficient (this is a greater issue of context boundaries that I won’t go into here).  Reproducing in nHibernate the queries previously performed by stored procedures with cross-database joins required a bit of a cheat.

nHibernate mappings have a “schema” property as well as the more commonly used “table” property.  By manipulating this schema property you can convince nHibernate to perform cross-database queries: set the schema to “{database}.{schema}” and any join query to that element will use the full cross-database syntax when converted to SQL.
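
For example, in a Fluent NHibernate class map (the entity and database names here are placeholders):

public class CustomerMap : ClassMap<Customer>
{
    public CustomerMap()
    {
        Schema("OtherDatabase.dbo"); // "{database}.{schema}"
        Table("Customer");
        Id(x => x.Id);
        Map(x => x.Name);
    }
}

Any query joining to Customer will now be generated against OtherDatabase.dbo.Customer in the SQL.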

Neat (but ultimately not very satisfying because it is not a very nice design).


Bigger Hack, run away


If the target database name is not known until runtime, you can push the hack even further to support this.

During the configuration of the nHibernate session factory, you can make a few modifications that will allow you to update the schema property of an entity.  This is useful if you have a different ‘other’ database name for each environment (e.g. OtherDatabase-dev, OtherDatabase-prd).

First we generate the fluent configuration and build it.
We then iterate through the class mappings; each persistentClass.MappedClass is the underlying POCO model object of the entity.
We check whether it is one we want to override the schema property for (IOtherDatabaseEntity is a simple marker interface; this could equally be done via a naming convention or similar), and update the schema property on the mapping.
Finally, we create the session factory from the modified configuration.

var fluentConfiguration = config.Mappings(m =>
        m.FluentMappings
        .AddFromAssemblyOf<MyEntity>() // MyEntity is a placeholder: any type from your mapping assembly
        );
var builtConfig = fluentConfiguration.BuildConfiguration();

foreach (PersistentClass persistentClass in builtConfig.ClassMappings)
{
        if (typeof(IOtherDatabaseEntity).IsAssignableFrom(persistentClass.MappedClass))
        {
            // cdrName holds the runtime database name, e.g. "OtherDatabase-dev"
            persistentClass.Table.Schema = cdrName;
        }
}

ISessionFactory sessionFactory = builtConfig.BuildSessionFactory();


Hacks away!

NHibernate + LINQPad

While looking at ways to assess and improve the performance of some nHibernate queries, I was frustrated with the tools at my disposal.

I will start with the point: I don't have access to NHProf.  It seems to be the gold standard for any nHibernate shop, but them's the breaks.

What I was doing:


It was painful to execute the entire codebase to run one or two queries and view the output SQL, so I started writing integration-style unit tests for query optimisation.  This was slightly better, but still required an annoying change-compile-run-review cycle.

What was I thinking:

I then remembered I have a personal LINQPad license that I hadn't used in a while and wondered if I could get it working.  I saw this, which helped me on my way, but we don't use nHibernate.Linq, so the steps were a bit different.

The outcome was extremely useful, however, and now I can tinker with my queries much more freely.

How I did it:

To start you need to add the references to nHibernate that you require (in my case NHibernate, Iesi.Collections, and FluentNHibernate).  You then add the references to your Domain / Models and mapping assemblies.

The next step is to create a hibernate.cfg.xml file with the appropriate configuration for your database.  Make sure you set show_sql=true in the configuration so LINQPad can display the generated SQL.
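
As a starting point, a minimal hibernate.cfg.xml looks something like this (the connection string and dialect are placeholders for your environment):

<?xml version="1.0" encoding="utf-8"?>
<hibernate-configuration xmlns="urn:nhibernate-configuration-2.2">
  <session-factory>
    <property name="connection.connection_string">Server=.;Database=MyDb;Integrated Security=SSPI;</property>
    <property name="dialect">NHibernate.Dialect.MsSql2008Dialect</property>
    <!-- echo generated SQL so LINQPad can display it -->
    <property name="show_sql">true</property>
  </session-factory>
</hibernate-configuration>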

Then you can call the following code:

var cfg = new Configuration().Configure(@"D:\linqpad\ahs\hibernate.cfg.xml");
var factory = Fluently.Configure(cfg)
                .Mappings(m =>
                {
                    // MyEntityMap is a placeholder: any type from your mapping assembly
                    m.FluentMappings.AddFromAssemblyOf<MyEntityMap>();
                })
                .BuildSessionFactory();


using (var session = factory.OpenSession())
{
    // QueryOver needs its entity type; Site stands in for the entity under test
    var sites = session.QueryOver<Site>().List();
}
and voilà, you can now tinker with queries as you will, with immediate feedback on the generated SQL and execution time.


You can then save this as a query, or save the assembly references/using statements as a snippet to get you up and running quickly for new queries. 

Caveats:

This method only works with pre-compiled entity mappings, so if you intend to improve performance at the entity mapping layer you still need to do this through your application and export the assemblies for LINQPad to use.

Extensions:

LINQPad allows you to create an 'application config' file that is used when your inner assemblies require web/app.config sections.  Run the code:
AppDomain.CurrentDomain.SetupInformation.ConfigurationFile
to find the location of the file, and if it does not exist, create it.  Note that unlike most .NET apps, this is not LINQPad.exe.config but LINQPad.config.  Enter any configuration you need into this file.  This can include the nHibernate config instead of the separate file (but that limits configuration flexibility).

This allows you to configure things like nHibernate second-level cache instances, such as memcache.  As long as you include the necessary libraries in the query references, and the configuration in the LINQPad.config file, this will work and provide even greater flexibility for performance analysis and testing.


Conclusion:

So there you go, a "poor man's" guide to nHibernate performance analysis, thanks to the ever awesome LINQPad.

Tuesday, November 15, 2011

EF Deleted Items Issue

I noticed an issue in one of my service methods whereby a record I deleted showed up in a subsequent query within a single unit of work.

The example code is

int orderId = order.OrderID;
_orderRepository.delete(order);
Order newOrder = _orderRepository.getAll(x => x.OrderID == orderId).FirstOrDefault(); // getAll returns the matching set; take the first match

The above example is a bit contrived; there's a fair bit more that goes on, but this code highlights the issue.

Now that I know what is going on this is relatively straightforward, but it is a bit counterintuitive when starting out.

The problem was in my repository, where I was using the DbContext DbSet property for each entity directly, instead of the DbSet.Local property. The difference between the two is that the root DbSet property contains all elements in their modified state (e.g. it contains the deleted order, with a state of Deleted), while the Local property (an ObservableCollection view of the entities the context is tracking) holds the entities in their 'current state', so if you delete an entity from the context it is removed from the Local collection.
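
A condensed illustration of the difference (assuming a context with a DbSet<Order> named Orders):

var order = context.Orders.First(o => o.OrderID == orderId);
context.Orders.Remove(order); // marked Deleted, nothing saved yet

// Querying the root DbSet goes back to the database, which still
// holds the row, so the "deleted" order comes back.
var fromSet = context.Orders.FirstOrDefault(o => o.OrderID == orderId); // not null

// Local reflects the in-memory state: Remove() took the entity out
// of the ObservableCollection, so it is not visible here.
var fromLocal = context.Orders.Local.FirstOrDefault(o => o.OrderID == orderId); // null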

I say this is counterintuitive because the only way to identify whether an item is deleted is through the root context's Entry() method; you cannot base a query on the DbSet that excludes deleted items.

The solution is however fairly simple. Since I am using a Unit of Work pattern on the context, and my service methods are a single unit of work, I can use the Local DbSet for my repository actions without any issues down the line with disconnected or orphan entities, and I can do this without any modifications to my service.

So where all my repository queries used to be based on
IQueryable<T> query = _set.AsQueryable(); // _set is the appropriate DbSet for the entity T in the context
I now simply base them on
IQueryable<T> query = _set.Local.AsQueryable();

Now deleted items should not show up in my query results. I hope - I haven't had a chance to actually test it just yet.



*edit*

Well, that was short-lived - it seems that using the Local context only works on previously loaded data. For instance, if you do a load, then delete an entity, a subsequent load from the Local context will correctly omit the deleted item; but entities that were never loaded into the context won't appear in Local at all.



This is incredibly frustrating, as it means I need to know under which scenario I am 'loading' data in order to choose the right context to load from, and it means I need to front-load all the entities I will be working with, then use the Local context from that point on.



I am seriously thinking of switching to nHibernate over this one.


*edit 2*

I have identified a possible solution, but I am concerned by the performance implications.

When performing a query on my context, I can use LINQ to check the state of each entity in the result and filter the results further.



query.Where(whereClause)
     .ToList() // materialise so the state check runs in memory
     .Where(x => ((DbContext)_context).Entry(x).State != System.Data.EntityState.Deleted)
     .ToList();


The two performance issues with this are

a) I need to 'ToList()' the query before applying the state filter (otherwise EF will attempt to translate the filter into the SQL query, which it can't do). This is not ideal, but not critical, and I may be able to force the first Where() to resolve the EF part in another way to avoid the extra list creation.
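
One likely candidate is AsEnumerable(), which switches the rest of the query to LINQ-to-Objects without materialising an intermediate list (sketched here; I haven't benchmarked it):

var results = query
    .Where(whereClause)   // translated to SQL by EF
    .AsEnumerable()       // everything after this runs in memory, streamed
    .Where(x => ((DbContext)_context).Entry(x).State != System.Data.EntityState.Deleted)
    .ToList();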


and b) queries that return a large number of results will be impacted (potentially severely), since each entity is inspected individually against the context change tracker.


So perhaps an nHibernate implementation can wait, if this does what I want without critical performance implications.