Daniel Cazzulino's Blog : Live Mesh FeedSync: an overview of the protocol under the hood

Live Mesh FeedSync: an overview of the protocol under the hood

I already mentioned why I think Live Mesh is cool and that I think the most important part of it, FeedSync, is being largely ignored by reviewers. Fortunately, there’s an extensive interview with the team that goes quite deep in FeedSync and how it works. Go watch it, it’s good info.

At the most basic level, FeedSync is a mechanism to associate versioning “headers” to arbitrary objects (items), and an algorithm to merge and detect conflicts based on that header information. Replace “header” with “extension element” and “arbitrary object” with “RSS/Atom Item” and you have the XML feed version of it:

				<
				item 
				xmlns:sx
				='http://feedsync.org/2007/feedsync'>  
 <title>Buy groceries</title>  
 <sx:sync id='0a7903db47fb0fff' updates='1'>  
  <sx:history sequence='1' by='kzu'/>  
 </sx:sync>  
</item>

The sx:sync element is the versioning header. Every time the item is updated (say, by another user/device), a new sx:history element is added and the updates attribute is incremented:

				<
				item
				>  
 <
				title
				>Buy groceries</title>  
 <sx:sync id='0a7903db47fb0fff' updates='2'>  
   <sx:history sequence='2' by='vga'/>  
   <sx:history sequence='1' by='kzu'/>  
 </sx:sync>  
</item>

If v1 of the item was updated simultaneously by two users and you try to merge them, you’d end up with something like the following:

				<
				item
				>  
 <
				title
				>Buy groceries</title>  
 <sx:sync id='0a7903db47fb0fff' updates='2'>  
   <sx:history sequence='2' by='kzu'/>  
   <sx:history sequence='1' by='kzu'/>  
   <sx:conflicts>  
     <item>  
       <title>Buy icecream</title>  
       <customer id='1' />  
       <sx:sync id='0a7903db47fb0fff' updates='2'>  
         <sx:history sequence='2' by='vga'/>  
         <sx:history sequence='1' by='kzu'/>  
       </sx:sync>  
     </item>  
   </sx:conflicts>  
 </sx:sync>  
</item>

Notice the added sx:conflicts element, which contains the conflicting item in its entirety. The algorithm provides a consistent selection of a default “winner” from the merge (in this case my v2 item, rather than Victor’s). This gives you a consistent state across the mesh. But note that the merge operation with conflict does NOT resolve the conflict. It just surfaces it in a predictable way.

Needless to say, the default winner might not be what the end user wants, so at that point the application can surface the conflict (i.e. a different icon for the item) and allow him to resolve it in a different way (i.e. doing a full content diff and picking the new state). Alternatively, a specific implementation of an item store/adapter (i.e. the feedsync-enabled store for files) can provide automatic resolution if it so wishes to, such as doing an auto-merge of the file contents.

Pablo has more info on how it works and the way the Microsoft Sync Framework exposes this.

It seems really simple at first glance, doesn’t it? The devil is in the details, as usual. It’s obviously a RESTful approach to synchronization, where the item payload is the actual representation of the state. This means that upon conflict, all you have is the state. Implementing automatic conflict resolution can be quite tricky as a consequence.

Automatic Conflict Resolution?

As part of our Mesh4x implementation with InSTEDD, we’re thinking about ways to increase the chances of automatic conflict resolution, as it’s generally desirable and you don’t want to be bothering your users for every apparently conflicting change. Think about a very simple example: tags. Say both user A and B add a tag at the same time, and then they synchronize:

Initial state of both A and B (say, created by yet another user, C):

				<
				item
				>  
  <
				title
				>Buy groceries</title>  
  <sx:sync id='0a7903db47fb0fff' updates='1'>  
    <sx:history sequence='1' by='C'/>  
  </sx:sync>  
</item>

User A changes item by adding a category “to-do” (again, notice the new sx:history):

				<
				item
				>  
  <
				title
				>Buy groceries</title>  
  <category>to-do</category>  
  <sx:sync id='0a7903db47fb0fff' updates='2'>  
    <sx:history sequence='2' by='A'/>  
    <sx:history sequence='1' by='C'/>  
  </sx:sync>  
</item>

Whereas user B changes it by adding a different category “supermarket”:

				<
				item
				>  
  <
				title
				>Buy groceries</title>  
  <category>supermarket</category>  
  <sx:sync id='0a7903db47fb0fff' updates='2'>  
    <sx:history sequence='2' by='B'/>  
    <sx:history sequence='1' by='C'/>  
  </sx:sync>  
</item>

When they sync, the algorithm will detect a conflict, as we have the same sequence number but with different “by”. It will pick a default winner (say, A’s change). But clearly this is a case where we could apply auto-merge semantics if we knew the operation each user performed.

You might be tempted to say: just do a content merge and that’s it! But the semantics of a merge of state cannot be applied generically (i.e. a Company element gets an IsBankrupt=’true’ value from one user, and a new Invoice element from another: at the business logic level, the two changes cannot be merged. If the company went bankrup, you can no longer receive invoices from it), and it’s even quite tricky with XML elements that can be moved around, renamed, deleted and re-inserted across multiple versions, etc.

So one approach we’re thinking is to add command-pattern like hints that an adapter/store can use as hints for automatic conflict resolution and state reconstruction for merging. In the case above, maybe the XML adapter expresses user A operation as a combination of the item payload, the sync versioning header, and diff information about what was changed (added an element inmediately after the first child element of the root, in terms of XPath):

				<
				item
				>  
  <
				title
				>Buy groceries</title>  
  <category>to-do</category>  
  <sx:sync id='0a7903db47fb0fff' updates='2'>  
    <sx:history sequence='2' by='A'/>  
    <sx:history sequence='1' by='C'/>  
  </sx:sync>  
  <diff xmlns=".../xmldiff">  
    <add match="/*[1]">  
      <category>to-do</category>  
    </add>  
  </diff>  
</item>  

The new diff element would be very similar in spirit to the XML Diff and Patch tool from Microsoft and would make it easier for the XML adapter to determine if a given change is compatible for auto-merge (i.e. it’s not a change in a node that was removed, etc.).

I think different adapters might benefit from surfacing different conflict resolution hints. A database may include schema manipulation statements to perform upgrades, for example.

This is exploratory field as we try to make for the best user experience possible. We don’t want to end up in the Groove approach where your choices are: save your changes as a different item, or discard your changes :S.

What do you think?

/kzu

/kzu dev↻d