Click here to Skip to main content
15,886,873 members
Articles / Programming Languages / C#

A Fast/Compact Serialization Framework

Rate me:
Please Sign up or sign in to vote.
4.85/5 (37 votes)
13 Oct 2010GPL35 min read 282K   1.2K   175   97
A framework for object serializiation/deserialization that is many times faster and yields a compact output.

Introduction

NxSerialization is an easy to use object serialization framework that replaces the functionality of the default serialization providers for .NET and Mono. The binary formatter for NxSerialization can be up to 50 times faster than the default binary formatter for .NET and Mono. This is evident from the screenshot of the benchmark application shown above. There are three main benefits this framework provides to applications that serialize objects. The main benefits being that of increased space and time performance, and enhanced security comes as a byproduct.

Quick Facts

The figure below contains benchmark results using the sample application shipped with NxSerialization for CLI. The important values are given in bold. The time measured was for 100 iterations of 100 runs each. In each run, an object of the specified type was serialized and then deserialized. These results may vary depending upon the system configuration; however, the important thing to consider is the relative difference or the performance factors between the native and the NxSerializer.

Warning: These stats are from the previous release, and do not reflect comparison with latest native formatters.

Size based comparison of .NET and NxSerialization formatters

Size based comparison of .NET and NxSerialization formatters

Time based comparison of .NET and NxSerialization formatters

Time based comparison of .NET and NxSerialization formatters

What is New in 3.0?

There is nothing substantially new in this release, except the inclusion of the Remoting sub-system and a few unfinished features. After an extended long period of inactivity and quite some queries to release the Remoting specific portions, I have finally decided to release all that I had in my dev folders, and it's probably going to be the last release ever.

An interesting observation is that the latest versions of CLR have much improved native formatters, and what used to be on the average >5 times speed gain in the past is now much reduced. The stats above are therefore not representative of comparison with latest .NET versions. It also follows that the toolkit has probably seen its time :)

Unfinished Features

EAR - (Emit Avoid Reflection)

Some of the surrogates have an EAR property that when set uses dynamic IL to facilitate creation of objects and avoids the abhorred Activator.CreateInstance that is not known to be a super-fast way. The support is in early stages, and not rigorously tested, and therefore issues may popup. Moreover, there is no way to configure EAR externally, and source modifications are needed should you want to try it.

Remoting

The ability to use NxSerialization in Remoting sinks should theoretically speedup Remoting code - though the network latency may overshadow it - but surprisingly, the results have always been quite the opposite (which is why I never released it). There are also issues with HTTP channels (some functionality is missing), as well as Channel security that does not work at all.

Surrogates for System.Data.*

Still unimplemented - even though a straightforward task.

I would love to know if anyone still finds it useful and could spot the shortcomings in Remoting slowdown and suggest a fix. As always, your feedback is highly welcome!

Using the Framework

Application objects can be integrated with the framework in two ways. By writing a surrogate for the object type and registering the surrogate with the framework, or by implementing INxSerializable. The framework provides a built-in surrogate for types that implement INxSerializable. For unknown types, native .NET serialization is used.

The following sample of code demonstrates a type that implements INxSerializable. Note the line at the bottom that registers the type with the framework.

C#
// Sample class that implements INxSerializable
[Serializable]
class SampleCompactableClass : INxSerializable
{
   private String title = "SampleCompactableClass";

   void INxSerializable.Serialize(INxBinaryWriter w)
   {
      w.Write(title);
   }

   void INxSerializable.Deserialize(INxBinaryReader r)
   {
      title = r.ReadString();
   }
}

...
// Register the class with the framework.
NxFormatterServices.Default.RegisterKnownType(typeof(SampleCompactableClass));

The following sample of code demonstrates a sample surrogate for another type that does not implement INxSerializable. Using surrogates is the only way the framework is able to compactly serialize .NET native types.

C#
// Sample surrogate for SampleSurrogatedClass
class SampleSurrogate : NxSerializationSurrogate
{
   public SampleSurrogate() : base(typeof(SampleSurrogatedClass)) {}

   public override object Read(INxBinaryReader r)
   {
      SampleSurrogatedClass obj = new SampleSurrogatedClass();
      obj.title = r.ReadString();
      return obj;
   }

   public override void Write(INxBinaryWriter w, object graph)
   {
      SampleSurrogatedClass obj = (SampleSurrogatedClass) graph;
      w.Write(obj.title);
   }
}

// Sample class that does not implement INxSerializable
[Serializable]
class SampleSurrogatedClass
{
   internal string title = "SampleSurrogatedClass";
}

...
// Register the surrogate with the framework.
NxTypeSurrogateSelectorNative.Default.Register(new SampleSurrogate());

Everything else is pretty much self-explanatory. For more information, look at the sample benchmark application provided with the source code.

Comments

Please note that for objects where the actual data size to type-info size ratio is very large, not much memory reduction will occur. Try a byte array of size 100K. It is also possible to come up with a case where the native serializer is actually more efficient in terms of CPU.

Among other possibilities with the framework are:

  • Enhanced security as custom serialization protects your object's data from prying eyes. Excluding the possibilities of complete reverse engineering, objects cannot be deserialized from persistent streams.
  • .NET CLR 1.x objects can be deserialized into 2.0 objects. Objects of type A can be deserialized to objects of type B etc.

History

OpenNxSerialization 2.0 (August 08, 2008)

Changes in this version include:

  • Arrays and collections serialization is now significantly faster.
  • New surrogates for a lot of built-in types.
  • Support for serialization of containers in the System.Collections.Generic namespace.
  • Support for serialization of BitVector32, BitArray and KeyValuePair objects.
  • Support for serialization of Type objects.
  • Surrogate redirection support now provided.
  • Dynamic (on the fly) surrogates now supported.
  • Major refactoring of the API.
  • Quite a few enhancements and utilities everywhere.

OpenNxSerialization 1.5 (March 12, 2008)

Changes in this version include:

  • NxFormatter now implements IRemotingFormatter.
  • New surrogates for a lot of built-in types.
  • Support for serialization of ISerializable objects.
  • Support for serialization of MarshalByRef objects.
  • Support for generic versions of SerializeAs and DeserializeAs functions.
  • Streaming context can now contain application specific items.
  • Quite a few enhancements and utilities everywhere.

OpenNxSerialization 1.0 (CompactSerialization 2.5) (July 21, 2007)

Once again, thanks to all contributors. Changes in this version include:

  • CompactSerialization 2.5 is now OpenNxSerialization 1.0.
  • Support for multiple instances of TypeSurrogateSelector.
  • Support for SerializeAs and DeserializeAs functions (faster and more compact).
  • Reader does not close the base stream.
  • Support to configure types using a config file.
  • Quite a few enhancements and utilities everywhere.

CompactSerialization 2.0 (May 17, 2006)

This has been possible due to the wonderful feedback I've received. Thanks to all contributors. Changes in this version include:

  • Support for .NET 2.0 Nullable types.
  • Circular and shared references are now handled wisely.
  • Support for permanent/hard type handles.
  • Support for enumerations, SortedList etc.
  • Major refactoring of the internal and public APIs.
  • Improved performance at places, and decreased at places :).

CompactSerialization 1.0 (Feb 15, 2006)

  • Released the initial version of the framework.

License

This article, along with any associated source code and files, is licensed under The GNU General Public License (GPLv3)


Written By
Architect
Pakistan Pakistan
Let a = b ....... (1)
a - b = a - b
a^2 - ab = a^2 - ab
a^2 - ab = a^2 - b^2 (from 1)
a (a - b) = (a + b) (a - b)
a = (a + b) ...... (2)

if a = 1
1 = (1 + 1) (from 1 & 2)
1 = 2 !!

Comments and Discussions

 
AnswerRe: XmlSerialization Pin
.Shoaib3-Jul-08 19:17
.Shoaib3-Jul-08 19:17 
GeneralRe: XmlSerialization [modified] Pin
jboarman7-Jul-08 15:10
jboarman7-Jul-08 15:10 
QuestionRe: XmlSerialization Pin
.Shoaib20-Jul-08 18:53
.Shoaib20-Jul-08 18:53 
AnswerRe: XmlSerialization Pin
jboarman21-Jul-08 6:22
jboarman21-Jul-08 6:22 
GeneralOpenNxSerialization within Shared Cache Pin
roni schuetz25-Apr-08 6:13
roni schuetz25-Apr-08 6:13 
AnswerRe: OpenNxSerialization within Shared Cache Pin
.Shoaib28-Apr-08 21:32
.Shoaib28-Apr-08 21:32 
GeneralSurrogate Emitter Pin
Aaron Jackson11-Apr-08 8:01
Aaron Jackson11-Apr-08 8:01 
AnswerRe: Surrogate Emitter Pin
.Shoaib13-Apr-08 19:48
.Shoaib13-Apr-08 19:48 
That's some great feedback. I really appreciate and value your comments.

As for the dynamic surrogate generation, it is something that i'm already working on. Actually, it's been there since quite a while but I didn't have the time to finish if off, but i'm still open to suggestions. My approach so far has been to have sort of a DynamicSurrogateBuilder that can generate surrogate for a given type. The generated surrogate can then be registered with the framework in the usual way. This can come in handy because you no longer have to implement surrogates or INxSerializable. However, I have always wanted to be able to attach surrogate methods with registered types so as to be able to avoid surrogate lookups and call the serialization code directly. Admittedly, I haven't explored enough to validate the possibility and therefore the performance gains of such a design. But does your comment reinforces the idea that there is a way to use the DynamicMethod to avoid the surrogate lookup? Can you please share your thoughts on that?

As for the remoting part, the "tedious" part is already done [Even in the released version you can find traces of IRemotingFormatter etc.], but somehow the results are not encouraging. Contrary to the belief that remoting performance can be increased, it actually degrades. Again, I face a shortage of time to investigate this apparent anomaly Frown | :(

>> Assuming you just wanted to serialize the payload though how might you approach that problem. If the BinaryFormatter was in the chain wouldn't it serialize the content before you even got a chance at it?

Exactly the issue, but is solved by writing a FormatterSink that takes care of serializing the payload as well as the meta data (remoting objects). You would need to serialize remoting objects so as to be able to get your hands at the payload.
GeneralRe: Surrogate Emitter Pin
Derek Viljoen18-Jul-08 5:08
Derek Viljoen18-Jul-08 5:08 
AnswerRe: Surrogate Emitter Pin
.Shoaib20-Jul-08 18:50
.Shoaib20-Jul-08 18:50 
GeneralRe: Surrogate Emitter Pin
Aaron Jackson23-Jul-08 8:48
Aaron Jackson23-Jul-08 8:48 
GeneralRe: Surrogate Emitter [modified] Pin
Alex_120-Apr-08 20:02
Alex_120-Apr-08 20:02 
GeneralFix the license please Pin
jpmik2-Apr-08 11:29
jpmik2-Apr-08 11:29 
NewsRe: Fix the license please Pin
.Shoaib3-Apr-08 21:34
.Shoaib3-Apr-08 21:34 
GeneralThe download link is wrong Pin
kjetilroe25-Jul-07 23:15
kjetilroe25-Jul-07 23:15 
GeneralIt would be nice if TypeSurrogateSelector wasn't static Pin
patperry11-Jul-07 12:31
patperry11-Jul-07 12:31 
AnswerRe: It would be nice if TypeSurrogateSelector wasn't static Pin
.Shoaib11-Jul-07 19:27
.Shoaib11-Jul-07 19:27 
GeneralCommercial License Pin
Kay Herzam10-Jul-07 1:44
Kay Herzam10-Jul-07 1:44 
AnswerRe: Commercial License Pin
.Shoaib10-Jul-07 2:51
.Shoaib10-Jul-07 2:51 
QuestionCan you verify compatibility with Mono 1.2.4? Pin
Felipe Decroaux28-May-07 3:59
Felipe Decroaux28-May-07 3:59 
AnswerRe: Can you verify compatibility with Mono 1.2.4? Pin
.Shoaib4-Jun-07 0:17
.Shoaib4-Jun-07 0:17 
AnswerRe: Can you verify compatibility with Mono 1.2.4? Pin
patperry11-Jul-07 10:14
patperry11-Jul-07 10:14 
GeneralRe: Can you verify compatibility with Mono 1.2.4? Pin
patperry11-Jul-07 10:17
patperry11-Jul-07 10:17 
Joke1=2, Not so fast :) Pin
Alexandru Lungu30-Apr-07 5:03
professionalAlexandru Lungu30-Apr-07 5:03 
GeneralRe: 1=2, Not so fast :) Pin
bledazemi28-May-07 21:45
bledazemi28-May-07 21:45 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.