InterviewJava

15 Java Serialization Interview Questions and Answers for Experienced Developers

10 Mins read
List of Java Serialization Interview Questions and Answers for Experienced Developers
Java Serialization is defined as a process by which an object can be serialized to be stored on a file or sent over the network. It is one of the most important concepts but is often overlooked by the developers. However, Java Serialization is an important topic when it comes to Java interviews.

Any Java interview will have at least 2-3 questions on this topic and if a candidate is not familiar with the subject, he/she is bound to feel uncomfortable due to lack of experience. This post compiles a list of some most important Java Serialization questions which are worth having a look at before going for an interview.

Define Serialization? What do you mean by Serialization in Java?

Serialization is a mechanism by which you can save or transfer the state of an object by converting it to a byte stream. This can be done in Java by implementing the Serializable interface. Serializable is defined as a marker interface which needs to be implemented for transferring an object over a network or persistence of its state to a file. Since its a marker interface, it does not contain any methods. Implementation of this interface enables the conversion of an object into a byte stream and thus can be transferred. The object conversion is done by the JVM using its default serialization mechanism.

Why is Serialization required? What is the need to Serialize?

Serialization is required for a variety of reasons. It is required to send across the state of an object over a network by means of a socket. One can also store an object’s state in a file. Additionally, manipulation of the state of an object as streams of bytes is required. The core of Java Serialization is the Serializable interface. When the Serializable interface is implemented by your class it provides an indication to the compiler that the Java Serialization mechanism needs to be used to serialize the object.

What is the Difference between Externalizable and Serializable Interfaces?

This is one of the top serialization questions that is asked in many big companies to test your in-depth understanding of serialization. Serializable is a marker interface, therefore, you are not forced to implement any methods, however, Externalizable contains two methods readExternal() and writeExternal() which must be implemented.

The Serializable interface provides an inbuilt serialization mechanism to you which can be inefficient at times. However Externalizable interface is designed to give you greater control over the serialization mechanism. The two methods provide you immense opportunity to enhance the performance of specific object serialization based on application needs.

The Serializable interface provides a default serialization mechanism, on the other hand, Externalizable interface instead of relying on default Java Serialization provides flexibility to control this mechanism.

One can drastically improve application performance by implementing the Externalizable interface correctly. However there is also a chance that you may not write the best implementation, so if you are not really sure about the best way to serialize, I would suggest you stick to the default implementation using Serializable interface.

When will you use Serializable or Externalizable interface? and why?

Most of the times when you want to do a selective attribute serialization you can use Serializable interface with a transient modifier for variables not to be serialized. However, the use of the Externalizable interface can be really effective in cases when you have to serialize only some dynamically selected attributes of a large object.

Let’s take an example, Some times when you have a big Java object with hundreds of attributes and you want to serialize only a dozen dynamically selected attributes to keep the state of the object you should use Externalizable interface writeExternal method to selectively serialize the chosen attributes.

In case you have small objects and you know that most or all attributes are required to be serialized then you should be fine with using Serializable interface and use of transient variable as appropriate.

What are the ways to speed up Object Serialization? How to improve Serialization performance?

The default Java Serialization mechanism is really useful, however, it can have a really bad performance based on your application and business requirements. The serialization process performance heavily depends on the number and size of attributes you are going to serialize for an object. Below are some tips you can use for speeding up the marshaling and un-marshaling of objects during Java serialization process.

  • Mark the unwanted or non-Serializable attributes as transient. This is a straight forward benefit since your attributes for serialization are clearly marked and can be easily achieved using the Serializable interface itself.
  • Save only the state of the object, not the derived attributes. Some times we keep the derived attributes as part of the object however serializing them can be costly. Therefore consider calculating them during the de-serialization process.
  • Serialize attributes only with NON-default values. For examples, serializing an int variable with value zero is just going to take extra space however, choosing not to serialize it would save you a lot of performance. This approach can avoid some types of attributes taking unwanted space. This will require the use of Externalizable interface since attribute serialization is determined at runtime based on the value of each attribute.
  • Use the Externalizable interface and implement the readExternal and writeExternal methods to dynamically identify the attributes to be serialized. Some times there can be a custom logic used for serialization of various attributes.

What is a Serial Version UID (serialVersionUID) and why should I use it? How to generate one?

The serialVersionUID represents your class version, and you should change it if the current version of your class is not backward compatible with its earlier versions. This is an extract from Java API Documentation

The serialization runtime associates with each serializable class a version number, called a serialVersionUID, which is used during deserialization to verify that the sender and receiver of a serialized object have loaded classes for that object that are compatible with respect to serialization.

Most of the times, we probably do not use serialization directly. In such cases, I would suggest generating a default serializable UID by clicking the quick fix option in eclipse.

What would happen if the SerialVersionUID of an object is not defined?

If you don’t define serialVersionUID in your serializable class, Java compiler will make one by creating a hash code using most of your class attributes and features. When an object gets serialized, this hash code is stamped on the object which is known as the SerialVersionUID of that object. This ID is required for the version control of an object. SerialVersionUID can be specified in the class file also. In case, this ID is not specified by you, then Java compiler will regenerate a SerialVersionUID based on updated class and it will not be possible for the already serialized class to recover when a class field is added or modified. Its recommended that you always declare a serialVersionUID in your Serializable classes.

Does setting the serialVersionUID class field improve Java serialization performance?

Declaring an explicit serialVersionUID field in your classes saves some CPU time only the first time the JVM process serializes a given Class. However the gain is not significant, In case when you have not declared the serialVersionUID its value is computed by JVM once and subsequently kept in a soft cache for future use.

What are the alternatives to Serialization? If Serialization is not used, is it possible to persist or transfer an object using any other approach?

In case, Serialization is not used, Java objects can be serialized in many ways, some of the popular methods are listed below:

  • Saving object state to the database, this is the most common technique used by most applications. You can use ORM tools (e.g. hibernate) to save the objects in a database and read them from the database.
  • XML based data transfer is another popular mechanism, and a lot of XML based web services use this mechanism to transfer data over the network. Also, a lot of tools save XML files to persist data/configurations.
  • JSON Data Transfer – is recently popular data transfer format. A lot of web services are being developed in JSON due to its small footprint and inherent integration with web browser due to JavaScript format.

What are transient variables? What role do they play in the Serialization process?

The transient keyword in Java is used to indicate that a field should not be serialized. Once the process of de-serialization is carried out, the transient variables do not undergo a change and retain their default value. Marking unwanted fields as transient can help you boost the serialization performance. Below is a simple example where you can see the use of the transient keyword.

class MyVideo implements Serializable
{
    private Video video;
    private transient Image thumbnailVideo;

    private void generateThumbnail()
    {
        // Generate thumbnail.
    }

    private void readObject(ObjectInputStream inputStream)
            throws IOException, ClassNotFoundException
    {
        inputStream.defaultReadObject();
        generateThumbnail();
    }    
}

Why does serialization NOT save the value of static class attributes? Why static variables are not serialized?

The Java variables declared as static are not considered part of the state of an object since they are shared by all instances of that class. Saving static variables with each serialized object would have the following problems

  • It will make a redundant copy of the same variable in multiple objects which makes it inefficient.
  • The static variable can be modified by any object and a serialized copy would be stale or not in sync with the current value.

How to Serialize a collection in java? How to serialize an ArrayList, Hashmap or HashSet object in Java?

All standard implementations of collections List, Set and Map interface already implement java.io.Serializable. All the commonly used collection classes like java.util.ArrayList, java.util.Vector, java.util.Hashmap, java.util.Hashtable, java.util.HashSet, java.util.TreeSet does implement Serializable. This means you do not really need to write anything specific to serialize collection objects. However, you should keep the following things in mind before you serialize a collection object – Make sure all the objects added in the collection are Serializable. – Serializing the collection can be costly therefore make sure you serialize only required data instead of serializing the whole collection. – In case you are using a custom implementation of Collection interface then you may need to implement serialization for it.

Is it possible to customize the serialization process? How can we customize the Serialization process?

Yes, the serialization process can be customized. When an object is serialized, objectOutputStream.writeObject (to save this object) is invoked and when an object is read, ObjectInputStream.readObject () is invoked. What most people do not know is that Java Virtual Machine provides you with an option to define these methods as per your needs. Once this is done, these two methods will be invoked by the JVM instead of the application of the default serialization process. Classes that require special handling during the serialization and deserialization process must implement special methods with these exact signatures:

private void writeObject(java.io.ObjectOutputStream out)
     throws IOException
 private void readObject(java.io.ObjectInputStream in)
     throws IOException, ClassNotFoundException;
 private void readObjectNoData() 
     throws ObjectStreamException;

How can a sub-class of Serializable super class avoid serialization? If the Serializable interface is implemented by the superclass of a class, how can the serialization of the class be avoided?

In Java, if the superclass of a class is implementing the Serializable interface, it means that it is already serializable. Since an interface cannot be unimplemented, it is not possible to make a class non-serializable. However, the serialization of a new class can be avoided. For this, writeObject () and readObject() methods should be implemented in your class so that a Not Serializable Exception can be thrown by these methods. And, this can be done by customizing the Java Serialization process. Below the code that demonstrates it

class MySubClass extends SomeSerializableSuperClass {
private void writeObject(java.io.ObjectOutputStream out)
     throws IOException {
 throw new NotSerializableException(“Can not serialize this class”);
}
 private void readObject(java.io.ObjectInputStream in)
     throws IOException, ClassNotFoundException {
 throw new NotSerializableException(“Can not serialize this class”);
}
 private void readObjectNoData() 
     throws ObjectStreamException; {
 throw new NotSerializableException(“Can not serialize this class”);
}
}

What changes are compatible and incompatible to the mechanism of Java Serialization?

This is one of the difficult and tricky questions and answering this correctly would mean you are an expert in Java Serialization concept. In an already serialized object, the most challenging task is to change the structure of a class when a new field is added or removed. As per the specifications of Java Serialization, the addition of any method or field is considered to be a compatible change whereas changing of class hierarchy or non-implementation of Serializable interface is considered to be a non-compatible change. You can go through the Java serialization specification for the extensive list of compatible and non-compatible changes. If a serialized object needs to be compatible with an older version, it is necessary that the newer version follows some rules for compatible and incompatible changes. A compatible change to the implementing class is one that can be applied to a new version of the class, which still keeps the object stream compatible with an older version of the same class. Some Simple Examples of compatible changes are:

  • Addition of a new field or class will not affect serialization since any new data in the stream is simply ignored by older versions. the newly added field will be set to its default values when the object of an older version of the class is unmarshaled.
  • The access modifiers change (like private, public, protected or default) is compatible since they are not reflected in the serialized object stream.
  • Changing a transient field to a non-transient field is compatible change since it is similar to adding a field.
  • Changing a static field to a non-static field is compatible change since it is also similar to adding a field.

Some Simple Examples of incompatible changes are:

  • Changing implementation from Serializable to Externalizable interface cannot be done since this will result in the creation of an incompatible object stream.
  • Deleting an existing Serializable field will cause a problem.
  • Changing a non-transient field to a transient field is incompatible change since it is similar to deleting a field.
  • Changing a non-static field to a static field is incompatible change since it is also similar to deleting a field.
  • Changing the type of an attribute within a class would be incompatible since this would cause a failure when attempting to read and convert the original field into the new field.
  • Changing the package of the class is incompatible. Since the fully-qualified class name is written as part of the object byte stream.

Java Serialization Interview Questions and Answers
Java serialization is one of the most commonly misunderstood areas. Many developers still think it’s only used for saving objects on the file system.

Hope you found this list useful. Can you think of a question that is not part of this list? Please don’t forget to share it with me in the comments section and I will try to include it in the list.

Article Updates

  • Updated on May 2019: Minor changes and updates to the introduction section. Images are updated to HTTPS.

52 Comments

Leave a Reply

Your email address will not be published. Required fields are marked *