In certain computer programming languages, data types are classified as either value types or reference types, where reference types are always implicitly accessed via references, whereas value type variables directly contain the values themselves.
Properties of value types and reference types
Even among languages that have this distinction, the exact properties of value and reference types vary from language to language, but typical properties include:
* Primitive data types, such as Booleans, fixed-size integers, floating-point values, and characters, are value types.
* Objects, in the sense of object-oriented programming, belong to reference types.
* Assigning to a variable of reference type simply copies the reference, whereas assigning to a variable of value type copies the value. This applies to all kinds of variables, including local variables, fields of objects, and array elements. Likewise when calling a function: parameters of reference type are copies of the reference, whereas parameters of value type are copies of the value.
* If a reference type is ''mutable'', then mutations made via one reference are visible via any other, whereas if a value type is ''mutable'', then mutations made to one value are not visible in another.
* Reference types support the notion of ''identity'': it makes sense to discuss whether two values of reference type refer to the same object, and the language provides functionality to determine whether they do. Value types do not support this notion.
* Values of reference type refer to objects allocated in the heap, whereas values of value type are contained either on the call stack (in the case of local variables and function parameters) or inside their containing entities (in the case of fields of objects and array elements). (With reference types, it is only the reference itself that is contained either on the call stack or inside a containing entity.)
* Reference types support the notion of ''subtyping'', whereby all values of a given reference type are automatically values of a different reference type. Value types do not support subtyping, but may support other forms of implicit type conversion, e.g. automatically converting an integer to a floating-point number if needed. Additionally, there may be implicit conversions between certain value and reference types, e.g. "boxing" a primitive (a value type) into an object (a reference type), or reversing this via "unboxing".
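Several of the properties listed above can be illustrated with a short Java sketch. The class and method names below are hypothetical and chosen only for this illustration; the sketch assumes Java's treatment of `int` as a value type and of `List` and `Integer` as reference types.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical demo class; all names are illustrative only.
class ValueVsReferenceDemo {

    // Assigning an int (a value type) copies the value itself.
    static int copiedIntAfterChange() {
        int x = 1;
        int y = x;   // y receives its own copy of the value
        x = 2;       // changing x does not affect y
        return y;
    }

    // Assigning a List variable (a reference type) copies only the reference.
    static int sharedListSizeAfterChange() {
        List<String> first = new ArrayList<>();
        List<String> second = first;  // both variables refer to the same object
        first.add("item");            // the mutation is visible through 'second'
        return second.size();
    }

    // For reference types, == compares identity.
    static boolean sameIdentity() {
        List<String> a = new ArrayList<>();
        List<String> b = new ArrayList<>();
        return a == b;                // two distinct objects
    }

    // equals() on lists compares content instead of identity.
    static boolean sameContent() {
        List<String> a = new ArrayList<>();
        List<String> b = new ArrayList<>();
        return a.equals(b);           // both lists are empty
    }

    // Boxing converts a primitive into an object; unboxing reverses it.
    static int boxAndUnbox(int primitive) {
        Integer boxed = primitive;    // implicit boxing
        Object asObject = boxed;      // subtyping: Integer is a subtype of Object
        return (Integer) asObject;    // cast, then implicit unboxing
    }

    public static void main(String[] args) {
        System.out.println(copiedIntAfterChange());      // 1
        System.out.println(sharedListSizeAfterChange()); // 1
        System.out.println(sameIdentity());              // false
        System.out.println(sameContent());               // true
        System.out.println(boxAndUnbox(42));             // 42
    }
}
```

The two list methods differ only in the comparison used, which is the point of the identity property: the language offers a way to ask whether two references designate the same object, separately from whether the objects have equal content.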
Reference types and "call by sharing"
Even when function arguments are passed using "call by value" semantics (which is always the case in Java, and is the case by default in C#), a value of a reference type is intrinsically a reference; so if a parameter belongs to a reference type, the resulting behavior bears some resemblance to "call by reference" semantics. This behavior is sometimes called ''call by sharing''.
Call by sharing resembles call by reference in the case where a function mutates an object that it received as an argument: when that happens, the mutation will be visible to the caller as well, because the caller and the function have references to the same object. It differs from call by reference in the case where a function assigns its parameter to a different reference; when that happens, this assignment will ''not'' be visible to the caller, because the caller and the function have ''separate'' references, even though both references initially point to the same object.
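The two cases described above can be sketched in Java as follows. The class and method names are hypothetical; `StringBuilder` is used here simply as a convenient mutable reference type.

```java
// Hypothetical demo of call by sharing; all names are illustrative only.
class CallBySharingDemo {

    // Mutates the object the parameter refers to: visible to the caller.
    static void appendSuffix(StringBuilder text) {
        text.append("-mutated");
    }

    // Rebinds the local parameter only: not visible to the caller.
    static void replace(StringBuilder text) {
        text = new StringBuilder("replaced");
        text.append("!"); // affects only the new object, which the caller never sees
    }

    static String afterMutation() {
        StringBuilder sb = new StringBuilder("original");
        appendSuffix(sb);
        return sb.toString();
    }

    static String afterReassignment() {
        StringBuilder sb = new StringBuilder("original");
        replace(sb);
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(afterMutation());     // original-mutated
        System.out.println(afterReassignment()); // original
    }
}
```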
Reference types vs. explicit pointers
Many languages have ''explicit'' pointers or references. Reference types differ from these in that the entities they refer to are ''always'' accessed via references. For example, in C++ it is possible to have either a mutable string object or an explicit pointer to a mutable string (unless it is a null pointer), whereas in Java it is only possible to have a variable that is implicitly a reference to a mutable string (unless it is a null reference).
While C++'s approach is more flexible, use of non-references can lead to problems such as object slicing, at least when inheritance is used; in languages where objects belong to reference types, these problems are automatically avoided, at the cost of removing some options from the programmer.
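As a minimal sketch of why slicing does not arise when objects belong to reference types, consider the following Java example. The `Animal` and `Dog` classes are hypothetical and exist only for this illustration. Assigning a subclass instance to a variable of the superclass type copies the reference, so the object keeps its subclass state and behavior.

```java
// Hypothetical class hierarchy; all names are illustrative only.
class NoSlicingDemo {

    static class Animal {
        String describe() { return "animal"; }
    }

    static class Dog extends Animal {
        String name = "Rex";   // state that exists only in the subclass

        @Override
        String describe() { return "dog named " + name; }
    }

    static String describeViaSupertype() {
        Dog dog = new Dog();
        Animal animal = dog;       // copies the reference; the object is untouched
        return animal.describe();  // dynamic dispatch still reaches Dog.describe
    }

    static boolean sameObject() {
        Dog dog = new Dog();
        Animal animal = dog;
        return animal == dog;      // both variables refer to one object
    }

    public static void main(String[] args) {
        System.out.println(describeViaSupertype()); // dog named Rex
        System.out.println(sameObject());           // true
    }
}
```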
Reference rebinding and aliasing
In most programming languages, it is possible to change the variable of a reference type to refer to another object, i.e. to rebind the variable to another object.
For example, in the following Java code:
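class Foo {
    public int prop;
}
Foo a = new Foo();   // a refers to a new object
Foo b = a;           // b refers to the same object as a
a.prop = 3;          // visible through b as well
a = new Foo();       // a is rebound to a second object
a.prop = 1;          // affects only the second object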
Foo is a reference type, where a is initially assigned a reference to a new object, and b is assigned the same object reference, i.e. bound to the same object as a; therefore, changes made through a are also visible through b. Afterwards, a is assigned a reference to (rebound to) another new object, and now a and b refer to different objects. At the end, a refers to the second object with its prop field having the value 1, while b refers to the first object with its prop field having the value 3.
However, in some languages, such as C++, the term "reference type" is used to mean an alias, and it is not possible to rebind a variable of a reference type once it is created, as it is an alias to the original object.
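struct Foo { int prop; };
Foo a;
a.prop = 1;
Foo &b = a;   // b is an alias of a
Foo c = a;    // c is an independent copy of a
a.prop = 3;   // visible through b, but not through c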
In C++, all non-reference class types have value semantics. In the above example, b is declared to be a reference (alias) of a, and for all purposes, a and b are the same thing. It is impossible to rebind b to become something else. After the above example is run, a and b are the same Foo object with prop being 3, while c is a copy of the original a with prop being 1.
In C#, apart from the distinction between value types and reference types, there is also a separate concept called reference variables.
A reference variable, once declared and bound, behaves as an alias of the original variable, but it can also be rebound to another variable by using the reference assignment operator = ref. The variable itself can be of any type, including value types and reference types. For example, when a variable of a reference type is passed by reference (alias) to a function, the function can change which object the reference-type variable points to, in addition to changing the object itself (if it is mutable).
Immutable data types
If an object is immutable and object equality is tested on content rather than identity, the distinction between value types and reference types is no longer clear, because the object itself cannot be modified, but only replaced as a whole (for a value type) or have its reference pointed to another object (for a reference type). Passing such immutable objects between variables makes no observable difference whether the object is copied or passed by reference, unless the object identity is taken. In a functional programming language where nothing is mutable (such as Haskell), this distinction does not exist at all and becomes an implementation detail.
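Java's `String` class can serve as an illustration of this point, since it is an immutable reference type. In the sketch below (the class and method names are hypothetical), the variables behave as if they held values, and the reference nature of the type only becomes observable when identity is compared with `==`.

```java
// Hypothetical demo class; all names are illustrative only.
class ImmutableStringDemo {

    // "Changing" a String creates a new object and rebinds the variable.
    static String aliasAfterReplacement() {
        String a = "hello";
        String b = a;          // b refers to the same immutable object
        a = a + " world";      // a is rebound to a new String; b is unaffected
        return b;
    }

    // Content equality cannot distinguish a copy from a shared reference.
    static boolean equalByContent() {
        String x = new String("hi");
        String y = new String("hi");
        return x.equals(y);
    }

    // Only taking the identity reveals that there are two objects.
    static boolean equalByIdentity() {
        String x = new String("hi");
        String y = new String("hi");
        return x == y;
    }

    public static void main(String[] args) {
        System.out.println(aliasAfterReplacement()); // hello
        System.out.println(equalByContent());        // true
        System.out.println(equalByIdentity());       // false
    }
}
```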
Classification per language
See also
* Primitive data type
* Composite data type