VSadov’s Blog

C# Local Functions vs. Lambda Expressions.

2017-04-08T00:00:00+00:00

C# Local Functions are often viewed as a further enhancement of lambda expressions. While the features are related, there are also major differences.

Local Functions are the C# implementation of the nested function feature. It is a bit unusual for a language to get support for nested functions several versions after supporting lambdas. Usually it is the other way around.

Lambdas, or first-class functions in general, require local variables that are not allocated on the stack and whose lifetimes are tied to the function objects that use them. It is nearly impossible to implement them correctly and efficiently without relying on garbage collection or shifting the burden of variable ownership to the user via solutions such as capture lists. That was a serious blocking issue for some early languages.
A simple implementation of nested functions does not run into such complications, so it is more common for a language to support only nested functions and not lambdas.

Anyway, since C# has had lambdas for a long time, it makes sense to look at Local Functions in terms of their differences from and similarities to lambdas.

Lambda expressions.

Lambda expressions like x => x + x are expressions that abstractly represent a piece of code and how it binds to parameters and variables in its lexical environment. Being an abstract representation of code, a lambda expression cannot be used on its own. In order to use values produced by a lambda expression, it needs to be converted to something more material such as a delegate or an expression tree.

using System;
using System.Linq.Expressions;

class Program
{
    static void Main(string[] args)
    {
        // can't do much with the lambda expression directly
        // (x => x + x).ToString();  // error

        // can assign to a variable of delegate type and invoke
        Func<int, int> f = (x => x + x);
        System.Console.WriteLine(f(21)); // prints "42"

        // can assign to a variable of expression type and introspect
        Expression<Func<int, int>> e = (x => x + x);
        System.Console.WriteLine(e);     // prints "x => (x + x)"
    }
}

There are several things that are worth noting:

lambdas are expressions that produce functional values.
lambda values have unbounded lifetimes: from the execution of the lambda expression for as long as any reference to the value exists. That implies that any local variables used, or “captured”, by the lambda from the enclosing method must be allocated on the heap. Since the lifetime of the lambda value is not limited by the lifetime of the stack frame where it was produced, the variables cannot be allocated on that stack frame.
a lambda expression requires that all external variables used in its body are definitely assigned at the time the lambda expression is executed. The moments of the first and the last use of a lambda are rarely deterministic, so the language assumes that lambda values can be used right after creation and for as long as they are reachable.
As a result, a lambda value must be fully functional at the point of its creation, and all outer variables that it uses must be definitely assigned.

        int x;

        // ERROR: 'x' is not definitely assigned
        Func<int> f = () => x;

lambdas do not have names and cannot be referred to symbolically. In particular lambda expressions cannot be declared recursively.

NOTE: It is possible to make a recursive lambda by invoking a variable to which the lambda is assigned or by passing to a higher-order method which self-applies its parameter (see: Anonymous Recursion in C#), but that does not make such expressions truly self-referential.
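As a small illustration of the points above, here is a minimal sketch (the names MakeCounter, next and fac are made up for this example). It shows a captured local outliving the method that declared it, and the "recursion through a variable" trick mentioned in the note. Note that fac has to be assigned before the lambda is created, exactly because of the definite assignment rule.

```csharp
using System;

class Program
{
    // Returns a delegate that keeps using 'count' after MakeCounter has returned,
    // so 'count' cannot live in MakeCounter's stack frame.
    static Func<int> MakeCounter()
    {
        int count = 0;
        return () => ++count;
    }

    static void Main()
    {
        Func<int> next = MakeCounter();
        Console.WriteLine(next()); // prints "1"
        Console.WriteLine(next()); // prints "2"

        // "Recursion" through a variable: the lambda invokes whatever
        // 'fac' refers to at the time of the call, not itself by name.
        Func<int, int> fac = null;
        fac = n => n <= 1 ? 1 : n * fac(n - 1);
        Console.WriteLine(fac(5)); // prints "120"
    }
}
```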

Local functions.

A local function is basically just a method declared inside another method, as a way of limiting the method’s visibility to the scope in which it is declared.

Naturally, the code in a local function has access to everything that is accessible in its containing scope: local variables, the enclosing method’s parameters, type parameters, and other local functions. A notable exception is the outer method’s labels, which are not visible in a local function. That is just normal lexical scoping and it works the same as in lambdas.

public class C
{
    object o;

    public void M1(int p)
    {
        int l = 123;

        // lambda has access to o, p, l,
        Action a = ()=> o = (p + l);
    }

    public void M2(int p)
    {
        int l = 123;

        // Local Function has access to o, p, l,
        void a()
        {
          o = (p + l);
        }
    }
}

The obvious difference from lambdas is that local functions have names and can be used without any indirection. Local functions can be recursive.

static int Fac(int arg)
{
    int FacRecursive(int a)
    {
        return a <= 1 ?
                    1 :
                    a * FacRecursive(a - 1);
    }

    return FacRecursive(arg);
}

The main semantic difference from lambda expressions is that local functions are not expressions, they are declaration statements. Declarations are very passive entities when it comes to code execution. In fact, declarations do not really get “executed”. Similarly to other declarations like labels, local function declarations simply introduce the functions into the containing scope without running any code.

What is more important is that neither declarations by themselves nor regular invocations of a nested function result in an indefinite capture of the environment. In simple and common cases, like an ordinary invoke/return scenario, the captured locals do not need to be heap-allocated.

Example:

public class C
{    
    public void M()
    {
        int num = 123;

        // has access to num
        void  Nested()
        {
           num++;
        }

        Nested();

        System.Console.WriteLine(num);
    }
}

The code above is emitted as a rough equivalent of the following (decompiled):

public class C
{
  // A struct to hold "num" variable.
  // We are not storing it on the heap,
  // so it does not need to be a class
  private struct <>c__DisplayClass0_0
  {
      public int num;
  }

  public void M()
  {
      // reserve storage for "num" in a display struct on the _stack_
      C.<>c__DisplayClass0_0 env = default(C.<>c__DisplayClass0_0);

      // num = 123
      env.num = 123;

      // Nested()
      // note - passes env as an extra parameter
      C.<M>g__a0_0(ref env);

      // System.Console.WriteLine(num)
      Console.WriteLine(env.num);
  }

    // implementation of "Nested()".
    // note - takes env as an extra parameter
    // env is passed by reference so its instance is shared
    // with the caller "M()"
    internal static void <M>g__a0_0(ref C.<>c__DisplayClass0_0 env)
    {
        env.num += 1;
    }
}

Note that the code above calls the implementation of “Nested()” directly (not via a delegate indirection) and does not introduce an allocation of display storage on the heap (as a lambda would have). The locals are stored in a struct instead of a class. The lifetime of num was not altered by its use in Nested(), so it can still be allocated on the stack. M() could just pass num by reference, but the compiler uses a struct for packaging, so that it can pass all locals like num using just one env parameter.

Another interesting point is that Local Functions can be used as long as they are visible in a given scope. This is an important fact that makes recursive and mutually recursive scenarios possible. That also makes the exact location of the local function declaration in the source largely unimportant.

For example, all the variables of the enclosing method that a Local Function reads must be definitely assigned at the point of its invocation, not at its declaration. Indeed, imposing that requirement at the declaration would not do any good if an invocation can happen earlier.

public void M()
{
    // error here -
    // Use of unassigned local variable 'num'
    Nested();

    int num;

    // whether 'num' is assigned here or not is irrelevant
    void  Nested()
    {
       num++;
    }

    num = 123;

    // no error here - 'num' is assigned
    Nested();

    System.Console.WriteLine(num);
}

Also, if a local function is never used, it is no better than a piece of unreachable code, and any variable that it would otherwise use does not need to be assigned.

public void M()
{        
    int num;

    // warning - Nested() is never used.
    void  Nested()
    {
       // no errors on unassigned 'num'.
       // this code never runs.
       num++;
    }
}
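The mutually recursive scenario mentioned earlier can be sketched as follows. This is only an illustration with made-up names (IsEven, Even, Odd), and it assumes a non-negative argument. Even can call Odd even though Odd is declared further down, because both are visible throughout the enclosing scope.

```csharp
using System;

class Program
{
    // Assumes n >= 0; a negative argument would recurse without terminating.
    static bool IsEven(int n)
    {
        // Even refers to Odd, which is declared on the next line.
        bool Even(int k) => k == 0 || Odd(k - 1);
        bool Odd(int k) => k != 0 && Even(k - 1);

        return Even(n);
    }

    static void Main()
    {
        Console.WriteLine(IsEven(10)); // prints "True"
        Console.WriteLine(IsEven(7));  // prints "False"
    }
}
```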

So, what is the purpose of Local Functions?

The main value proposition of local functions, in comparison to lambdas, is that local functions are simpler, both conceptually and in terms of run time overhead.

Lambdas serve their role as first-class functions very well, but sometimes you only need a simple helper. A lambda assigned to a local variable could do the job, but there is the overhead of indirection, allocation of a delegate and possibly a closure. A private method works too and is cheaper to call, but there is an issue with encapsulation, or lack thereof. Such a helper would be visible to everyone in the containing type. Too many helpers like this can result in a serious mess.

A Local Function fits this scenario nicely. The overhead of calling a Local Function is comparable with a call to a private method, but there is no issue with polluting the containing type with a method that nothing else should call.
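To make the comparison concrete, here is a minimal sketch of the same helper written both ways. The method and helper names are invented for this example, and the sketch makes no claims about measured performance.

```csharp
using System;

class Program
{
    static int SumOfSquares(int[] values)
    {
        // Helper as a local function: called directly,
        // visible only inside SumOfSquares.
        int Square(int v) => v * v;

        int sum = 0;
        foreach (int v in values)
        {
            sum += Square(v);
        }
        return sum;
    }

    static int SumOfSquaresWithLambda(int[] values)
    {
        // Same helper as a lambda: it needs a delegate type
        // and every call goes through the delegate.
        Func<int, int> square = v => v * v;

        int sum = 0;
        foreach (int v in values)
        {
            sum += square(v);
        }
        return sum;
    }

    static void Main()
    {
        int[] data = { 1, 2, 3 };
        Console.WriteLine(SumOfSquares(data));           // prints "14"
        Console.WriteLine(SumOfSquaresWithLambda(data)); // prints "14"
    }
}
```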

C# Tuples. Conversions.

2017-02-11T00:00:00+00:00

In a statically typed language like C#, every new kind of type or a new expression needs to define how it fits into the framework of type conversions. Tuples are not an exception.

Truth be told, initially it was believed that it would be better for tuples to have only very limited support for conversions. That was mostly out of fear that a composite type such as a tuple would run into contradictory scenarios with conversion classification. For example, if (int, object) needs to convert to (object, int), do we have an implicit or explicit conversion or something in between? Is it boxing, or unboxing, or both?
Forcing the user to deconstruct/reconstruct into a tuple of the desired type would avoid the issues, but it was quickly found to be inconvenient.

“Distributing” behavior of tuple conversions.

The overall guiding principle for tuple conversions is that tuple conversions are composite conversions consisting of N underlying conversions, one per element, and classification questions are “distributed” to the underlying conversions, which themselves could be tuple conversions, and in such case the classification is recursive.

This uncomplicated principle allows tuple conversions to be a relatively low-friction feature, but the design has some interesting details.

Tuple Literal Conversions and Target Typing.

C# distinguishes conversions from expression and conversions from type.

Conversions from expression are used when expression results are coerced to be of a particular type.

// conversion from expression is used to turn int into long
long x = int.MaxValue;  

Conversions from type are used in analysis that operates with types - like when determining the best overload resolution candidate.

void M1(int val) => Console.WriteLine("int");
void M1(long val) => Console.WriteLine("long");

// overload resolution analyses parameter types of the applicable candidates
// M1(int) is selected.
// An implicit conversion from `int` to `long` makes it "better"
M1(short.MaxValue);

Existence of a conversion from a type generally entails that similar conversion from an expression of such type to the same target type also exists. The opposite is not always true though.
There are several reasons for the distinction:

some expressions do not have a natural type at all.
Example: (x)=>x does not have any type on its own, but converts to Func<int, int>
some expressions have a natural type, but their value fits other types.
Example: 42 has type int, but is implicitly convertible to byte, even though the type int is not.
some types have special behaviors.
Example: expressions of type dynamic implicitly convert to any type, but the type dynamic does not have such conversions.

Indeed, since most types are implicitly convertible to dynamic, having an equal conversion the other way would make an overload that takes dynamic always ambiguous. An implicit conversion from a dynamic expression, though, just means that conversions of dynamic values are statically acceptable, with the actual conversions happening at run time.
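The three cases listed above can be put side by side in a short sketch. The variable names are arbitrary and the snippet only illustrates which assignments the compiler accepts.

```csharp
using System;

class Program
{
    static void Main()
    {
        // No natural type, but the expression converts to a delegate type.
        Func<int, int> f = x => x;

        // The constant expression 42 fits into byte, so it converts implicitly.
        byte b = 42;

        // The type int has no implicit conversion to byte; a cast is required.
        int i = 42;
        // byte b2 = i;     // error: cannot implicitly convert 'int' to 'byte'
        byte b2 = (byte)i;

        // An expression of type dynamic converts implicitly;
        // the actual conversion is resolved at run time.
        dynamic d = 42;
        long l = d;

        Console.WriteLine(f(b) + b2 + l); // prints "126"
    }
}
```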

Tuple conversions transparently have the same distinction. There are tuple conversions that exist only from tuple literals, but not from tuple types. That happens when there are conversions from the argument expressions of the literal to the target element types, but not from the types of those arguments.

Examples of tuple literal conversions:

// the RHS tuple literal does not have a natural type at all
// because some of the argument expressions do not have a type.
// Yet, it is implicitly convertible to the LHS type
// because every argument _expression_ is implicitly convertible
(Func<int, int>, string, object) t1 = ((x)=>x, null, 1);

// RHS has natural type (int, (int, int)),
// but is implicitly convertible to (byte, (short, object))
// because element-wise implicit conversions from argument expressions exist.
(byte, (short, object)) t2 = (1, (2, 3));

Target-typing and evaluation order

Conversion of a literal to the target type is often called ‘target-typing’. That is because the RHS is never materialized in its natural type, instead an instance of the target type is directly created from the RHS value. Indeed, the RHS may not even have a natural type so an instance of such type would not be possible to create.

All the same rules apply to tuple literal conversions, just in a “distributed” manner.

Example:

(Func<int, int>, string) x = ((x)=>x, null); // is the same as
(Func<int, int>, string) x = ((Func<int, int>)((x)=>x), (string)null);

(byte, short) y = (1, 2);  // is the same as
(byte, short) y = ((byte)1, (short)2);

The evaluation order of target typing in tuple literals is observable when both arguments and conversions have side effects:

using System;

class C
{
    static void Main()
    {
        // literal tuple conversion is "distributed" to the arguments of the tuple.
        // I.E. every argument is individually target-typed.
        // An instance of (int, int, int) is never created.
        //
        // This is very similar to a constructor call:
        // (C1 a, C1 b, C1 c) t = new ValueTuple<C1,C1, C1>(NextInt(), NextInt(), NextInt());
        //
        (C1 a, C1 b, C1 c) t = (NextInt(), NextInt(), NextInt());

        Console.WriteLine("result: " + t);
    }

    private static int i;
    static int NextInt()
    {
        Console.WriteLine("produced: " + i);
        return i++;
    }
}

class C1
{
    private int val;

    public C1(int val) => this.val = val;

    public static implicit operator C1(int arg)
    {
        Console.WriteLine("converted: " + arg);
        return new C1(arg);
    }

    public override string ToString()
    {
        return val.ToString();
    }
}

prints:

produced: 0
converted: 0
produced: 1
converted: 1
produced: 2
converted: 2
result: (0, 1, 2)

Implicit and Explicit conversions

Tuple conversions can be implicit or explicit.
Naturally, a tuple type/expression:

has an implicit tuple conversion to the target type if all elements have implicit conversions.
has an explicit tuple conversion to the target type if all elements have explicit conversions.

Example:

static void Main(string[] args)
{
    (int, int) ti = (1, 1);

    // type `int` has implicit conversion to `dynamic`, so this works
    (dynamic, dynamic) td = ti;

    // `dynamic` type has _explicit_ conversion to `int`, so this works
    (int, int) ti1 = ((int, int))td;

    // Method((long, long)) is preferred
    // since `(long, long)` is implicitly convertible to `(dynamic, dynamic)`,
    // but `(dynamic, dynamic)` has no implicit conversion to `(long, long)`
    Method(ti);
}

static void Method((long, long) ll){}

static void Method((dynamic, dynamic) dd){}

The principle that a conversion exists when the underlying conversions exist is very similar to lifted conversions in the case of nullable types. As long as T converts to U, the same conversion exists between T? and U?. The main difference for tuples is that they have more than one underlying conversion, and classification of the overall tuple conversion is performed conservatively based on all the underlying conversions.
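A minimal sketch of this conservative classification, using plain numeric conversions (the variable names are arbitrary): one element that needs an explicit conversion is enough to make the whole tuple conversion explicit.

```csharp
using System;

class Program
{
    static void Main()
    {
        (int, int) ti = (1, 2);

        // Both elements convert implicitly (int -> long),
        // so the tuple conversion is implicit.
        (long, long) tl = ti;

        // long -> int is explicit, so the tuple conversion needs a cast.
        (int, int) back = ((int, int))tl;

        // Mixed case: int -> long is implicit, long -> int is explicit.
        // The tuple conversion as a whole is explicit.
        (int, long) mixed = (3, 4L);
        // (long, int) swapped = mixed;          // error without a cast
        (long, int) swapped = ((long, int))mixed;

        Console.WriteLine(tl);      // prints "(1, 2)"
        Console.WriteLine(back);    // prints "(1, 2)"
        Console.WriteLine(swapped); // prints "(3, 4)"
    }
}
```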

It is actually possible for a conversion to be both lifted into nullable and into tuple conversions.

Example of a conversion lifted into a nullable, a tuple and then a nullable conversion again:

(int?, int?)? nubTupleOfNubs = (1, 1);

// `int` has implicit conversion to `long`, thus
// `(int?, int?)?` has implicit conversion to `(long?, long?)?`
(long?, long?)? td = nubTupleOfNubs;

Tuple Conversions are Standard Conversions, unconditionally.

User-defined conversions are, perhaps, the most complicated aspect of C# conversions.

To define composition with user-defined operators, the C# language has a concept of Standard Conversions. Standard Conversions are specially privileged conversions - they can “stack” with user-defined conversion operators to form user-defined conversions. The reason for the existence of such a set of conversions is to widen the applicability of user-defined conversions to more cases than those covered by the operator. The reason for the set to be small, and in particular to not include user-defined conversions, is to limit the number of combinations that can result in a conversion.

For example, if there is a user-defined conversion operator from type C1 to byte, then an instance of type C1 is also convertible to short. Since there is a standard conversion from byte to short, the compiler can stitch one user-defined operator and one standard conversion into a user-defined conversion from C1 to short:

C1 --- [implicit user defined operator] ---> byte --- [implicit numeric conversion] ---> short

Note that the chain of conversions is never longer than 2 - one user-defined operator and, optionally, one standard conversion on either end. With such constraints the algorithm for finding conversion chains stays fairly simple.
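The byte-to-short case can be tried out with a small sketch. The type Level below is hypothetical and exists only for this illustration. It defines a single operator to byte, yet an assignment to short compiles too.

```csharp
using System;

// 'Level' is a hypothetical type used only for this illustration.
class Level
{
    private readonly byte value;

    public Level(byte value) => this.value = value;

    // The only user-defined operator: Level -> byte.
    public static implicit operator byte(Level arg) => arg.value;
}

class Program
{
    static void Main()
    {
        var level = new Level(7);

        byte b = level;   // uses the operator directly
        short s = level;  // operator, then the standard byte -> short conversion

        Console.WriteLine(b + s); // prints "14"
    }
}
```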

Consider that we are looking for a conversion from T1 to T2. Since any user-defined operator involved would need to be defined in either T1 or T2, these are the only types we would look into. We would collect all the user-defined operators defined in these types that convert from T1 or to T2. Now, for those operators that go “half way” - from T1 to S1 or from S2 to T2 - we would look for a standard conversion that would “complete” the conversion - from S1 to T2 or from T1 to S2. If exactly one such conversion is found, then we can build a conversion from T1 to T2. If more than one is found, then we have an ambiguity.

The point is that the search space has a strict upper bound. If, for example, the conversion that stacks with a user-defined operator could be another user-defined conversion, we would need to look at potentially endless chains of conversions involving an unlimited number of intermediate types.

The question is whether tuple conversions belong to “The Exclusive Club of Standard Conversions” or not. It was decided that tuple conversions are, in fact, standard conversions.

The convenience is obvious - if, for example, there is a user-defined implicit conversion operator from C1 to (int, int), then we can implicitly convert C1 to (long, long) as well.

C1 --- [implicit user defined op.] ---> (int, int) --- [implicit tuple conv.] ---> (long, long)
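As a sketch of the chain above, assuming the behavior described in this post: the hypothetical type Point defines only an operator to (int, int), and the implicit tuple conversion to (long, long) is stacked on top of it.

```csharp
using System;

// 'Point' is a hypothetical type used only for this illustration.
class Point
{
    private readonly int x;
    private readonly int y;

    public Point(int x, int y)
    {
        this.x = x;
        this.y = y;
    }

    // The only user-defined operator: Point -> (int, int).
    public static implicit operator (int, int)(Point p) => (p.x, p.y);
}

class Program
{
    static void Main()
    {
        var p = new Point(3, 4);

        (int, int) a = p;    // uses the operator directly
        (long, long) b = p;  // operator, then the implicit tuple conversion

        Console.WriteLine(a); // prints "(3, 4)"
        Console.WriteLine(b); // prints "(3, 4)"
    }
}
```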

A curious part is that tuple conversions are standard conversions regardless of whether their underlying conversions are standard or not. The underlying conversions could even themselves be user-defined conversions.
This is a case where the conversion classification is not “distributed” to the underlying conversions. It turns out that for the purpose of limiting the search space, such a requirement is unnecessary. At the top we still have a chain of conversions no longer than 2, and the underlying element conversions, even if user-defined, cannot nest indefinitely, because tuples cannot nest indefinitely.

It does, however, allow for some interesting scenarios.

Example:
(implicit expanding into nested tuples of any level)

using System;
class C
{
    static void Main()
    {
        C1 y = new C1();   

        // `C1` converts to `(byte, C1)`, and thus to `(int, C1)` too.
        // `C1` converts to `(byte, byte)`, and thus to `(int, int)` too.
        // as a result `C1` converts to  types like `(int, (int, ...))`
        // regardless of how deeply they are nested
        (int, (int, (int, (int, (int, (int, (int, (int, (int, (int, (int, int))))))))))) x12 = y;
        System.Console.WriteLine(x12);
    }

    class C1
    {
        private byte x;

        static public implicit operator (byte, C1)(C1 arg)
        {
            return ((byte)(arg.x++), arg);
        }

        static public implicit operator (byte c, byte d)(C1 arg)
        {
            return ((byte)(arg.x++), (byte)(arg.x++));
        }
    }
}

prints:
(0, (1, (2, (3, (4, (5, (6, (7, (8, (9, (10, 11)))))))))))

Tuple conversions and extension methods

Another interesting example of “distributed” conversion classification in tuples involves applicability checks for extension method receivers.

Generally an expression is acceptable as a receiver of an extension method call if the extension method targets the type of that expression or any of its base types or implemented interfaces.

From a more formal point of view an expression is applicable as a receiver of an extension method if it is convertible to the type of the instance parameter via:

identity conversion
implicit reference conversion
implicit boxing conversion

Based just on that, an extension method defined on object would be applicable to an expression of type (int[], int[]). However, an extension defined on (IEnumerable<int>, IEnumerable<int>) would not be applicable. Early users of the feature indicated that such a limitation is unexpected and inconvenient (see bug 16159).

The solution was to add implicit tuple conversions to the set of allowed instance conversions, but require that all underlying element conversions are valid instance conversions. That is, the instance conversion rule became distributed and recursive in the case of tuples.

Examples:

using System;
using System.Collections.Generic;

class C
{
    static void Main()
    {
        // ok,  
        // `string` has implicit reference conversion to `IEnumerable<char>`
        ("hello", "hi").M1();

        // ok
        // `int` has implicit boxing conversion to `object`
        (1, (2, 3)).M2();

        // ok
        // the first element is convertible as a whole
        // the second element is convertible recursively
        (("hi", "hello"), (2, 3)).M2();
    }
}

static class C1
{
    public static void M1(this (IEnumerable<char>, IEnumerable<char>) arg)
    {
        Console.WriteLine("M1");
    }

    public static void M2(this (object, (object, object)) arg)
    {
        Console.WriteLine("M2");
    }
}

Why so complicated?

Conversions are a pervasive and complicated aspect of the language. Some degree of complexity is unavoidable when a feature needs to work with conversions and behave in a consistent and predictable manner.

Integration with conversions is often cited as contributing a good portion of the famous “minus 100 points” penalty that applies to every new language feature and needs to be balanced out with benefits.

C# Tuples. More about element names.

2017-01-28T00:00:00+00:00

C# tuples can have optional element names. Here are some interesting details about tuple element names and how they are treated by the language.

The matter of allowing named elements was a major choice in the design of C# tuples. It was definitely attractive to allow element names when tuples are used in APIs.

(int CustomerID, int Orders) GetRecord(){...}

is clearly more descriptive and less error prone than

// NOTE: the first element is CustomerID, second is Orders
(int, int) GetRecord(){...}

On the other hand names could become an obstacle when implementing abstract operations that operate with tuples.
If a dictionary factory is implemented in terms of Key and Value tuples, would it work with Customers and Orders?

What about completely generic algorithms?
If I have (int X, int Y) and int Z, can I apply the following?

(T, U, V) Append<T, U, V>((T, U) tu, V v) => (tu.Item1, tu.Item2, v);

If users can’t use tuples in generic/abstract scenarios just because of the element names, they’d be inclined to avoid the names altogether making the whole support of names questionable.

C# designers wanted to have the expressiveness of the names, but also to make sure that names do not “stand in the way” when tuples are used as structural types. So the guiding principle was set to be:

Element names are semantically insignificant except when used directly.

The tuple types with element names are really the same as ones without. The only addition is the presence of “friendly names”.
In particular, all tuple elements have the default names Item1, Item2, ..., ItemN, even those that have “friendly” element names. It is allowed for friendly names to be the same as the default names, but only as long as they are in the right position.

// Item2 causes an error here, since it is in a wrong position.
// Item2 name is essentially already taken by the element #2
(int Item1, int X, int Item2) v;
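A short sketch of the “friendly names” idea, with arbitrary element names chosen for the example: the default names are always available, and tuple types that differ only in names can be assigned to each other.

```csharp
using System;

class Program
{
    static void Main()
    {
        (int X, int Y) point = (1, 2);

        // The friendly name and the default name refer to the same element.
        Console.WriteLine(point.X == point.Item1); // prints "True"

        // Tuple types that differ only in element names
        // are the same type as far as assignment is concerned.
        (int Width, int Height) size = point;
        Console.WriteLine(size.Height);            // prints "2"
    }
}
```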

Another consequence is that overloaded methods whose signatures differ only in tuple element names are disallowed.

public class C
{
    public void Ext((int X, int Y) arg){}

    // error CS0111: Type 'C' already defines a member called 'Ext' with the same parameter types
    public void Ext((int V, int W) arg){}
}

Conversely - overload resolution will not consider element names when selecting the target of an invocation.
The following call is ambiguous since, ignoring element names, both Ext methods have the same signatures.

public class C
{
    public void M()
    {
        var v = default((int X, int Y));

        // error CS0121: The call is ambiguous between the following. . .
        v.Ext();
    }
}

static class Ext1
{
    public static void Ext(this (int X, int Y) arg){}
}

static class Ext2
{
    public static void Ext(this (int V, int W) arg){}
}

The dynamic type of a tuple variable is just the underlying ValueTuple.

Essentially the “tuple” part of these types, including their element names, is a compile-time decoration that the compiler understands, uses and propagates through expressions.

The erasure of tuple-related information can be observed by checking the type of boxed instances or the static type as tracked by the CLR type system.

class Program
{
    static void Main(string[] args)
    {
        // tuple instances do not know they are tuples
        object instance = (Alice: 1, Bob: 2);
        System.Console.WriteLine(instance.GetType());

        // CLR does not track tuple types either.
        PrintStaticType((Alice: 1, Bob: 2));
    }

    static void PrintStaticType<T>(T arg)
    {
        System.Console.WriteLine(typeof(T));
    }
}

The output is:

   System.ValueTuple`2[System.Int32,System.Int32]
   System.ValueTuple`2[System.Int32,System.Int32]

Representing element names in metadata

Since CLR types themselves do not store tuple information, the compiler emits extra information to specify tuple element names in member signatures.
The encoding is rather simple - TupleElementNamesAttribute contains an array of element name strings in the pre-order depth-first traversal order of the parts of the corresponding type. Basically, when you go through the type declaration, every tuple element consumes one string from the attribute. If no tuple element names are present, the attribute does not need to be emitted.

Example:

// "C" and "F" are intentionally missing - will be encoded as "null" strings.
static Dictionary<(int A, int B), (int, int D)?> Test((int[] E, int)[] arg)
{
    return null;
}

Emitted as an equivalent of:

[return: TupleElementNames(new string[]{"A","B",null,"D"})]
private static Dictionary<ValueTuple<int, int>, ValueTuple<int, int>?> Test
    ([TupleElementNames(new string[]{"E",null})] ValueTuple<int[], int>[] arg)
{
    return null;
}

As explained in an earlier post, ValueTuple types that match a tuple pattern are promoted into corresponding tuple types during metadata import. In addition to that, the element names are “rehydrated” from a TupleElementNames attribute, if one is specified for the given part of a member signature.
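The emitted names can also be inspected with reflection. The following is a minimal sketch that reuses the GetRecord signature from the beginning of this post with a made-up body. It reads the attribute from the return parameter and prints the names it carries.

```csharp
using System;
using System.Reflection;
using System.Runtime.CompilerServices;

class Program
{
    // The body is made up; only the signature matters here.
    static (int CustomerID, int Orders) GetRecord() => (1, 2);

    static void Main()
    {
        MethodInfo method = typeof(Program).GetMethod(
            nameof(GetRecord),
            BindingFlags.NonPublic | BindingFlags.Static);

        // The attribute is attached to the return parameter,
        // not to the method itself.
        TupleElementNamesAttribute names =
            method.ReturnParameter.GetCustomAttribute<TupleElementNamesAttribute>();

        // prints "CustomerID, Orders"
        Console.WriteLine(string.Join(", ", names.TransformNames));
    }
}
```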

Note that in terms of cross-language interoperability, understanding TupleElementNames attribute or the tuple encoding pattern is optional.
If the consuming language does not care about element names (like F#), it can ignore the attribute and just see the signature with “nameless” tuples. If the consuming language does not understand tuples at all (like C# 6), it can still interoperate by using ValueTuple structs.

Compile time propagation of tuple types

Note that compile time propagation of the tuple types can go quite far, including through the generic type inference. At compile time the tuple types are “real types”.

Example of a tuple type with element names propagated through several levels of type inference:

static void Main(string[] args)
{
    // The only argument with a natural type is the "42"
    // T infers its type from "42"
    // U has dependency on T which is resolved via lambda inference once we know T
    // U[] is the return type of Apply and is known once we know U
    // type of 'r' is inferred to be the same as the return type of 'Apply'
    var r = Apply(42, (val) => (Alice: val, Bob: val.ToString()));

    // As a result
    // r has type: (int Alice, string Bob)[]
    Console.WriteLine(r[0].Alice);
    Console.WriteLine(r[0].Bob);
}

static U[] Apply<T, U>(T arg, Func<T, U> f)
{
    return new U[] { f(arg) };
}

The element names are not always involved in the inference. In scenarios where tuple arguments match tuple parameters of the same cardinality, the inference works in a purely structural way and element names are ignored.

Surely, when type parameters are inferred from the argument element types, the names of those elements cannot take part in that.

static void Main(string[] args)
{
    var v = (Alice: "hi", Bob: "there");

    // T is inferred to be 'string'
    // so is the type of r
    var r = Test(v).Result;
    Console.WriteLine(r.ToUpper());

    Console.WriteLine(Append(t: (1, 2), third: 3));
}

// T is inferred from the first element type of the argument tuple
static async Task<T> Test<T, U>((T, U) arg)
{
    // just await something
    await Task.Yield();

    return arg.Item1;
}

// T, U are inferred from element types of 2-tuple argument
// and used as element types of 3-tuple result
// element names are unrelated and unimportant for inference purposes here
static (T First, U Second, V Third) Append<T, U, V>((T First, U Second) t, V third)
{
    return (t.First, t.Second, third);
}

Tuple type merging and dropping of element names

When inferring tuple names from multiple sources, a situation may arise where multiple names for the same element would be inferred. In such a case these names are “dropped”, leaving the corresponding tuple element unnamed.

Indeed, there are only two design choices here - drop conflicting names or make the whole scenario an error. However making it an error would contradict the idea that presence of element names is semantically insignificant.

static void Main(string[] args)
{
    var x = (Alice: "hi", Bob: "there");
    var y = (Alpha: "bye", Beta: "bye");

    // T is inferred to be
    //    (string Alice, string Bob)  and also
    //    (string Alpha, string Beta)
    //
    // To resolve apparent ambiguity conflicting names are dropped.
    // T is just: (string, string)
    var z = OneOrAnother(x, y, DateTime.Now.DayOfWeek == DayOfWeek.Friday);

    // this would be an error
    // Console.WriteLine(z.Alice);

    // this is still ok
    Console.WriteLine(z.Item1);

    var x1 = (Alice: "bye", Todd: "bye");

    // only ambiguous names are dropped
    // z1 has type:  (string Alice, string)
    var z1 = DateTime.Now.DayOfWeek == DayOfWeek.Friday ?
                    x :
                    x1;

    // this is ok
    Console.WriteLine(z1.Alice);

}

// T is inferrable from both x and y
static T OneOrAnother<T>(T x, T y, bool flag)
{
    return flag ? x : y;
}

Can element names become “semantically significant” through lambda inference?

There is an interesting scenario which seemingly demonstrates that element names can have an effect on overload resolution when combined with lambda inference. The example below is able to steer overload resolution to one of the candidates by using specific tuple element names.
However, on closer examination, the element names are actually used directly in this scenario, so of course they make a difference. It is not a case where two tuple types compete for better applicability; it is a case where two reified lambdas compete, and one of them would have compile errors.

static void Main(string[] args)
{
    // calls the first Select -
    // the only case where ".Bob" would not be an error
    var r = Select(1, 2, t => t.Bob);

    // ambiguity error: lambda can be applied in either case
    // var r1 = Select(1, 2, t => t.Alice);
}

delegate TResult Selector1<TArg, TResult>(TArg arg);

static T Select<T>(T x, T y, Selector1<(T Alice, T Bob), T> selector)
{
    Console.WriteLine("first overload");
    return selector((x, y));
}

delegate TResult Selector2<TArg, TResult>(TArg arg);

static T Select<T>(T x, T y, Selector2<(T Alice, T Todd), T> selector)
{
    Console.WriteLine("second overload");
    return selector((x, y));
}

Diagnostics on element name mismatches

Considering how easily element names can be cast aside, the language designers had concerns that the compiler would be less than helpful against certain kinds of mistakes. Some name mismatches could be indicative of confusion or a typo.

static void Main(string[] args)
{
    // Warning!!
    //
    // "Boook" is ignored. Likely a typo.
    M1((Boook: 1, Chapter: 2));

    // Warnings!!
    //
    // "First" and "Last" are mismatched causing both to be dropped.
    // That is highly suspicious
    var r = DateTime.Now.DayOfWeek == DayOfWeek.Friday ?
                (ID: 1, First: "F", Last: "L") :
                (ID: 2, Last: "L", First: "F");

}

static void M1((int Book, int Chapter) arg)
{
    // . . .
}

While the language is pretty clear on the semantics of the above samples, the code is likely to be unintentional.

Determining scenarios that result in warnings is not an easy task. The scenarios must be much more likely a result of an error than not. In addition there should be reasonable and obvious ways to fix the violations. In the initial release the warnings are produced under the following conditions:

It is an identity conversion from a tuple literal.
Some names specified in the literal are dropped as a result of conversion.

The mistake in such scenarios is fairly clear - the name is explicitly specified and immediately ignored due to a mismatch - that is at the very least redundant. The most trivial fix is to just change the name to match the destination or to remove it entirely.
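Both trivial fixes can be sketched as follows. This is only an illustration: M1 is changed to return a value so that there is something to print, and the last call is included on the assumption that, per the conditions listed above, names dropped from a variable (not a literal) do not produce the warning.

```csharp
using System;

public static class NameMismatchFixes
{
    // Same parameter type as M1 in the sample above.
    // Returning an int is an assumption made just for this sketch.
    public static int M1((int Book, int Chapter) arg)
    {
        return arg.Book * 100 + arg.Chapter;
    }

    public static void Main()
    {
        // Fix 1: spell the name the way the destination does.
        Console.WriteLine(M1((Book: 1, Chapter: 2)));   // prints 102

        // Fix 2: do not name the elements in the literal at all.
        Console.WriteLine(M1((1, 2)));                  // prints 102

        // Not a tuple literal: the names come from the type of 't',
        // so they are silently dropped by the identity conversion.
        var t = (Boook: 1, Chapter: 2);
        Console.WriteLine(M1(t));                       // prints 102
    }
}
```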

There are plans to improve the name mismatch analysis. Some of those plans are captured in this WorkItem. More data/statistics on the real-world use of tuples would be useful to improve the analysis as well.

Element names must match when overriding or implementing.

Some language designers felt particularly strongly about overriding and implementing scenarios. There was some discussion of whether changing element names upon overriding/implementing is a bad enough pattern that it must be a compile error or just a warning.
What tipped the scales towards making this an error is that if the error is found to be too strict, it can be relaxed without being a compatibility issue. A change in the opposite direction would be breaking.

static void Main(string[] args)
{
    Animal a = new Dog();

    a.M1(). ???  //  AnimalName or DogName ?
}

abstract class Animal
{
    public abstract (int ID, string AnimalName) M1();
}

// Changing element names when overriding could be confusing to the caller.
class Dog: Animal
{
    // Error: cannot change tuple element names when overriding.
    public override (int ID, string DogName) M1()
    {
        return (1, "Spot");
    }
}

Note that these restrictions get validated and reported after the semantic analysis. The element names are ignored while determining overriding/implementing relationships, but when it is done, it is enforced that element names match.
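For contrast, here is a minimal sketch of an override that satisfies the rule. It reuses the Animal and Dog shapes from the sample above; the OverrideDemo class is just a hypothetical host for Main.

```csharp
using System;

public abstract class Animal
{
    public abstract (int ID, string AnimalName) M1();
}

public class Dog : Animal
{
    // OK: element names are the same as in the overridden method.
    public override (int ID, string AnimalName) M1()
    {
        return (1, "Spot");
    }
}

public static class OverrideDemo
{
    public static void Main()
    {
        Animal a = new Dog();

        // No ambiguity for the caller: the names are the same
        // regardless of the static type of the receiver.
        Console.WriteLine(a.M1().AnimalName);   // prints Spot
    }
}
```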

C# Tuples. How tuples are related to ValueTuple.

2017-01-16T00:00:00+00:00

As a matter of implementation details, C# tuples are implemented on top of ValueTuple types. Here are some details about their relationship.

What is actually emitted when tuples are used in the code.

The underlying implementation of C# tuples is fairly simple. Tuples of cardinality 2 through 7 are directly mapped to the ValueTuple type of the corresponding generic arity. I.e. (int, int) is represented by ValueTuple<int, int>.

public void Main()
{
    (int, int) n = (1,1);
    System.Console.WriteLine(n.Item1);
}

is emitted as

public void Main()
{
    ValueTuple<int, int> n = new ValueTuple<int, int>(1, 1);
    System.Console.WriteLine(n.Item1);
}

At 8+ elements things get more interesting. Since arities of ValueTuple types go only up to 8, the compiler resorts to nesting. The first 7 elements are stored as-is and the rest of the elements are stored as a tuple in the Rest field of ValueTuple`8.

public void Main()
{
    var n1 = (1,2,3,4,5,6,7,8);
    System.Console.WriteLine(n1.Item8);

    var n2 = (1,2,3,4,5,6,7,8,9);
    System.Console.WriteLine(n2.Item9);
}

is emitted as

public void Main()
{
    var n1 = new ValueTuple<int, int, int, int, int, int, int, ValueTuple<int>>(1, 2, 3, 4, 5, 6, 7, new ValueTuple<int>(8));
    System.Console.WriteLine(n1.Rest.Item1);

    var n2 = new ValueTuple<int, int, int, int, int, int, int, ValueTuple<int, int>>(1, 2, 3, 4, 5, 6, 7, new ValueTuple<int, int>(8, 9));
    System.Console.WriteLine(n2.Rest.Item2);
}

The encoding scheme used here is recursive. In a 15-element case, Item15 will be mapped to outer.Rest.Rest.Item1, i.e. every level of nesting can store 7 elements + the remaining tail.

Importantly, the tail is always wrapped in a ValueTuple, even if it is just 1 element. The idea is that if the 8th element is itself a tuple, as in (int,int,int,int,int,int,int,(int,int)), the element would be wrapped and thus it could not be confused with a flat tuple that has N more elements, as in (int,int,int,int,int,int,int,int,int).
This clever encoding scheme is not actually new. It is exactly the same approach as has been used by F# tuples for a long time.

Another interesting observation is that this encoding makes it necessary to have ValueTuple<T>, even though by itself 1-element tuples are not expressible in the language.
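The recursive encoding and the wrapping of the tail can be observed from ordinary code. The following is a small sketch, assuming a compiler and runtime where System.ValueTuple is available; NestedTupleDemo and Fifteenth are hypothetical names used only here.

```csharp
using System;

public static class NestedTupleDemo
{
    // Reads the 15th element through the underlying nested fields.
    public static int Fifteenth()
    {
        var t = (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15);
        return t.Rest.Rest.Item1;
    }

    public static void Main()
    {
        var t = (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15);

        // Item15 is the same storage as Rest.Rest.Item1
        Console.WriteLine(t.Item15);      // prints 15
        Console.WriteLine(Fifteenth());   // prints 15

        // 8th element is itself a 2-tuple: the tail wraps it once more
        var nested = (1, 2, 3, 4, 5, 6, 7, (8, 9));

        // flat 9-tuple: the tail is the 2-tuple of the last two elements
        var flat = (1, 2, 3, 4, 5, 6, 7, 8, 9);

        // both lines print True
        Console.WriteLine(nested.Rest.GetType() == typeof(ValueTuple<ValueTuple<int, int>>));
        Console.WriteLine(flat.Rest.GetType() == typeof(ValueTuple<int, int>));
    }
}
```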

What happens if ValueTuple is used in C#7 sources directly?

The backward compatibility requirements dictate that ValueTuple structs are allowed in C#7 code, and code that worked in C#6 should continue working in C#7.
In addition to that, considering that tuples are emitted as ValueTuple, the underlying functionality will unavoidably leak through boxing, interop, dynamic, reflection and other scenarios, so why not just make tuple types be “compatible” with the functionality of the underlying types - including fields, properties, methods, implemented interfaces?

There are two ways this kind of “compatible” could be formalized in C#:

exactly the same type.
Basically it means that the same type has two syntaxes and wherever syntactically possible, one type reference can be replaced with another with no changes to the meaning of the program.
Example: System.Nullable<System.Int32> and int? - both refer to exactly the same type
Anything that can be done with System.Nullable<System.Int32> can be done with int?.
identity convertible.
Here language would track distinct types with different static capabilities, but runtime representation is indistinguishable so a variable of one type can be reinterpreted as a variable of another type.
Example: List<dynamic> and List<object>
myList[0].Blah() would work with the first, but would not compile with the second. However you can make an alias of one type to a variable of another.

public void Main()
{
    // 'lo' is a list of objects
    // this is the only "real" variable we have here
    var lo = new List<object>() {1, 2};

    // 'ld' is an alias to 'lo', typed as a list of dynamic
    // can do this since these types are identity convertible
    //
    // could also pass 'lo' as a 'ref List<dynamic>' parameter
    // but ref locals make example more compact
    ref List<dynamic> ld = ref lo;

    // this compiles with ld
    // GetTypeCode _can_ be called on dynamic
    ld[0].GetTypeCode();

    // this would not compile
    // GetTypeCode _cannot_ be called on object
    // lo[0].GetTypeCode();   // error
}

So, what happens with tuples?

You cannot do t.Alice when t is typed as ValueTuple<int, int>, but you can when it is typed as (int Alice, int Bob). Tuple types with element names are clearly separate types. The matter of tuples with element names is worth a separate post, but in short - yes, tuples with element names are identity convertible to the corresponding ValueTuple<> types.

public void Main()
{
    // the only "real" variable here
    var ii = new ValueTuple<int, int>(1,2);

    // make an alias typed as '(int Alice, int Bob)'
    // can do this since these types are identity convertible
    ref (int Alice, int Bob) ab = ref ii;

    // '(int Alice, int Bob)' has an element 'Bob'
    // it is the same variable as 'ii.Item2'
    ab.Bob = 42;

    // prints 42
    System.Console.WriteLine(ii.Item2);
    // prints 42
    System.Console.WriteLine(ab.Item2);
    // prints 42
    System.Console.WriteLine(ab.Bob);
}  

On the other hand, the semantic differences between (int, int) and ValueTuple<int, int> would be so subtle that it was decided to just make them the same type. It does mean that ValueTuple<int, int> is treated a bit specially by the language. In addition to all the properties common to similar generic types, ValueTuple<int, int> would have all the additional functionality of (int, int).

This difference is hard to notice (and that is the point).
The easiest way is through observing the presence of ItemN elements beyond the first 7:

public void Main()
{
    // this type matches the pattern of 8-ple (int, int, int, int, int, int, int, int)
    ValueTuple<int, int, int, int, int, int, int, ValueTuple<int>> vt =
        new ValueTuple<int, int, int, int, int, int, int, ValueTuple<int>>
            (1, 2, 3, 4, 5, 6, 7, new ValueTuple<int>(8));

    // surely it has 'Item8' element
    System.Console.WriteLine(vt.Item8);

    // that is actually emitted as
    System.Console.WriteLine(vt.Rest.Item1);
}

From the implementation perspective, when the compiler sees a ValueTuple<> type whose shape matches the underlying layout of a tuple type, it “upgrades” the type reference to mean the actual tuple type. The transformation applies equally to type references in source as well as in metadata. As a result, conforming ValueTuple<> types behave exactly as the corresponding tuple types.
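Here is a small sketch of what "the same type" means in practice for the 2-element case. SameTypeDemo and Sum are hypothetical names; the point is only that a variable declared with the ValueTuple syntax gets tuple functionality such as deconstruction.

```csharp
using System;

public static class SameTypeDemo
{
    // The parameter is declared with the ValueTuple syntax...
    public static int Sum(ValueTuple<int, int> pair)
    {
        // ...but it can be deconstructed like any other tuple.
        var (a, b) = pair;
        return a + b;
    }

    public static void Main()
    {
        (int, int) t = (1, 2);

        // no conversion is needed here, both declarations name the same type
        ValueTuple<int, int> vt = t;

        Console.WriteLine(Sum(t));          // prints 3
        Console.WriteLine(vt.Equals(t));    // prints True
    }
}
```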

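Overall there are just two distinct groups of tuple types - with element names and without. Tuple types without element names are the same types as the matching ValueTuple<> types, and tuple types with element names are identity convertible to them.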

C# Tuples. Why mutable structs?

2017-01-07T00:00:00+00:00

C# defines tuples as mutable value types. Considering the general guidance against mutable structs, it may look like a peculiar design choice.

Indeed, why? And in particular, why did the existing family of System.Tuple<> classes not fit?

Why value types.

As with many other features, the design of C# tuples started with investigating existing pain points that the new feature was supposed to fix. Special attention was paid to the tuple-like types in existing code bases. By tuple-like, here I mean an abstract datatype used just to bundle up several values without giving them any particular meaning.

Turns out Roslyn itself had three!! unrelated custom tuple-like types. There were ValueTuple<T1, T2>, ValueTuple<T1, T2, T3> and so on. There was Pair. And at some point, I believe, there was StructTuple, which was later merged with ValueTuple. In addition to that there was some use of KeyValuePair for purposes that have nothing to do with either keys or values - just to combine two unrelated pieces of data and to pass around. Other projects, including .Net FX itself, had similar finds. (for example Pair)

The common theme for all these was trafficking multiple pieces of data as a single unit and in particular returning multiple values from methods. One existing solution for returning multiple values is through out parameters, but it is often inconvenient and does not work for async methods at all, so an aggregate type is needed.
On the other hand, programmers are not inclined to create specialized types just to move 2+ items around, so they create generalized helper types like Pair that otherwise have no special meaning - just to hold two things. These are scenarios where tuples would step in.
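The async case is easy to illustrate. Out parameters are not allowed on async methods, while a tuple works as the result type of the task. The sketch below uses a hypothetical DivideAsync method and made-up element names.

```csharp
using System;
using System.Threading.Tasks;

public static class AsyncTupleDemo
{
    // 'out' parameters are not allowed on async methods,
    // so both results travel together as a single tuple.
    public static async Task<(int Quotient, int Remainder)> DivideAsync(int a, int b)
    {
        // just await something
        await Task.Yield();

        return (a / b, a % b);
    }

    public static void Main()
    {
        var r = DivideAsync(17, 5).Result;

        Console.WriteLine(r.Quotient);    // prints 3
        Console.WriteLine(r.Remainder);   // prints 2
    }
}
```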

It was also observed that these types are typically structs. Considering that a tuple is just a combination of values with no identity on its own, the value semantics of structs appeared to be convenient.
Ideally it would be possible to just push more than one value to the stack upon returning from a method, but since that is not possible, pushing a single struct containing those values is the next best thing. Using classes here would only add the costs of allocation and indirection.

In the end it appeared that structs are a sufficient and also cheap solution for combining multiple pieces of data into a single unit.
The only consideration in favor of classes was the cost of copying in case tuples are large. It turns out that large tuples, while possible, are exceedingly rare.

Why Mutable

Since tuples are structs, there is not much point in making them immutable. The readonliness of a struct is a property of the whole variable. If a tuple variable is assignable, it can be changed to contain any value regardless of the readonliness of individual fields.

Example:

struct ImmutableTuple<T1, T2>
{
    // immutable, right?
    public T1 Item1{get;}
    public T2 Item2{get;}

    public ImmutableTuple(T1 item1, T2 item2)
    {
        Item1 = item1;
        Item2 = item2;
    }

    public override string ToString() => $"({Item1}, {Item2})";
}

static void Test(ImmutableTuple<int, int> arg)
{
    // change arg arbitrarily to (42, 42)
    arg = new ImmutableTuple<int, int>(42, 42);
    System.Console.WriteLine(arg);
}

Interestingly, in the example above the compiler does not even try to create a new instance and assign it. It directly initializes arg with new values since it knows there is no difference:

void Test (
  valuetype Program/ImmutableTuple`2<int32, int32> arg
) cil managed
{
// Method begins at RVA 0x206b
// Code size 23 (0x17)
.maxstack 8

// load the address of "arg"
IL_0000: ldarga.s arg

// load values
IL_0002: ldc.i4.s 42
IL_0004: ldc.i4.s 42

// call the constructor "in-place" on the arg
IL_0006: call instance void valuetype Program/ImmutableTuple`2<int32, int32>::.ctor(!0, !1)

IL_000b: ldarg.0
IL_000c: box valuetype Program/ImmutableTuple`2<int32, int32>
IL_0011: call void [mscorlib]System.Console::WriteLine(object)
IL_0016: ret
} // end of method Program::Test

Surely, the cheapest instance is the one that was not created at all.

Anyhow, immutability of tuples would not prevent elements from being changeable. It would only serve to annoy users when they want to change and have to do it in a roundabout way instead of directly assigning.

Also, once tuples are mutable, there is no point in using properties - that would just prevent passing individual elements by reference, resulting in unnecessarily diminished functionality compared to using two variables directly. Indeed - if you had two mutable variables, you could pass either by reference, so why prevent that once you have them in a mutable tuple?

The end result - C# tuples are extremely lightweight constructs - they are structs and their elements are mutable and directly addressable.

using System;

public class C
{
    public static void Main()
    {
        // tuple1 is a struct
        var tuple1 = (Alice: 1, Bob: 2);

        // tuple2 is a copy
        var tuple2 = tuple1;

        // elements can be assigned
        tuple1.Alice = 42;

        // elements can be passed by reference
        Inc(ref tuple1.Bob);

        // tuple2 is indeed a copy. (prints "False")
        System.Console.WriteLine(tuple1.Equals(tuple2));
    }

    public static void Inc(ref int x)
    {
        x++;
    }
}

Note aside: So, is it all that useless to make the content of a struct readonly?

Not at all !!!

While design of a struct, on its own, cannot prevent its values from being assigned as a whole, it can prevent piece-wise assignment. That is very important if a given struct has internal invariants guaranteed by the construction.

Consider the following struct:

// text span starting at "Start" and ending at "End"
struct TextSpan
{
     private readonly int start;
     private readonly int end;

     public TextSpan(int start, int end)
     {
          if (start < 0) throw new ArgumentException();
          if (end < 0) throw new ArgumentException();
          if (end < start) throw new ArgumentException();

          this.start = start;
          this.end = end;
     }

     // none of the following can be negative!!
     public int Start => start;
     public int End => end;
     public int Length => end - start;
}

Note that TextSpan values have special meaning. There are also guarantees that they have nonnegative Start and End and nonnegative Length. Even if a struct can be overwritten as a whole by another value, that new value would still have the same guarantees.
In a normal program (i.e. no racing assignments, use of reflection or unsafe code) the invariants of TextSpan would hold regardless of how it is used. Thanks to it being immutable!!!

What makes tuples different here is that tuples do not guarantee any invariants - they are just containers, and any values are allowed, so nothing extra would be achieved by being immutable.
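To make the TextSpan point concrete, here is a sketch in which a span variable is overwritten as a whole. The struct repeats the one above; TextSpanDemo and TryReplace are hypothetical names. Whole-value assignment is possible, but only with values that passed the constructor checks.

```csharp
using System;

// same shape as the TextSpan above
public struct TextSpan
{
    private readonly int start;
    private readonly int end;

    public TextSpan(int start, int end)
    {
        if (start < 0) throw new ArgumentException();
        if (end < 0) throw new ArgumentException();
        if (end < start) throw new ArgumentException();

        this.start = start;
        this.end = end;
    }

    public int Start => start;
    public int End => end;
    public int Length => end - start;
}

public static class TextSpanDemo
{
    // Tries to overwrite 'span' as a whole.
    // Returns false if the new value is rejected by the constructor.
    public static bool TryReplace(ref TextSpan span, int start, int end)
    {
        try
        {
            span = new TextSpan(start, end);
            return true;
        }
        catch (ArgumentException)
        {
            return false;
        }
    }

    public static void Main()
    {
        var span = new TextSpan(2, 5);

        // overwriting with a valid value works
        Console.WriteLine(TryReplace(ref span, 10, 10));   // prints True
        Console.WriteLine(span.Length);                    // prints 0

        // an invalid value never gets into the variable
        Console.WriteLine(TryReplace(ref span, 5, 2));     // prints False
        Console.WriteLine(span.Start);                     // prints 10
    }
}
```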

Why ref locals allow only a single binding?

2016-11-30T00:00:00+00:00

The current restriction that ref locals are single-assignable is a straightforward and simple way to guard against several potential problems. There are ways to relax the restriction in the future, if that is found to be beneficial enough.

First of all - ref locals are a new kind of ref variables. I have touched some general details common to all ref variables in the earlier posts. They are not pointers. They are implemented on top of managed references. In many ways ref locals are similar to ref parameters. Both belong to the kind of variables that do not get their own storage and instead are bound to existing storage.

Ref locals are lexically scoped, just like other locals, but the life time of the storage that they are bound to may not match the scopes of the references. That is where things get “interesting”.

It could be observed that unrestricted “any ref local can be bound or re-bound to any variable at any time” may lead to the following problems:

1. If you want to return a ref local, compiler must be able to validate that all possible bindings at that point are “safe to return by ref”

Here is an example of a ref local that is not safe to return due to ref assignments and nontrivial control flow.

ref int RotateRefs()
{
  ref var r0 = ref arr[0];  // safe to return
  ref var r1 = ref arr[1];  // safe to return
  ref var r2 = ref arr[2];  // safe to return
  ref var r3 = ref arr[3];  // safe to return

  var local = 42;
  ref var r4 = ref local;   // NOT safe to return !!!

  while(Condition())
  {
    // shift-rotate the refs
    ref var temp = ref r0;
    // (hypothetical syntax for ref re-assignment)
    r0 = ref r1;
    r1 = ref r2;
    r2 = ref r3;
    r3 = ref r4;
    r4 = ref temp;  
  }

  // "r0" is not safe to return here!!!
  // imagine an error message that tries to explain why.
  return ref r0;
}

In the most general case, enforcing the safe-to-return rule for ref locals would require an analysis similar to the definite assignment analysis. The compiler would need to traverse the control flow graph while propagating what is known about the variables and repeating the analysis until no more knowledge could be gained.

That seems rather complicated, but there is more.

2. ref local should not be allowed to be used outside of the life times of the possible referents.

Violating this would lead to variables that are bound to something that, from the point of view of the language, “does not exist”.

Consider the following example:

class Program
{
    static readonly int[] arr = { 0, 0, 0, 0 };

    static void Main(string[] args)
    {
        // bind r0, r1 to something initially
        ref var r0 = ref (new int[1])[0];
        ref var r1 = ref r0;

        for (int i = 0; i < 2; i++)
        {
            // NOTE: possibly initializing "variable" using a value
            //       of its own binding from 2 iterations behind.
            var variable = r0 + 1;

            // keeping the previous binding of "r0" in "r1"
            // (hypothetical syntax for ref re-assignment)
            r1 = ref r0;

            // binding "r0" to "variable". (hypothetical syntax for ref re-assignment)
            // NOTE: "variable" is about to go out of scope,
            //       but "r0" and "r1" would still be around, bound to what???
            r0 = ref variable;
        }

        // NOTE: both "r0" and "r1" are bound to different
        //       bindings of "variable", which do not exist at this point.
        Console.WriteLine($"Different bindings of 'variable' are equal?: {r0 == r1}");
        Console.WriteLine($"r0 : {r0}");
        Console.WriteLine($"r1 : {r1}");
    }
}

As with locals being returned by reference, exposing a local from an inner scope to the code in the outer scopes by means of a reference is a problem. It could be:

1) allowed with undefined behavior.
2) allowed with exposed locals captured into closures.
3) disallowed.

For the same reasons as with ref returns, just disallowing seems more appropriate for C#.

Sadly, “locals should not be exposed to the outer scopes” is a simple principle that is not so simple to enforce. Precise enforcement would require a transitive tracking of all possible bindings of ref locals and validating that scoping rules are not violated at every use point of the references.

For example the following code could be allowed:

ref int outer;

try
{
    for(;;)
    {
      int inner = 0;

      // binding outer to the inner!!!
      outer = ref inner;

      // assigning inner through outer.
      // that is ok since we could do it directly here too.
      outer = 42;
    }
}
catch
{
    outer = ref (new int[1])[0];
}

// this is ok also, as long as it is proven that outer
// is not referencing something that is not in scope
outer = 333;

The analysis that is required to validate the example above would be something similar to the analysis for the safe-to-return validation for ref locals, but the state that is incrementally updated and propagated through control flow paths would be more complex. It would need to reflect the entire collection of all possible scopes from which ref locals could reference variables. The state will be affected by ref-assignments, and in the case of method calls the analysis will have to propagate the state from ref parameters to the ref returns.

Note that this analysis would also be sufficient to prove if/when ref locals are safe-to-return. If at particular point, the set of possible scopes for a ref local contains only the “out-of-method” scope, then ref-returning is safe.

Also note that the analysis is very complex, can be computationally expensive and would often yield diagnostics that are hard to act upon -
“ERROR: can’t use a ref local here because it is possibly referencing a variable that is out of scope”, - scratch your head and try to figure out where and how the inconvenient binding was picked up.

Language designers are naturally concerned when a feature requires such analysis. In such situations it is desirable to find a way to constrain the feature in order to reduce the number of scenarios supported, preferably at the cost of uncommon cases. After all, maximizing the number of programs that compile correctly, by itself is not a goal.

In the case of ref locals, it was found that requiring ref locals to be initialized at declaration and not allowing re-assignment solves the problems described above very nicely:

the issue with exposing locals by reference to outer scopes is trivially prevented, since it is not possible to initialize something with a variable from an inner scope.
safe-to-return property can be simply copied from the initial referent, since we require that there is one and there won’t be another.
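A minimal sketch of how this looks in code that is legal under the single-assignment rule. RefLocalDemo, Values and Second are hypothetical names. The ref local is bound once, to an array element, so it is safe to return; the commented-out lines show the kind of binding that would be rejected.

```csharp
using System;

public static class RefLocalDemo
{
    public static readonly int[] Values = { 10, 20, 30 };

    public static ref int Second()
    {
        // bound once, at declaration, to an array element;
        // array elements live on the heap, so 'r' is safe to return
        ref int r = ref Values[1];
        r += 1;
        return ref r;

        // by contrast, this would be rejected by the compiler:
        // 'bad' is initialized with a local, so it is not safe to return
        //
        //   int local = 42;
        //   ref int bad = ref local;
        //   return ref bad;
    }

    public static void Main()
    {
        // 's' is another name for Values[1]
        ref int s = ref Second();
        s = 42;

        Console.WriteLine(Values[1]);   // prints 42
    }
}
```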

It is possible that single-assignment requirement will be found too constraining and the language may need to be relaxed a bit in the future. Generally it is ok to start accepting code that used to be an error in previous versions, but the opposite changes are extremely rare.

Some possible future directions for the feature are:

Addition1: allow re-assigning of ref locals, but only with something that is safe-to-return, but keep inferring the safe-to-return property from the initial assignment.
Addition2: relax the requirement that ref locals must be initialized at declaration. Treat such locals as safe-to-return, as long as they are definitely assigned.

Note that the additions above would allow more code to be legal, primarily when dealing with unscoped/heap variables, while not yet requiring complicated flow analysis.

– Pedantic notes:

Before the ref locals were introduced, it was, in some situations, possible to use System.TypedReference in combination with __makeref, __refvalue keywords as a crude substitute. TypedReference clearly contains a managed reference and acts as a proxy, so why all the fuss with C# scoping and why that does not apply to TypedReference?

Well, __makeref and __refvalue are basically just special keywords that directly map to makerefany and refanyval IL opcodes in order to provide basic support for __arglist feature.

The functionality is optional in both C# and CLR and is not recommended for general purpose use. In a way, it is a platform/CLR feature (like reflection) and is not very well integrated into the language.

Compiler knows about CLR restrictions of TypedReference - it cannot be boxed, cannot be a field or an array element, cannot be returned from a method, etc… Compiler will try preventing unverifiable code and fatal GC holes, but not much beyond that.

Indeed, the following example shows that __makeref completely disrespects scoping rules of C# and does not care if references may outlive the life times of the referenced locals, which, while not fatal, may result in undocumented/unpredictable behavior.

class Program
{
    static readonly int[] arr = { 0, 0, 0, 0 };

    static void Main(string[] args)
    {
        // bind r0, r1 to something initially
        TypedReference r0 = __makeref((new int[1])[0]);
        TypedReference r1 = r0;

        for (int i = 0; i < 2; i++)
        {
            // NOTE: possibly initializing "variable" using a value
            //       of its own binding from 2 iterations behind.
            var variable = __refvalue(r0, int) + 1;

            // keeping the previous binding of "r0" in "r1"
            r1 = r0;

            // binding "r0" to "variable".
            // NOTE: "variable" is about to go out of scope,
            // but "r0" and "r1" would still be around, bound to what???
            r0 = __makeref(variable);


            // DUMMY NOOP CODE,
            // UNCOMMENT TO CHANGE THE PROGRAM'S OUTPUT!!!!
            //Func<int> dummy = () => variable; // cause "variable" be captured
        }

        // NOTE: both "r0" and "r1" are bound to different
        //       bindings of "variable", which do not exist at this point.
        Console.WriteLine($"Different bindings of 'variable' are equal?: {__refvalue(r0, int) == __refvalue(r1, int)}");
        Console.WriteLine($"r0 : {__refvalue(r0, int)}");
        Console.WriteLine($"r1 : {__refvalue(r1, int)}");
    }
}

The output is:

Different bindings of 'variable' are equal?: True  
r0 : 2
r1 : 2

And when the noop Func is uncommented, the output is:

Different bindings of 'variable' are equal?: False  
r0 : 2
r1 : 1

Definite Assignment Analysis of locals. The real purpose.

2016-11-24T00:00:00+00:00

Definite Assignment Analysis prevents bugs, but there are deeper reasons to have it as a required feature in the language.

Indeed - why have such strict rules instead of just assuming that unassigned locals contain default/zero values? After all, that seems to work ok for fields. It is also known that the C# compiler decorates all methods with the IL directive localsinit, which guarantees that all locals are zeroed out when a method is entered. So what is the problem?

localsinit is, unfortunately, not enough to implement the semantics of C# locals. It would work if the life time of all local bindings (i.e. extents), was the whole method, but that is not the case in C#. In C# locals can have scopes smaller than the entirety of a method and extents match the lexical scopes. Every time the control flow enters a scope, a new set of bindings for the locals contained by that scope is supposed to be created and the bindings exist as long as they can be referenced. In the most general sense the “new bindings” would imply a newly allocated storage completely unrelated to the bindings possibly created when the same scope was entered previously.

A brute-force solution would be to map local variables of the same scope to fields in a synthesized class and create a new instance of such class when entering a scope. In fact this is what happens to locals that are accessible from lambda expressions. Such locals can be used beyond the life time of the containing method and multiple bindings to the same variables could coexist at the same time, so compiler needs to allocate their storage on the heap and rely on GC for keeping them alive as long as they can be referenced.

Example of multiple bindings to the same local:

    class Program
    {
        static void Main(string[] args)
        {
            int iteration = 0;

            var setters = new Action<int>[2];
            var getters = new Func<int>[2];

          reenterScope:

            {
                int sameVariable = 0;  // <-- THE VARIABLE

                setters[iteration] = (i) => sameVariable = i;
                getters[iteration] = () => sameVariable;
            }

            if (iteration++ < 1)
            {
                goto reenterScope;
            }

            Console.WriteLine("Original values of different bindings of the sameVariable: ");

            Console.WriteLine(getters[0]());
            Console.WriteLine(getters[1]());
            Console.WriteLine();

            setters[0](33);
            setters[1](42);

            Console.WriteLine("Assigned values of different bindings of the sameVariable: ");

            Console.WriteLine(getters[0]());
            Console.WriteLine(getters[1]());
        }
    }

Original values of different bindings of the sameVariable:
0
0

Assigned values of different bindings of the sameVariable:
33
42
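The "brute-force" mapping described above can be written out by hand. This is only a rough sketch of the idea, not the exact code the compiler emits: the class name ScopeBindings and its Set/Get methods are made up for illustration, and a plain loop stands in for the goto.

```csharp
using System;

// Hypothetical stand-in for the class a compiler would synthesize
// to hold the locals of the inner scope.
sealed class ScopeBindings
{
    public int sameVariable;

    // bodies of the two lambdas, turned into instance methods
    public void Set(int i) => sameVariable = i;
    public int Get() => sameVariable;
}

static class LoweredExample
{
    public static int[] Run()
    {
        var setters = new Action<int>[2];
        var getters = new Func<int>[2];

        for (int iteration = 0; iteration < 2; iteration++)
        {
            // entering the scope allocates a fresh set of bindings on the heap
            var bindings = new ScopeBindings();
            bindings.sameVariable = 0;

            // delegates keep their own instance alive via the GC
            setters[iteration] = bindings.Set;
            getters[iteration] = bindings.Get;
        }

        setters[0](33);
        setters[1](42);

        // each getter reads from its own binding
        return new[] { getters[0](), getters[1]() };
    }
}
```

Each pass through the scope gets its own ScopeBindings instance, which is why the two getters can report different values for what is textually the same variable.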

The most common case, however, is when locals are just that: locals. They are not accessed from lambdas or anything like that, and at any time at most one binding to such a local may exist. In such cases locals can simply be mapped to IL local slots and reused every time the control flow enters the scope. The only problem is that the slot values would need to be “reset” to the default value every time the scope is entered, and there is no help from localsinit here since that works only once, when the whole method is invoked.

In theory, compiler could inject code that would do the “resetting” of all relevant slots, when a scope is entered, but that would be wasteful. Only some of the locals in a given scope would be read from. Besides, most of them would be written to before reading anyways, so why not just require that a local is written to before being read? That would make the code less buggy, but most of all it will make the “resetting” entirely unnecessary.

Essentially, a rule that requires locals to be definitely assigned before being read serves the same purpose as localsinit, but does a much better job:

It works at every nested lexical scope recursively (not just on the method level)
It gives stronger guarantees. You can see only what you have already assigned to the variable. It is impossible to read uninitialized/stale state by accident.
It is minimally redundant. If you do not read a local on some code path you do not need to ensure that it is assigned on that code path

Simple example of some variables assigned on one code path and not assigned on the other. As long as we do not read the variable it is ok to have it not assigned.

static void Main(string[] args)
{
    int a;
    int b;

    if (args.Length > 0)
    {
        goto path1;
    }
    else
    {
        goto path2;
    }

    path1:
    // assign only "a"
    a = 123;
    goto path1continues;

    path2:
    // assign only "b"
    b = 345;
    goto path2continues;

    path1continues:
    // on this codepath "b" is not assigned,
    // but that is ok since we do not read it.
    Console.WriteLine(a);
    goto exit;

    path2continues:
    // on this codepath "a" is not assigned,
    // but that is ok since we do not read it.
    Console.WriteLine(b);

    exit:
    return;
}

Interestingly, in VB, for historical reasons, locals not referenced from lambdas do have extents that match the entirety of the method, and thus definite assignment analysis is much less strict. It basically exists just to give warnings in some cases that are likely to be coding mistakes.

Example of a VB local binding maintained through the entirety of the method lifetime:

Module Module1

    Sub Main()

        Dim iterations As Integer = 3

        While iterations > 0

            Console.WriteLine("variable declared and initialized")
            Dim variable As Integer = 42

reentryTheScope:
            Console.WriteLine(variable)

            variable += 1
            iterations -= 1
        End While


        If iterations > -3 Then
            ' "variable" is out of scope here, but it exists and has a value
            ' let's reenter the While loop and check upon the value of the "variable"
            Console.WriteLine("reentering scope")
            GoTo reentryTheScope
        End If

    End Sub

End Module

variable declared and initialized
42
variable declared and initialized
42
variable declared and initialized
42
reentering scope
43
reentering scope
44
reentering scope
45

The locals captured into lambda closures, however, have scoped extents in VB - surely the lifetimes cannot be bound to the lifetime of the containing method anymore when lambdas are involved. Similarly to C#, fresh bindings for captured locals are created when scope is entered and their lifetimes are bound to the lifetime of the referencing lambdas. So if locals are captured, the example above would start behaving differently. To make the unfortunate inconsistency less observable, VB refuses to compile code like above when locals are captured.

Module Module1

    Sub Main()

        Dim iterations As Integer = 3

        While iterations > 0

            Console.WriteLine("variable declared and initialized")
            Dim variable As Integer = 42

reentryTheScope:
            Console.WriteLine(variable)

            variable += 1
            iterations -= 1

            ' cause "variable" to be captured into a closure
            Dim lambda As Func(Of Integer) = Function() variable
        End While


        If iterations > -3 Then
            ' "variable" is out of scope here, but it exists and has a value
            ' let's reenter the While loop and check upon the value of the "variable"
            Console.WriteLine("reentering scope")
            GoTo reentryTheScope
        End If

    End Sub

End Module

Module1.vb(27, 13) : Error BC36597 : 'Goto reentryTheScope' is not valid because 'reentryTheScope' is inside a scope that defines a variable that is used in a lambda or query expression.

Pedantic observations:

Since C# enforces a stronger invariant than the one provided by localsinit, one would wonder why the compiler still puts localsinit on methods. A simple answer is that IL verification rules require that. The underlying reason for the requirement is that the user’s code is not the only entity that might read the locals. The other one is the Garbage Collector.

The issue with GC is that it scans the IL locals of currently active methods in order to record the roots of reachable object graphs, and GC happens at fairly random times. Definite assignment analysis does not guarantee that locals will be assigned something deterministic before GC happens, and things will go terribly wrong if locals contain random junk. Therefore there is a rule that requires verifiable methods to have localsinit as a directive telling the JIT to add a method preamble that wipes the whole stack frame clean before the method body is formally entered and before GC has any chance to scan the locals.

In theory the rule could be required only on methods with locals of reference types (or structs containing references), but that would make a difference only to a fraction of methods while complicating the rule. Instead CLI standard allows JIT implementations to disregard the directive if, through some analysis, it could be inferred that not wiping the frame is a safe thing to do.

I am not sure if JITs use this kink in the rules very often though. With the exception of the most trivial cases, the analysis could be too involved to be feasible at JIT time, and wiping the stack frame is not overly expensive. Still, since there are some costs associated with locals (wiping the frame is just one of them), the C# compiler generally tries to be frugal with usage of local slots, especially when compiling with /o+.

Conditional member access operator (idiomatic uses).

2016-11-15T00:00:00+00:00

Here are some of the lesser-known uses of the null-conditional operator that could be handy to know.

1. use null-conditional access with void methods

A common misconception about null conditional operator is that it can be used only with members that return something. That is probably because of the special treatment where the return type is promoted to nullable (if it cannot represent null already).
It is, actually, perfectly fine to use null-conditional with a void returning method. The overall expression type is still void, just the method is called conditionally.

// get a dictionary if we have one
Dictionary<int, int> dict = GetDictionaryOrNull();

// add something, if we actually have a dictionary
// NOTE: Add has void return type
dict?.Add(42, GetValue());

It is particularly nice that execution of the whole Add(42, GetValue()) is conditional and thus GetValue() is only evaluated if dict is not null.
Without ?. the same code would not look as nice.
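For comparison, here is a sketch of roughly the same logic written without ?. on a parameter. GetValue comes from the snippet above, but its body here is a placeholder, and the GetValueCalls counter and the class name are assumptions added only to make the conditional evaluation observable.

```csharp
using System.Collections.Generic;

static class WithoutNullConditional
{
    // counts calls so that the conditional evaluation can be observed
    public static int GetValueCalls;

    // placeholder body, assumed for illustration
    static int GetValue()
    {
        GetValueCalls++;
        return 1;
    }

    public static void AddIfPresent(Dictionary<int, int> dict)
    {
        // explicit null check; GetValue() runs only when dict is not null
        if (dict != null)
        {
            dict.Add(42, GetValue());
        }
    }
}
```

Calling AddIfPresent(null) leaves GetValueCalls untouched, which is the same short-circuiting that dict?.Add(42, GetValue()) gives in a single expression.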

2. raising events

Generally C# events need to be null-checked before being invoked. In addition to that, to be safe from races, an event needs to be captured into a local. That is quite a bit of code to just raise an event:

public class C
{    
    public event Action OnSomething = null;

    public void DoSomething()
    {
        // raise OnSomething, if not null
        var onSomething = OnSomething;
        if ( onSomething != null)
        {
           onSomething();         
        }
    }
}

Raising an event, however, is nothing more than calling the Invoke method on the event’s delegate. And performing that conditionally on not being null is much clearer with ?. :

public class C
{    
    public event Action OnSomething = null;

    public void Something()
    {
        // raise OnSomething, if not null
        OnSomething?.Invoke();
    }
}

3. null-conditional and null-coalescing operator together

When receiver of a null-conditional operator is null, the result is also null of appropriate type. That might be inconvenient in cases where a “default” result, other than null is supposed to be returned. That can be easily and elegantly fixed by combining ?. and ??.

// if obj is not null, give me its hashcode or 42 otherwise
int hashcode = obj?.GetHashCode() ?? 42;

It may seem that the code has two null checks here, which would be redundant, considering that only one input variable could be null:

// null check "obj", wrap result in "int?"
int? temp = obj?.GetHashCode();

// null check the "temp", unwrap "int?"
int hashcode = temp ?? 42;

Compiler is actually smart enough to understand the meaning of ?. + ?? combination, and emits more optimal code. It knows that the only way for the obj?.GetHashCode() to be null is when obj is null and in such case the whole expression returns 42. When obj is not null, the result of GetHashCode() is returned. In fact, there is no need to involve intermediate wrapping/unwrapping of int? at all.

The actual code, that is emitted, is an equivalent of:

object stackTemp = obj;
int hashcode = stackTemp != null ? stackTemp.GetHashCode() : 42;

4. use null-conditional in conditions

A null-conditional expression has type bool? when the underlying expression is of type bool. Such an expression cannot be used directly in conditions. However, there are easy ways to “normalize” the three-state result into true/false.

When null should be treated the same as false, use == true or ?? false.

public class C
{        
    // assigned externally
    public static HashSet<int> hs;

    public void Check()
    {		
        if (hs?.Contains(42) == true)
        {
            System.Console.WriteLine("contains");        
        }

        if (hs?.Contains(42) ?? false)
        {
            System.Console.WriteLine("contains");        
        }
    }
}

Both == true and ?? false result in the same code emitted as the resulting conditions indeed have the same semantics. In either case compiler can infer that the only situation in which the condition will be satisfied is when hs is not null and when hs.Contains(42) returns true.

I personally like == true form more, but I have seen ?? false used and I find it just as readable.

Again, the intermediate bool?, that would be produced by hs?.Contains(42) alone, and all expenses related to dealing with it, can be bypassed.

The actual codegen for either of the ifs above looks like:

IL_0000: ldsfld class [System.Core]System.Collections.Generic.HashSet`1<int32> C::hs
IL_0005: dup
IL_0006: brtrue.s IL_000c

IL_0008: pop
IL_0009: ldc.i4.0
IL_000a: br.s IL_0013

IL_000c: ldc.i4.s 42
IL_000e: call instance bool class [System.Core]System.Collections.Generic.HashSet`1<int32>::Contains(!0)

IL_0013: brfalse.s IL_001f

IL_0015: ldstr "contains"
IL_001a: call void [mscorlib]System.Console::WriteLine(string)

Which is equivalent to an optimized:

HashSet<int> stackTemp = C.hs;
if (stackTemp != null && stackTemp.Contains(42))
{
    Console.WriteLine("contains");
}

Conversely != false and ?? true could be used when null is to be treated the same as true.

if (hs?.Contains(42) != false)
{
    System.Console.WriteLine("contains or null");        
}

if (hs?.Contains(42) ?? true)
{
    System.Console.WriteLine("contains or null");        
}

In either case, the condition is emitted as an optimized equivalent of:

HashSet<int> stackTemp = C.hs;
if (stackTemp == null || stackTemp.Contains(42))
{
    Console.WriteLine("contains or null");
}

5. composing null-conditional and lifted operators

Null-conditional operator mixes well with lifted operators.

public class C
{        
    // assigned externally
    public static string s;

    public bool IsLongEnough()
    {		
        return s?.Length * 2 + 1 > 10;
    }
}

Again, the compiler knows about the short-circuiting nature of ?. and that the only way a null can get into the calculation is through s being null. Once that happens, the result is known immediately, without the need to propagate that null through the whole chain of lifted calculations.

The actual code emitted here is equivalent of:

public class C
{
    public static string s;
    public bool IsLongEnough()
    {
        string stackTemp = C.s;
        return stackTemp != null &&   // <- check for null once
                    stackTemp.Length * 2 + 1 > 10; // <- not null-propagating  
    }
}

As you may notice the intermediate nullables are again completely elided by the compiler and the math was simplified to regular (not the null-propagating) form.
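To make the elided steps visible, here is a sketch that spells out the lifted arithmetic one operator at a time next to the single-null-check form. The class and method names are made up for illustration, and s is taken as a parameter rather than a static field to keep the sketch self-contained.

```csharp
static class LiftedComparison
{
    // step-by-step form with intermediate nullables
    public static bool Naive(string s)
    {
        int? length = s?.Length;    // null when s is null
        int? doubled = length * 2;  // lifted *: null in, null out
        int? sum = doubled + 1;     // lifted +: null in, null out
        return sum > 10;            // lifted >: false when sum is null
    }

    // single null check followed by plain int arithmetic
    public static bool Optimized(string s)
    {
        return s != null && s.Length * 2 + 1 > 10;
    }
}
```

Both methods agree for null and non-null inputs alike. The second one simply never materializes an int?.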

Safe to return rules for ref returns.

2016-11-04T00:00:00+00:00

For the reasons explained in the earlier post, C# disallows returning local variables by reference. While the principle of “Cannot return local variables by reference” seems very simple, there are many ways for a user to violate the principle directly or indirectly and enforcing it is an interesting challenge.

So, what is exactly safe to return and what is not?

Clearly attempting to return a local by reference should trigger an error. But what about a field of a local? If that local happens to be a struct we would be in trouble, since we would still be returning a reference to the local data. On the other hand it would be ok if the local is a class. Therefore there is a need to generalize the rule to include the fields of struct locals as well, recursively: a field of a field of a field of … of a local is unsafe to return as long as all types in the chain are structs.
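A small sketch of the struct/class distinction described above. The type and method names are hypothetical, and the rejected variant is left commented out so that the rest compiles.

```csharp
struct PointStruct { public int X; }
class PointClass { public int X; }

static class FieldReturns
{
    // OK: the field lives inside a heap object, which outlives the call
    public static ref int FieldOfClassLocal()
    {
        var p = new PointClass();
        p.X = 5;
        return ref p.X;
    }

    // Not OK: the field is part of the local itself, so this would be
    // a reference into the stack frame of the method; the compiler rejects it.
    // public static ref int FieldOfStructLocal()
    // {
    //     var p = new PointStruct();
    //     return ref p.X;
    // }
}
```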

Another interesting question is whether a ref return or a ref parameter themselves are safe to return.

Consider the following example:

ref int Callee(ref int arg)
{
  return ref arg;
}

ref int Caller()
{
  int s = 42;

  // DANGER!! returning a reference to the local data
  return ref Callee(ref s);
}

Here Caller passes a local variable s by reference to Callee. Then Callee returns it back by reference. What comes from Caller is essentially ref s. While it would be ok for the caller to use that, returning that result by reference would be an equivalent of returning s and thus should be prevented.
In a general case, compiler has no knowledge of what is going on inside Callee. Conservatively, compiler must assume that any byref parameter or its field may be returned back by reference, so as long as any of the ref arguments are not safe to return, the result of the call is not safe to return either.
Note that in some cases, knowing the types of the ref parameters and the return type, it could be proven that the return cannot possibly be referencing data from one of the parameters. However, it was decided to be conservative here for the sake of simplicity and consistency. (Considering structs, interfaces and generics, the additional rules could get really complicated.)

Here are the actual “safe to return” rules as enforced by the language:

1. refs to variables on the heap are safe to return
2. ref parameters are safe to return
3. out parameters are safe to return (but must be definitely assigned, as is already the case today)
4. instance struct fields are safe to return as long as the receiver is safe to return
5. “this” is not safe to return from struct members
6. a ref returned from another method is safe to return if all refs/outs passed to that method as formal parameters were safe to return. Specifically, it is irrelevant whether the receiver is safe to return, regardless of whether the receiver is a struct, a class, or typed as a generic type parameter.

The last two rules might look a bit curious. What’s up with “this”?
The special treatment for “this” was added to handle the following scenario:

interface IIndexable<T>
{
  // ref-returning indexer; interface members are implicitly public
  ref T this[int i] { get; }
}

ref int First<T>(T arg) where T: IIndexable<int>
{
  // is this safe to return by reference?
  return ref arg[0];
}

The problem is that “this” is passed by reference to struct members and by value to class members. If we consider “this” in struct members the same as other parameters for the purpose of rule #6, we would have a problem here since we do not know whether T is a struct or a class. Treating type T conservatively as “can be a struct” would diminish the usefulness of ref returns when used with generics, so another approach was chosen: “this” is completely ignored at the call site for the purpose of the “safe to return” rule, as we may not even know whether we are dealing with a struct or a class. To make that safe in cases when we do get a struct, rule #5 was added. Surely, it is known inside a member whether the container is a struct or a class, and the safety can be enforced there.
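Several of the rules listed above can be sketched in a few lines. The rule numbers in the comments refer to the position of each rule in that list. All type and method names here are hypothetical, and the variant rejected by rule #5 is left commented out.

```csharp
struct Pair
{
    public int First;
    public int Second;

    // rule 5: "this" is not safe to return from a struct member,
    // so a member like the following is rejected by the compiler:
    // public ref int GetFirst() { return ref First; }
}

static class SafeToReturn
{
    static readonly int[] heapData = new int[4];

    // rule 1: an array element is a variable on the heap
    public static ref int HeapElement(int i) { return ref heapData[i]; }

    // rule 2: a ref parameter
    public static ref int Same(ref int arg) { return ref arg; }

    // rule 4: a struct field whose receiver (a ref parameter) is safe to return
    public static ref int SecondOf(ref Pair pair) { return ref pair.Second; }

    // rule 6: every ref argument passed to the inner call is safe to return
    public static ref int Forward(ref Pair pair) { return ref Same(ref pair.First); }
}
```

Since the results are references, a caller can assign through them: SafeToReturn.SecondOf(ref p) = 3; updates p.Second in place.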

Verifiability.
There is a little issue with ref returns concerning verifiability. Generally ECMA 335 specifies ref returns as not verifiable. Some JITs are less strict and allow ref returning of heap variables (to accommodate some patterns used by Managed C++). That relaxed behavior is still stricter than the “safe to return” rules, and some examples involving ref returns would only work in scenarios that do not involve formal verification.

It is, however, believed that a system in agreement with “safe to return” rules is actually typesafe and there are plans to add corresponding relaxation to verification rules in the current JITs and tools like PEVerify.

It is conceivable that ECMA 335 will be modified or get an implementation specific amendment for such relaxation at some point as well.

Local variables cannot be returned by reference.

2016-10-29T00:00:00+00:00

The ability to return by reference introduces an interesting scenario. What happens when a local variable is returned by reference? Is the variable still alive when its containing method has completed? What happens with the returned reference when the callee is invoked again?

These are the questions that every language that allows byref returns needs to answer one way or another. C# design team had to deal with these questions too.

Several options were considered:

– Allow returning locals by reference and leave the behavior unspecified.
That is how C++ handles locals returned by reference. Although most C++ compilers would give a warning.

It is not a viable option for C#. The underlying mechanism for ref returns is managed pointers and those are subject to GC tracking. Regular locals are typically implemented as slots on the stack and subsequent calls will reuse those slots while their local variables may have different types.
It is extremely dangerous to have a managed reference of type T pointing to unspecified data that has nothing to do with T. If GC attempts to track through such a reference and follows what it takes to be the fields of the T instance, but which are in reality bits and pieces of some other type, it could easily result in heap corruption.

Example of a type safety problem if actual stack slots are returned by ref.

// returns a ref to an Exception local
ref Exception RefEx()
{
  Exception local = new Exception("hi");
  return ref local;
}

// returns a ref to an int local
ref int RefInt()
{
  int local = 42;
  return ref local;
}

void TakesTwoRefs(ref Exception s, ref int i)
{
  GC.Collect();
}

void WritesIntIntoEx()
{
  // RefEx will run in the same stack space as RefInt
  // so it is likely that results of RefInt and RefEx
  // would point to the same or overlapping memory location
  // That is already bad by itself.
  // What is worse is that RefInt writes "42" into that location
  // If GC happens during the call, it may see something typed
  // as "Exception" at completely bogus location.
  TakesTwoRefs(ref RefEx(), ref RefInt());
}

– Extend the life time of the local by allocating it on the heap.
This is how Go handles this situation.

It would not be something entirely new for C#. The approach would be somewhat similar to capturing locals into closures. However, it was decided that in the context of ref returns this is not a good solution.

Firstly, the extent (lifetime) of a local variable in C# matches its scope. Since the caller is running outside of the scope of the callee, then, from its point of view, the locals of the callee do not exist. Note that lambdas that cause locals to be captured into closures do not leave the scope, while returning from the method certainly does leave the scope. It would be strange if the caller could get an alias to a local that does not exist, and perhaps even multiple aliases to multiple incarnations of such a local, if the caller makes a ref returning call more than once.
That was not the major point, though. I am sure with some effort such behavior could be rationalized and accepted, if necessary.

Secondly, and more importantly, the whole idea of introducing ref returns was motivated by performance-sensitive scenarios where it would allow to avoid redundant copying. Enabling the feature via automatic capturing of locals into display classes would defeat the purpose.

– Disallow returning local variables by reference.
This is the solution that was chosen for C#. To guarantee that a reference does not outlive the referenced variable, C# does not allow returning local variables by reference.
Interestingly, this is the same approach used by Rust, although for slightly different reasons. (Rust is an RAII language and actively destroys locals when exiting scopes.)