CSC/ECE 517 Fall 2011/ch1 1c cm: Difference between revisions
(54 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
== Introduction == | == Introduction == | ||
Closures are first-class functions that capture the | Closures are [[ #firstclass | first-class functions]] that capture the [[ #freevariable | free variables]] in the lexical scope in which they are defined. Closures are associated with functional programming. Functional programming is a programming paradigm in which higher-order functions act on other functions. Functional programming is motivated by lambda calculus which dates back to the 1930s. Object-oriented programming dates back to the 1960s. The philosophy was devised to simplify programming complex problems <ref name="SimulaPaper">like simulating intelligence</ref>. Objects in object-oriented programming are first-class entities that themselves contain instance data, referred to as fields or data members, and procedures or functions called [http://en.wikipedia.org/wiki/Method_(computer_programming) methods]. These methods can act on the instance data stored in an object instance. While object-oriented programming is widely (almost universally) used today, there are some tasks or problems that are solved more concisely, more extensibly, or more elegantly using closures than methods. Some of these tasks include [[ #generics | iterating across a collection and performing some computation for each item in the collection]], [[ #callbacks | callbacks]], [[ #parallel | parallelized reduction]], [[ #management | policy enforcement]], [[ #deferred | deferred execution]], and [[ #continuations | continuations]]. In addition the functional programming style that relies upon closures offers a more [[ #syntax | declarative style of programming]] that often leads to better readability and less code to maintain. | ||
== Definitions == | == Definitions == | ||
Line 7: | Line 7: | ||
=== Closures === | === Closures === | ||
"A closure is a function that captures the bindings of free variables in its lexical context." | "A closure is a function that captures the bindings of free variables in its lexical context."<ref>Gafter, Neal. "A Definition of Closures" http://gafter.blogspot.com/2007/01/definition-of-closures.html</ref> In other words, a closure is a block of executable code together with a reference to the variables in scope at the site of its creation. These free variables are captured by the closure, and their lifetime is extended throughout the lifetime of the closure. | ||
Closures can be created by defining a function within the body of another function as the following example illustrates. Note that the scope of the parameter <code>n</code> is limited to the inside of the <code>addGen</code> function. However, because <code>addGen</code> returns a closure, that closure has access to <code>n</code> whenever it is evoked. | Closures can be created by defining a function within the body of another function as the following example illustrates. Note that the scope of the parameter <code>n</code> is limited to the inside of the <code>addGen</code> function. However, because <code>addGen</code> returns a closure, that closure has access to <code>n</code> whenever it is evoked. | ||
Line 23: | Line 23: | ||
</pre> | </pre> | ||
==== First-class functions ==== | ==== <div id="firstclass">First-class functions</div> ==== | ||
Closures are typically implemented as first-class functions. A first-class function is a function that can appear anywhere in a program that other first-class values (such as a number or a string) can appear. Namely, a first-class function can be assigned to a variable, passed as an argument to another function, and returned as the return value from a function. | Closures are typically implemented as [http://en.wikipedia.org/wiki/First-class_function first-class functions]. A first-class function is a function that can appear anywhere in a program that other first-class values (such as a number or a string) can appear. Namely, a first-class function can be assigned to a variable, passed as an argument to another function, and returned as the return value from a function. | ||
==== Execution-time environment contains free variables ==== | ==== <div id="freevariable">Execution-time environment contains free variables</div> ==== | ||
Early functional programming languages such as LISP relied upon dynamic scoping in which free variables are bound at run-time to the first matching variable name in the call stack. First the compiler would look inside the function for a definition of each variable. If none was found, it would next look at the calling function, and so on. | Early functional programming languages such as [http://en.wikipedia.org/wiki/Lisp_(programming_language) LISP] relied upon [http://c2.com/cgi/wiki?DynamicScoping dynamic scoping] in which free variables are bound at run-time to the first matching variable name in the call stack. First the compiler would look inside the function for a definition of each variable. If none was found, it would next look at the calling function, and so on. | ||
An example of dynamic scoping in a JavaScript-like language: | An example of dynamic scoping in a JavaScript-like language: | ||
Line 68: | Line 68: | ||
=== Methods === | === Methods === | ||
Object-oriented programming dates back to at least the middle 1960's. One of the fundamental principals of objects is the notion of encapsulation. One of the earliest descriptions of the methodology, dates back to a 1966 paper describing the SIMULA, a language designed to ease simulation of large systems. The SIMULA language used classes to encapsulate coupled procedures and data into an object, “an object is a self-contained program (block instance) having its own local data and actions defined by a 'class declaration'” | [http://en.wikipedia.org/wiki/Object-oriented_programming Object-oriented programming] dates back to at least the middle 1960's. One of the fundamental principals of objects is the notion of encapsulation. One of the earliest descriptions of the methodology, dates back to a 1966 paper describing the [http://en.wikipedia.org/wiki/Simula SIMULA], a language designed to ease simulation of large systems. The SIMULA language used classes to encapsulate coupled procedures and data into an object, “an object is a self-contained program (block instance) having its own local data and actions defined by a 'class declaration'” <ref name="SimulaPaper">Ole-Johan Dahl and Kristen Nygaard. 1966. SIMULA: an ALGOL-based simulation language. Commun. ACM 9, 9 (September 1966), 671-678. DOI=10.1145/365813.365819 http://doi.acm.org/10.1145/365813.365819 </ref>. This 50 year-old concept of fields and procedures is central to the design of today's object-oriented languages such as Smalltalk, Java and C++. | ||
A first-class object method requires: | |||
#finite set of static or instance data (commonly referred to as fields) created by the programmer which hold the state of the object | #a finite set of static or instance data (commonly referred to as fields) created by the programmer which hold the state of the object | ||
# | #actions or procedures which can modify the object state referenced using an implicit 'this' or 'self' | ||
==== Instance or static class functions ==== | ==== Instance or static class functions ==== | ||
Our definition of a method requires functions which can be bound to a first-class object. All object-oriented languages provide this construct. We can create objects and pass those objects around to be | Our definition of a method requires functions which can be bound to a first-class object. All object-oriented languages provide this construct. We can create objects and pass those objects around to be used by other objects or in the global environment. Methods receive an implicit 'this' pointer which gives the execution context access to fields of the object. | ||
An example | An example below uses a C++ class which has internal fields for holding state. The class has internal state represented by private fields and both public and private methods (explained below). The internal state is only accessible using the methods provided by the class. | ||
<pre> | <pre> | ||
Line 105: | Line 105: | ||
</pre> | </pre> | ||
In the example the class has access to only | In the example the class itself has access to mData, mLeft, and mRight fields. The public instance functions getLeft, getRight, and getData provide read-only access to the internal state, while private instance functions setRight and setLeft can be used by other methods internal to the class to modify state. | ||
==== Methods which modify the object state ==== | ==== Methods which modify the object state ==== | ||
Object oriented classes must have fields which can be modified by the object itself. All object-oriented language also provide this construct. | Object oriented classes must have fields which can be modified by methods provided by the object itself. All object-oriented language also provide this construct. | ||
Reference the example above. The remove method is available to all instances of ListItem, while the setRight and setLeft methods are available only within the object. They reference other instances of ListItems using pointers to those instances. All of these methods only work on fields that have been clearly defined for the object. | |||
== Closures vs. Methods - Benefits of Closures == | == Closures vs. Methods - Benefits of Closures == | ||
Where the benefits of methods have been demonstrated in object-oriented systems, closures are finding their way into languages which haven't had them in the past, like C++ and Java. The C++ proposal for extending the language lists some of the benefits of supporting closures in the traditionally OO language, including notational simplicity, code sharing of common algorithms, and generality | Where the benefits of methods have been demonstrated in object-oriented systems, closures are finding their way into languages which haven't had them in the past, like C++ and Java. The C++ proposal for extending the language lists some of the benefits of supporting closures in the traditionally OO language, including notational simplicity, code sharing of common algorithms, and generality <ref name="stroustrup">Stroustrup, Bjarne and Jeremiah Willcock, Jaakko Jarvi, Doug Gregor, and Andrew Lumsdaine. "Lamba expressions and closures for C++", Technical Report N1968, ISO/IEC JTC 1, Information Technology, Subcommittee SC 22, Programming Language C++, 2006 http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2006/n1968.pdf</ref>. An IBM developerWorks article also lists the some common usages of closures as customization, iterating across collections, and managing resources and enforcing policy. | ||
We will broadly break these down into examples where Closure provide a clean implementation. | We will broadly break these down into examples where Closure provide a clean implementation. | ||
* Generic algorithms - iterating across collections | * [[ #generics | Generic algorithms - iterating across collections ]] | ||
* Callbacks - customizing behavior | * [[ #callbacks | Callbacks - customizing behavior ]] | ||
* Managing resources and enforcing policy | * [[ #management | Managing resources and enforcing policy ]] | ||
* Deferred execution | * [[ #deferred | Deferred execution ]] | ||
* Continuations | * [[ #continuations | Continuations ]] | ||
* Declarative syntax leads to less code | * [[ #syntax | Declarative syntax leads to less code ]] | ||
=== Generic Algorithms === | === <div id="generics">Generic Algorithms</div> === | ||
Generic algorithms provide a means of capturing a set of steps to perform on a data. Examples of generic algorithms would be mathematical formula like the area of a circle or the square of a number, or a generic process applied to a series of input. | |||
We can further segment generic algorithms down into the following examples: | |||
* [[ #algos | basic algorithm syntax ]] | |||
* [[ #curry | specialization or parameter currying ]] | |||
* [[ #parallel | parallel programming (e.g. the map-reduce problem) ]] | |||
The classic for_each template algorithm in C++ is called using the following syntax | ==== <div id="algos">Basic Algorithms</div> ==== | ||
The C++ proposal makes light of the fact that closures provide the ability to inject code into generic routines without the programmer having to provide adapters. "The lack of a syntactically light-weight way to define simple function objects is a hindrance to the effective use of several generic algorithms in the Standard Library" <ref name="stroustrup"/> This is a classic example where Closures provide clean implementation. | |||
The for_each template algorithm in C++ is called using the following syntax. | |||
<pre> | <pre> | ||
Line 180: | Line 184: | ||
The Closure is syntactically much cleaner. | The Closure is syntactically much cleaner. | ||
==== Specialization / | ==== <div id="curry">Specialization / Currying Parameters</div> ==== | ||
Currying generically refers to the act of transforming a procedure that takes multiple parameters and converting it to one that takes fewer. | [http://en.wikipedia.org/wiki/Currying Currying] generically refers to the act of transforming a procedure that takes multiple parameters and converting it to one that takes fewer.<ref>Wikipedia Currying https://secure.wikimedia.org/wikipedia/en/wiki/Currying</ref> Currying allows creation of specialized versions of generic functions using parameters bound by the closing environment. In languages without closure support it is necessary to explicitly create specialized classes containing the fields needed to call the generic function. | ||
A very simple example in Ruby that make use of an enclosing environment are a CSV line reader and an XML tag reader built from the string scan method | A very simple example in Ruby that make use of an enclosing environment are a CSV line reader and an XML tag reader built from the string scan method. | ||
<pre> | <pre> | ||
Line 223: | Line 227: | ||
For these simple types of functions, the Closure provides some benefit but a class specialization can provide the same function. | For these simple types of functions, the Closure provides some benefit but a class specialization can provide the same function. | ||
==== Parallel Programming - | ==== <div id="parallel">Parallel Programming - Map-Reduce</div> ==== | ||
Google popularized the technique of | Google popularized the technique of running parallel [http://en.wikipedia.org/wiki/MapReduce map reduction] with its search algorithm that simultaneously runs the same search in multiple environments and combines the results to produce a single answer. Within a programming environment Closures allow simple parallel map reduction because the closure provides contextual environment (results, handles, and other local storage) for individual threads. This state data would normally be contained in object instance fields created specifically for the parallel task. | ||
The following Ruby thread example<ref>Flanagan, David and Yukihiro Matsumoto, The Ruby Programming Language. Sebastopol: O'Reilly Media, Inc., 2008. http://oreilly.com/catalog/9780596516178, pg 382</ref> | The following Ruby thread example<ref>Flanagan, David and Yukihiro Matsumoto, The Ruby Programming Language. Sebastopol: O'Reilly Media, Inc., 2008. http://oreilly.com/catalog/9780596516178, pg 382</ref> demonstrates a multi-threaded operation in a closure. The concurrent map and reduce using a thread per file demonstrates the map-reduce algorithm. The Closure provides a thread context and simple function which counts lines in each file per thread and populates a map which is reduced to a total number of lines in all files. | ||
<pre> | <pre> | ||
Line 256: | Line 260: | ||
</pre> | </pre> | ||
=== Callbacks === | === <div id="callbacks">Callbacks</div> === | ||
Callbacks are often cited as the most desirable application of Closures because the environment around the callback is initialized without programmer intervention. In languages that don't provide Closures, interfaces, fields, or even parameters may have to be added to provide thunking and state information to object being called. | [http://en.wikipedia.org/wiki/Callback_(computer_programming) Callbacks] are often cited as the most desirable application of Closures<ref>"Learning more about Closures", StackOverflow.com. http://stackoverflow.com/questions/212401/javascript-how-do-i-learn-about-closures-usage</ref><ref>Morrison, Joe. "A Brief Introduction to Four Programming Language Concepts". http://joemorrison.org/projects/four-concepts/</ref> because the environment around the callback is initialized without programmer intervention. In languages that don't provide Closures, interfaces, fields, or even parameters may have to be added to provide thunking and state information to object being called. | ||
Below are two important callback forms that can demonstrate this capability in common use. | |||
* GUI behavior injection | * [[ #gui_handlers | GUI behavior injection ]] | ||
* generic exception handling | * [[ #exceptions | generic exception handling ]] | ||
==== GUI behavior Injection ==== | ==== <div id="gui_handlers">GUI behavior Injection</div> ==== | ||
Behavior injection is used heavily in Javascript libraries where they allow the creation of generic libraries which can be easily extended using a provided function. | Behavior injection is used heavily in Javascript libraries where they allow the creation of generic libraries which can be easily extended using a provided function. The Closure allows binding the in-scope GUI object to the callback function, freeing the programmer from creation of an object or interface. | ||
Below is an example using the JQuery bind method to provide some capability to an event framework. | Below is an example using the JQuery bind method to provide some capability to an event framework. | ||
Line 279: | Line 283: | ||
</pre> | </pre> | ||
The Closure is created in the context of each selected DOM object (in this case those objects with type of "p"). The anonymous function is then called when the "mouseenter" event fires. The context of the this pointer is unique to each DOM object because the Closure captured the environment (object) at the time of function creation. | The Closure is created in the context of each selected [http://en.wikipedia.org/wiki/Document_Object_Model DOM] object (in this case those objects with type of "p"). The anonymous function is then called when the "mouseenter" event fires. The context of the this pointer is unique to each DOM object because the Closure captured the environment (object) at the time of function creation. | ||
Consider the same callback behavior in a language like Java which does not provide Closures. | Consider the same callback behavior in a language like Java which does not provide Closures. | ||
Line 295: | Line 299: | ||
We must first define the interface or function which will provide the callback. It must provide the context under which the callback will execute. If we want access to more of the environment we must make provisions in the MouseClick class by adding state variables or pointers to the environment under which it was instanced. | We must first define the interface or function which will provide the callback. It must provide the context under which the callback will execute. If we want access to more of the environment we must make provisions in the MouseClick class by adding state variables or pointers to the environment under which it was instanced. | ||
==== Generic Exception Handling ==== | ==== <div id="exceptions">Generic Exception Handling</div> ==== | ||
Closures can be useful in exception handling because the context of the execution environment is maintained in the Closure. There is a guarantee that the objects referenced will still be in scope when the exception is taken. | |||
<pre> | <pre> | ||
Line 310: | Line 314: | ||
end | end | ||
ex_handler = exception_handler { puts "Just another Error" } | myobject = Thing.new | ||
ex_handler = exception_handler { puts "Just another Error #{myobject.error}" } | |||
ex_handler.call { | ex_handler.call { | ||
Line 317: | Line 322: | ||
</pre> | </pre> | ||
Where the exception handler is excuted in the example above the programmer is guaranteed that the reference to myobject is still valid and this can be used to provide information about the exception itself. | |||
=== <div id="management">Resource Management / Policy Enforcement</div> === | |||
Closures allow the creation of policies within an application or class because they provide an easy means of calling back into a piece of user specified code and state at appropriate times ensuring the resource and user environment has not gone out of scope. | |||
==== Maintaining Resources ==== | This category of usage can be demonstrated by the following examples. | ||
* [[ #resources | maintaining resources ]] | |||
* [[ #policy | resource policy ]] | |||
==== <div id="resources">Maintaining Resources</div> ==== | |||
This often cited example shows the encapsulation of file base policy using a C++ and a Ruby Closure.<ref>Wikipedia Resource Acquisition is Initialization: http://en.wikipedia.org/wiki/Resource_Acquisition_Is_Initialization</ref> | This often cited example shows the encapsulation of file base policy using a C++ and a Ruby Closure.<ref>Wikipedia Resource Acquisition is Initialization: http://en.wikipedia.org/wiki/Resource_Acquisition_Is_Initialization</ref> | ||
Line 373: | Line 382: | ||
The Closure is created with the file handle and the file name in scope. When the Closure goes out of the scope the file handle will be closed. | The Closure is created with the file handle and the file name in scope. When the Closure goes out of the scope the file handle will be closed. | ||
==== Resource Policy ==== | ==== <div id="policy">Resource Policy</div> ==== | ||
An example showing how a resource policy can implemented is shown below.<ref>Tate, Bruce, "Crossing borders: Closures". IBM developerWorks Technical Topics. https://www.ibm.com/developerworks/java/library/j-cb01097/index.html</ref> This example shows a database transaction handler allowing callback in the middle of the processing of the multiple step process of committing a database change. | |||
<pre> | <pre> | ||
Line 391: | Line 400: | ||
</pre> | </pre> | ||
The user of this policy would be able to define a transaction_handler, providing a Closure that will execute between steps of setup and commit. | The user of this policy would be able to define a transaction_handler, providing a Closure that will execute between steps of setup and commit. The policy of committing the transaction is wrapped in a Closure allow the developer to make a change only at a particular defined step in the process. | ||
The Ruby ActiveRecord library makes extensive use of this pattern to provide multiple callbacks throughout record creation | The Ruby ActiveRecord library makes extensive use of this pattern to provide multiple callbacks throughout record creation. For example it provides the following callbacks during the record creation process: before_validation_on_create, validation_on_create, after_validation_on_create, before_create, after_create. | ||
=== Deferred execution === | === <div id="deferred">Deferred execution</div> === | ||
Closures are subject to deferred execution. This means that they can be constructed, passed around, and executed at a later time or not at all. This saves the computer from performing unnecessary computations. | Closures are subject to [http://blogs.msdn.com/b/charlie/archive/2007/12/09/deferred-execution.aspx deferred execution]. This means that they can be constructed, passed around, and executed at a later time or not at all. This saves the computer from performing unnecessary computations. | ||
An example of deferred execution using LINQ in C# 3.5 or higher: | An example of deferred execution using LINQ in C# 3.5 or higher: | ||
Line 409: | Line 418: | ||
string occupation = "Teacher"; | string occupation = "Teacher"; | ||
IEnumerable<int> agesOfPeopleInOccupation = peeps.Where(p => p.Occupation == occupation).Select(p => p.Age); | // Because of deferred execution, the list is not enumerated here. | ||
IEnumerable<int> agesOfPeopleInOccupation = peeps.Where(p => p.Occupation == occupation).Select(p => p.Age); | |||
occupation = "Plumber"; | occupation = "Plumber"; | ||
// Average() causes the list to be enumerated. Because occupation is now | |||
// set to "Plumber", that is the value used to filter the list. | |||
double average = agesOfPeopleInOccupation.Average(); | |||
Console.WriteLine(average) // 29.0 | Console.WriteLine(average) // 29.0 | ||
</pre> | </pre> | ||
=== Continuation Passing Style === | === <div id="continuations">Continuation Passing Style</div> === | ||
Continuation Passing Style (CPS) in functional programming allows a programmer to finish a process at a later time. A method implementing CPS takes as an argument a function (closure) that is to be called instead of returning a value. The supplied function is instead invoked with the return value. | [http://en.wikipedia.org/wiki/Continuation-passing_style Continuation Passing Style (CPS)] in functional programming allows a programmer to finish a process at a later time. A method implementing CPS takes as an argument a function (closure) that is to be called instead of returning a value. The supplied function is instead invoked with the return value. | ||
The following example is taken from | The following example is taken from MSDN Magazine<ref>Miller, Jeremy. "Functional Programming for Everyday .NET Development", MSDN Magazine. http://msdn.microsoft.com/en-us/magazine/ee309512.aspx</ref>. | ||
The ICommandExecutor has two methods, which both implement CPS by requiring a closure to be passed as an argument. | The ICommandExecutor has two methods, which both implement CPS by requiring a closure to be passed as an argument. | ||
Line 467: | Line 479: | ||
</pre> | </pre> | ||
=== Declarative syntax leads to less code === | === <div id="syntax">Declarative syntax leads to less code</div> === | ||
Functional programming leads to more declarative programming where a programmer specifies what he wants to do rather than how to do it. | Functional programming leads to more [http://en.wikipedia.org/wiki/Declarative_programming declarative programming] where a programmer specifies what he wants to do rather than how to do it. | ||
To illustrate, we will again use LINQ and C#. In this example we tell the compiler to sort, filter, and partition a list of elements without ever specifying how it is to be done. The logic to do so has been written once by the language developers and can be reused in a multitude of ways by passing in lambda expressions (closures). | To illustrate, we will again use LINQ and C#. In this example we tell the compiler to sort, filter, and partition a list of elements without ever specifying how it is to be done. The logic to do so has been written once by the language developers and can be reused in a multitude of ways by passing in lambda expressions (closures). | ||
Line 495: | Line 507: | ||
== Further Reading == | == Further Reading == | ||
[http://martinfowler.com/bliki/Closure.html Martin Fowler - Closure] | [http://martinfowler.com/bliki/Closure.html Martin Fowler - Closure] : provides a basic definition of Closures and gives a few very basic examples. | ||
[http://gafter.blogspot.com/2007/01/definition-of-closures.html Neal Gafter - A Definition of Closures] | [http://gafter.blogspot.com/2007/01/definition-of-closures.html Neal Gafter - A Definition of Closures] : gives a high-level, historical definition of Closures. | ||
[http://onestepback.org/articles/invitationtoruby/reason4.html Top Ten Reasons I Like Ruby - Blocks and Closures] | [http://onestepback.org/articles/invitationtoruby/reason4.html Jim Weirich - Top Ten Reasons I Like Ruby - Blocks and Closures] : provides several examples of how Closures can be useful in Ruby. | ||
[http://www.skorks.com/2010/05/closures-a-simple-explanation-using-ruby/ Alan Skorkin | [http://www.skorks.com/2010/05/closures-a-simple-explanation-using-ruby/ Alan Skorkin - Closures - A Simple Explanation] : gives a high-level overview of Closures using Ruby examples. | ||
[http://www.ibm.com/developerworks/java/library/j-cb01097/index.html IBM Developerworks - Crossing borders : Closures] | [http://www.ibm.com/developerworks/java/library/j-cb01097/index.html IBM Developerworks - Crossing borders : Closures] : gives Ruby examples where Closures are useful. | ||
[http://samdanielson.com/2007/9/6/an-introduction-to-closures-in-ruby Sam Danielson - An Introduction to Closures in Ruby] | [http://samdanielson.com/2007/9/6/an-introduction-to-closures-in-ruby Sam Danielson - An Introduction to Closures in Ruby] : high-level overview of using Closures in Ruby. | ||
[http://csharpindepth.com/Articles/Chapter5/Closures.aspx C# in Depth - The Beauty of Closures] | [http://csharpindepth.com/Articles/Chapter5/Closures.aspx C# in Depth - The Beauty of Closures] : demonstrates Closures in C#. | ||
[http://en.wikipedia.org/wiki/Closure_(computer_science) Wikipedia - Closures] | [http://en.wikipedia.org/wiki/Closure_(computer_science) Wikipedia - Closures] : everything there is to know about Closures on one page. | ||
[http://ynniv.com/blog/2007/08/closures-in-python.html ynniv - Closures in Python] | [http://ynniv.com/blog/2007/08/closures-in-python.html ynniv - Closures in Python] : examples implementing Closures in Python. | ||
[http://msdn.microsoft.com/en-us/magazine/ee309512.aspx MSDN Magazine - Functional Programming .NET Development] | [http://msdn.microsoft.com/en-us/magazine/ee309512.aspx MSDN Magazine - Functional Programming .NET Development] : examples of functional style with Closures in C#. | ||
[http://www.randomhacks.net/articles/2007/02/01/some-useful-closures-in-ruby Eric Kidd - Some useful closures, in Ruby] | [http://www.randomhacks.net/articles/2007/02/01/some-useful-closures-in-ruby Eric Kidd - Some useful closures, in Ruby] : demonstrates more examples of curried and specialized Closures in Ruby. | ||
[http://weblog.raganwald.com/2007/01/closures-and-higher-order-functions.html Reg Braithwaite - Closures and Higher-Order Functions] | [http://weblog.raganwald.com/2007/01/closures-and-higher-order-functions.html Reg Braithwaite - Closures and Higher-Order Functions] : gives Ruby examples of specialized Closures. | ||
http://jibbering.com/faq/notes/closures | [http://jibbering.com/faq/notes/closures Javascript Closures] : demonstrates Javascript Closures in callbacks. |
Latest revision as of 23:37, 21 September 2011
Introduction
Closures are first-class functions that capture the free variables in the lexical scope in which they are defined. Closures are associated with functional programming. Functional programming is a programming paradigm in which higher-order functions act on other functions. Functional programming is motivated by lambda calculus which dates back to the 1930s. Object-oriented programming dates back to the 1960s. The philosophy was devised to simplify programming complex problems <ref name="SimulaPaper">like simulating intelligence</ref>. Objects in object-oriented programming are first-class entities that themselves contain instance data, referred to as fields or data members, and procedures or functions called methods. These methods can act on the instance data stored in an object instance. While object-oriented programming is widely (almost universally) used today, there are some tasks or problems that are solved more concisely, more extensibly, or more elegantly using closures than methods. Some of these tasks include iterating across a collection and performing some computation for each item in the collection, callbacks, parallelized reduction, policy enforcement, deferred execution, and continuations. In addition the functional programming style that relies upon closures offers a more declarative style of programming that often leads to better readability and less code to maintain.
Definitions
Closures
"A closure is a function that captures the bindings of free variables in its lexical context."<ref>Gafter, Neal. "A Definition of Closures" http://gafter.blogspot.com/2007/01/definition-of-closures.html</ref> In other words, a closure is a block of executable code together with a reference to the variables in scope at the site of its creation. These free variables are captured by the closure, and their lifetime is extended throughout the lifetime of the closure.
Closures can be created by defining a function within the body of another function as the following example illustrates. Note that the scope of the parameter n
is limited to the inside of the addGen
function. However, because addGen
returns a closure, that closure has access to n
whenever it is evoked.
def addGen(n) Proc.new { |x| x + n } end add5 = addGen(5) add10 = addGen(10) puts add5(6) # 11 puts add10(6) # 16
First-class functions
Closures are typically implemented as first-class functions. A first-class function is a function that can appear anywhere in a program that other first-class values (such as a number or a string) can appear. Namely, a first-class function can be assigned to a variable, passed as an argument to another function, and returned as the return value from a function.
Execution-time environment contains free variables
Early functional programming languages such as LISP relied upon dynamic scoping in which free variables are bound at run-time to the first matching variable name in the call stack. First the compiler would look inside the function for a definition of each variable. If none was found, it would next look at the calling function, and so on.
An example of dynamic scoping in a JavaScript-like language:
function alertX() { alert(x); // Note that with dynamic scoping, x need not be defined at the site of the function declaration } function fun1() { var x = 10; alertX(); // 10 } function fun2() { var x = "hello"; alertX(); // hello }
Closures, on the other hand, capture variables that are in scope at the time they are created, and the lifetime of those captured variables is extended throughout the lifetime of the closure. This is called lexical or static scoping. Free variables must be defined at compile time -- hence the name static scoping as opposed to dynamic scoping, where the values of free variables can only be known at run time. Closures were first implemented in the Scheme programming language in the 1960s.
The following example illustrates the fact that free variables inside of a closure are in fact bound to the variables defined at the site of the closure's definition. Here we have a function multipleClosures
that returns two closures that both act on the variable i
. Both are able to act on i
though it is not in scope at the site where they are called but rather at the site where they were defined.
def multipleClosures i = 0 [ Proc.new { i = i + 1 }, Proc.new { i = i - 1 } ] end closures = multipleClosures puts closures[0].call # 1 puts closures[0].call # 2 puts closures[1].call # 1
Methods
Object-oriented programming dates back to at least the middle 1960's. One of the fundamental principals of objects is the notion of encapsulation. One of the earliest descriptions of the methodology, dates back to a 1966 paper describing the SIMULA, a language designed to ease simulation of large systems. The SIMULA language used classes to encapsulate coupled procedures and data into an object, “an object is a self-contained program (block instance) having its own local data and actions defined by a 'class declaration'” <ref name="SimulaPaper">Ole-Johan Dahl and Kristen Nygaard. 1966. SIMULA: an ALGOL-based simulation language. Commun. ACM 9, 9 (September 1966), 671-678. DOI=10.1145/365813.365819 http://doi.acm.org/10.1145/365813.365819 </ref>. This 50 year-old concept of fields and procedures is central to the design of today's object-oriented languages such as Smalltalk, Java and C++.
A first-class object method requires:
- a finite set of static or instance data (commonly referred to as fields) created by the programmer which hold the state of the object
- actions or procedures which can modify the object state referenced using an implicit 'this' or 'self'
Instance or static class functions
Our definition of a method requires functions which can be bound to a first-class object. All object-oriented languages provide this construct. We can create objects and pass those objects around to be used by other objects or in the global environment. Methods receive an implicit 'this' pointer which gives the execution context access to fields of the object.
An example below uses a C++ class which has internal fields for holding state. The class has internal state represented by private fields and both public and private methods (explained below). The internal state is only accessible using the methods provided by the class.
class ListItem { public: ListItem( ListItem *in_left, ListItem *in_right, void *in_data ) : mLeft(in_left), mRight(in_right), mData(in_data) {}; ListItem *getLeft() const { return mLeft; } ListItem *getRight() const { return mRight; } void *getData() const { return mData; } ListItem *remove() { mLeft->setRight(mRight); mRight->setLeft(mLeft); return mData; } private: void *mData; ListItem *mLeft, *mRight; void setRight( ListItem *in_right ) { mRight = in_right; } void setLeft( ListItem *in_left ) { mLeft = in_left; } };
In the example the class itself has access to mData, mLeft, and mRight fields. The public instance functions getLeft, getRight, and getData provide read-only access to the internal state, while private instance functions setRight and setLeft can be used by other methods internal to the class to modify state.
Methods which modify the object state
Object oriented classes must have fields which can be modified by methods provided by the object itself. All object-oriented language also provide this construct.
Reference the example above. The remove method is available to all instances of ListItem, while the setRight and setLeft methods are available only within the object. They reference other instances of ListItems using pointers to those instances. All of these methods only work on fields that have been clearly defined for the object.
Closures vs. Methods - Benefits of Closures
Where the benefits of methods have been demonstrated in object-oriented systems, closures are finding their way into languages which haven't had them in the past, like C++ and Java. The C++ proposal for extending the language lists some of the benefits of supporting closures in the traditionally OO language, including notational simplicity, code sharing of common algorithms, and generality <ref name="stroustrup">Stroustrup, Bjarne and Jeremiah Willcock, Jaakko Jarvi, Doug Gregor, and Andrew Lumsdaine. "Lamba expressions and closures for C++", Technical Report N1968, ISO/IEC JTC 1, Information Technology, Subcommittee SC 22, Programming Language C++, 2006 http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2006/n1968.pdf</ref>. An IBM developerWorks article also lists the some common usages of closures as customization, iterating across collections, and managing resources and enforcing policy.
We will broadly break these down into examples where Closure provide a clean implementation.
- Generic algorithms - iterating across collections
- Callbacks - customizing behavior
- Managing resources and enforcing policy
- Deferred execution
- Continuations
- Declarative syntax leads to less code
Generic Algorithms
Generic algorithms provide a means of capturing a set of steps to perform on a data. Examples of generic algorithms would be mathematical formula like the area of a circle or the square of a number, or a generic process applied to a series of input.
We can further segment generic algorithms down into the following examples:
- basic algorithm syntax
- specialization or parameter currying
- parallel programming (e.g. the map-reduce problem)
Basic Algorithms
The C++ proposal makes light of the fact that closures provide the ability to inject code into generic routines without the programmer having to provide adapters. "The lack of a syntactically light-weight way to define simple function objects is a hindrance to the effective use of several generic algorithms in the Standard Library" <ref name="stroustrup"/> This is a classic example where Closures provide clean implementation.
The for_each template algorithm in C++ is called using the following syntax.
template <class InputIterator, class OutputIterator, class UnaryFunction> OutputIterator transform(InputIterator first, InputIterator last, OutputIterator result, UnaryFunction op); { for ( ; first != last; ++first, ++result ) *result = op(*first); return result; }
Because C++ does not have Closures, the fourth parameter requires either a named function or an object which supports "operator ()".
// static function int multiply_2(int x) { return x * 2; } class multiply_it { int multiplier; multiple_it(int x) : multiplier(x) {} int operator()(int x) const { return x * multiplier } } std::vect test(3,10); std::vect output(3); std::transform( test.begin(), test.end(), output.begin(), multiply ) std::transform( test.begin(), test.end(), output.begin(), multiply_it(2) )
If we use the static function and we want to change the multiply function, we either have to create a new named function or create a global variable and make sure the global variable is set correctly before the function is executed. Using the function object, we have can easily change the multiplier, but we were forced to create and manage a new class specifically for the integer multiply.
Consider the same code using Ruby and a Closure where the each function where the developer can pass a code block.
test = [10,10,10] output = test.collect { |x| x * 2 }
The Closure is syntactically much cleaner.
Specialization / Currying Parameters
Currying generically refers to the act of transforming a procedure that takes multiple parameters and converting it to one that takes fewer.<ref>Wikipedia Currying https://secure.wikimedia.org/wikipedia/en/wiki/Currying</ref> Currying allows creation of specialized versions of generic functions using parameters bound by the closing environment. In languages without closure support it is necessary to explicitly create specialized classes containing the fields needed to call the generic function.
A very simple example in Ruby that make use of an enclosing environment are a CSV line reader and an XML tag reader built from the string scan method.
def splitter( type ) in_regex = type lamba { |in_line| in_line.scan(in_regex) if block_given? result.each { |value| yield(value) } else result end } end cvs_splitter = splitter(/[^,]+/) xmltag_splitter = splitter(/<[^>]+>/) cvs_splitter.call("bob,fred"); # returns a list of fields xmltag_splitter.call("<name>text</name>"); # returns a list of tags
Building the same function in C++ again requires a specialization.
class Splitter { public: Splitter( boost::regex &in_regex ) : mregex(in_regex) {} std::vector operator()(std::string in_string) { [implement function to return list] } private: boost::regex mregex; }; Splitter cvs_splitter("/[^,]+/); Splitter xmltag_splitter("/<[^>]+>/");
For these simple types of functions, the Closure provides some benefit but a class specialization can provide the same function.
Parallel Programming - Map-Reduce
Google popularized the technique of running parallel map reduction with its search algorithm that simultaneously runs the same search in multiple environments and combines the results to produce a single answer. Within a programming environment Closures allow simple parallel map reduction because the closure provides contextual environment (results, handles, and other local storage) for individual threads. This state data would normally be contained in object instance fields created specifically for the parallel task.
The following Ruby thread example<ref>Flanagan, David and Yukihiro Matsumoto, The Ruby Programming Language. Sebastopol: O'Reilly Media, Inc., 2008. http://oreilly.com/catalog/9780596516178, pg 382</ref> demonstrates a multi-threaded operation in a closure. The concurrent map and reduce using a thread per file demonstrates the map-reduce algorithm. The Closure provides a thread context and simple function which counts lines in each file per thread and populates a map which is reduced to a total number of lines in all files.
module Enumerable def con_map(&block) threads = [] self.each do |item| threads << Thread.new { block.call(item) } end threads.map { |t| t.value } end end input = ['file1.txt','file2.txt','file3.txt'] lines = {} output = input.con_map { |file| lines[file] = 0 IO.foreach(file) { |in_line| lines[file] += 1 puts "adding #{file} #{lines[file]} #{in_line}\n" } lines[file] }.reduce { |total, val| total+=val }
Callbacks
Callbacks are often cited as the most desirable application of Closures<ref>"Learning more about Closures", StackOverflow.com. http://stackoverflow.com/questions/212401/javascript-how-do-i-learn-about-closures-usage</ref><ref>Morrison, Joe. "A Brief Introduction to Four Programming Language Concepts". http://joemorrison.org/projects/four-concepts/</ref> because the environment around the callback is initialized without programmer intervention. In languages that don't provide Closures, interfaces, fields, or even parameters may have to be added to provide thunking and state information to object being called.
Below are two important callback forms that can demonstrate this capability in common use.
GUI behavior Injection
Behavior injection is used heavily in Javascript libraries where they allow the creation of generic libraries which can be easily extended using a provided function. The Closure allows binding the in-scope GUI object to the callback function, freeing the programmer from creation of an object or interface.
Below is an example using the JQuery bind method to provide some capability to an event framework.
$("p").bind("mouseenter", function(event) { $(this).toggleClass('someclass'); } )
The Closure is created in the context of each selected DOM object (in this case those objects with type of "p"). The anonymous function is then called when the "mouseenter" event fires. The context of the this pointer is unique to each DOM object because the Closure captured the environment (object) at the time of function creation.
Consider the same callback behavior in a language like Java which does not provide Closures.
public class MouseClick implements MouseListener { [... class implementation ...] public void mousePressed(MouseEvent e) { this.toggleClass; } } window.addMouseListener( [instance of MouseClick class] );
We must first define the interface or function which will provide the callback. It must provide the context under which the callback will execute. If we want access to more of the environment we must make provisions in the MouseClick class by adding state variables or pointers to the environment under which it was instanced.
Generic Exception Handling
Closures can be useful in exception handling because the context of the execution environment is maintained in the Closure. There is a guarantee that the objects referenced will still be in scope when the exception is taken.
def exception_handler( &cleanup ) lambda { begin yield rescue e cleanup.call(e) end } end myobject = Thing.new ex_handler = exception_handler { puts "Just another Error #{myobject.error}" } ex_handler.call { File.open("text.txt","r") }
Where the exception handler is excuted in the example above the programmer is guaranteed that the reference to myobject is still valid and this can be used to provide information about the exception itself.
Resource Management / Policy Enforcement
Closures allow the creation of policies within an application or class because they provide an easy means of calling back into a piece of user specified code and state at appropriate times ensuring the resource and user environment has not gone out of scope.
This category of usage can be demonstrated by the following examples.
Maintaining Resources
This often cited example shows the encapsulation of file base policy using a C++ and a Ruby Closure.<ref>Wikipedia Resource Acquisition is Initialization: http://en.wikipedia.org/wiki/Resource_Acquisition_Is_Initialization</ref>
class file { public: file(const char* filename) : file_(std::fopen(filename, "w+")) { } ~file() { std::fclose(file_) } void write(const char* str) { fputs(str, file_); } private: std::FILE* file_; // prevent copying and assignment; not implemented file(const file &); file& operator=(const file &); }; file logfile("logfile.txt"); // open file (acquire resource) logfile.write("A line in the log\n");
A Ruby Closure showing the same example.
def log_write(filename) in_filename = filename logfile = File.open(in_filename, "w+") lambda { |output| logfile.write(output) } end log_writer = log_write("logfile.txt") log_writer.call("A line in the log\n")
The Closure is created with the file handle and the file name in scope. When the Closure goes out of the scope the file handle will be closed.
Resource Policy
An example showing how a resource policy can implemented is shown below.<ref>Tate, Bruce, "Crossing borders: Closures". IBM developerWorks Technical Topics. https://www.ibm.com/developerworks/java/library/j-cb01097/index.html</ref> This example shows a database transaction handler allowing callback in the middle of the processing of the multiple step process of committing a database change.
def transaction_handler lambda { begin setup_transaction yield commit_transaction rescue roll_back_transaction end } end
The user of this policy would be able to define a transaction_handler, providing a Closure that will execute between steps of setup and commit. The policy of committing the transaction is wrapped in a Closure allow the developer to make a change only at a particular defined step in the process.
The Ruby ActiveRecord library makes extensive use of this pattern to provide multiple callbacks throughout record creation. For example it provides the following callbacks during the record creation process: before_validation_on_create, validation_on_create, after_validation_on_create, before_create, after_create.
Deferred execution
Closures are subject to deferred execution. This means that they can be constructed, passed around, and executed at a later time or not at all. This saves the computer from performing unnecessary computations.
An example of deferred execution using LINQ in C# 3.5 or higher:
List<Person> peeps = new List<Person> { new Person { Name = "Joe", Age = 29, Occupation = "Plumber" }, new Person { Name = "Mary", Age = 44, Occupation = "Teacher" }, new Person { Name = "Bob", Age = 56, Occupation = "Teacher" } }; string occupation = "Teacher"; // Because of deferred execution, the list is not enumerated here. IEnumerable<int> agesOfPeopleInOccupation = peeps.Where(p => p.Occupation == occupation).Select(p => p.Age); occupation = "Plumber"; // Average() causes the list to be enumerated. Because occupation is now // set to "Plumber", that is the value used to filter the list. double average = agesOfPeopleInOccupation.Average(); Console.WriteLine(average) // 29.0
Continuation Passing Style
Continuation Passing Style (CPS) in functional programming allows a programmer to finish a process at a later time. A method implementing CPS takes as an argument a function (closure) that is to be called instead of returning a value. The supplied function is instead invoked with the return value.
The following example is taken from MSDN Magazine<ref>Miller, Jeremy. "Functional Programming for Everyday .NET Development", MSDN Magazine. http://msdn.microsoft.com/en-us/magazine/ee309512.aspx</ref>. The ICommandExecutor has two methods, which both implement CPS by requiring a closure to be passed as an argument.
public interface ICommandExecutor { // Execute an operation on a background thread that // does not update the user interface void Execute(Action action); // Execute an operation on a background thread and // update the user interface using the returned Action void Execute(Func<Action> function); }
public class Presenter { private readonly IView _view; private readonly IService _service; private readonly ICommandExecutor _executor; public Presenter(IView view, IService service, ICommandExecutor executor) { _view = view; _service = service; _executor = executor; } public void RefreshData() { _executor.Execute(() => { var data = _service.FetchDataFromExtremelySlowServiceCall(); return () => _view.UpdateDisplay(data); }); } }
Declarative syntax leads to less code
Functional programming leads to more declarative programming where a programmer specifies what he wants to do rather than how to do it.
To illustrate, we will again use LINQ and C#. In this example we tell the compiler to sort, filter, and partition a list of elements without ever specifying how it is to be done. The logic to do so has been written once by the language developers and can be reused in a multitude of ways by passing in lambda expressions (closures).
List<Person> peeps = new List<Person> { new Person { Name = "Joe", Age = 29, Occupation = "Plumber" }, new Person { Name = "Mary", Age = 44, Occupation = "Teacher" }, new Person { Name = "Bob", Age = 56, Occupation = "Teacher" }, new Person { Name = "John", Age = 23, Occupation = "Teacher" }, new Person { Name = "Susan", Age = 39, Occupation = "Plumber" } }; var subset = peeps.OrderBy(p => p.Age).Where(p => p.Occupation == "Teacher").Take(2); foreach (Person p in subset) Console.WriteLine(p.Name + " "); // John Mary
Topical References
<references/>
Further Reading
Martin Fowler - Closure : provides a basic definition of Closures and gives a few very basic examples.
Neal Gafter - A Definition of Closures : gives a high-level, historical definition of Closures.
Jim Weirich - Top Ten Reasons I Like Ruby - Blocks and Closures : provides several examples of how Closures can be useful in Ruby.
Alan Skorkin - Closures - A Simple Explanation : gives a high-level overview of Closures using Ruby examples.
IBM Developerworks - Crossing borders : Closures : gives Ruby examples where Closures are useful.
Sam Danielson - An Introduction to Closures in Ruby : high-level overview of using Closures in Ruby.
C# in Depth - The Beauty of Closures : demonstrates Closures in C#.
Wikipedia - Closures : everything there is to know about Closures on one page.
ynniv - Closures in Python : examples implementing Closures in Python.
MSDN Magazine - Functional Programming .NET Development : examples of functional style with Closures in C#.
Eric Kidd - Some useful closures, in Ruby : demonstrates more examples of curried and specialized Closures in Ruby.
Reg Braithwaite - Closures and Higher-Order Functions : gives Ruby examples of specialized Closures.
Javascript Closures : demonstrates Javascript Closures in callbacks.