CSC/ECE 517 Fall 2012/ch1 1w7 am

From Expertiza_Wiki
Revision as of 21:38, 11 September 2012 by Arvaidya (talk | contribs)

Code Reuse - This page describes what code reuse is and how to achieve it, and discusses its advantages and disadvantages.

Definition

Code reuse is the use of existing code to perform a specific service regardless of the application in which it is used. It involves previously constructed software artifacts such as source code, libraries, and components. One example is the application of design patterns in a new context or a new development project.

Software reuse is the process of creating software systems from existing software rather than building software systems from scratch.<ref name = "definition_software_reuse" />

Overview

Code reuse is the idea that a partial computer program written at one time can be, should be, or is being used in another program written at a later time. The reuse of programming code is a common technique which attempts to save time and energy by reducing redundant work.

The software library is a good example of code reuse.

Programmers may decide to create internal abstractions so that certain parts of their program can be reused, or may create custom libraries for their own use.

The general practice of using a prior version of an extant program as a starting point for the next version is also a form of code reuse.

Some so-called code "reuse" involves simply copying some or all of the code from an existing program into a new one. While organizations can realize time to market benefits for a new product with this approach, they can subsequently be saddled with many of the same code duplication problems caused by cut and paste programming.
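As a small illustration of the difference between copying code and reusing it through an abstraction, the sketch below replaces two pasted copies of the same formatting rule with one shared method. All method names are hypothetical and chosen only for this example.

```ruby
# Copy-and-paste "reuse": the same formatting rule appears twice, so any
# change to the rule has to be made in both places.
def invoice_total_label(amount)
  "Total: $" + format("%.2f", amount)
end

def refund_total_label(amount)
  "Refund: $" + format("%.2f", amount)
end

# Reuse through an abstraction: the rule lives in exactly one method.
def money_label(label, amount)
  "#{label}: $#{format('%.2f', amount)}"
end
```

Both approaches produce the same text today; the difference only shows up later, when the formatting rule changes and the pasted copies have to be found and updated one by one.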

Many researchers have worked to make reuse faster, easier, more systematic, and an integral part of the normal process of programming. These are some of the main goals behind the invention of object-oriented programming, which became one of the most common forms of formalized reuse.

A somewhat later invention is generic programming.

Another, newer means is to use software "generators", programs which can create new programs of a certain type, based on a set of parameters that users choose. Fields of study about such systems are generative programming and metaprogramming.
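A minimal sketch of metaprogramming in Ruby, assuming a hypothetical Task class: instead of writing one predicate method per status by hand, the class generates them from a list using define_method.

```ruby
# Hypothetical example class; the status names are illustrative.
class Task
  STATUSES = %w[open closed].freeze

  attr_reader :status

  def initialize(status)
    @status = status
  end

  # Generate one predicate method per status: open? and closed?.
  STATUSES.each do |name|
    define_method("#{name}?") { status == name }
  end
end
```

Adding a new status to the list is then enough to get the matching predicate method, with no further code to write.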

History

Ad hoc code reuse has been practiced from the earliest days of programming. Programmers have always reused sections of code, templates, functions, and procedures. Software reuse as a recognized area of study in software engineering, however, dates only from 1968 when Douglas McIlroy of Bell Laboratories proposed basing the software industry on reusable components.

There are many dimensions along which we can trace the history of reuse:

  • Software Development Life Cycle (SDLC) Model
  • Prototyping, Iterative, ESP, etc.
  • Requirements/Specification Method
  • Business Model
  • Implementation technology

History of software reuse based on implementation technology dimension <ref name = history_pros_cons/>

  • Mid 1980’s - Mature Third Generation Programming Languages -

Third generation languages were more portable than the earlier machine-specific languages, even though source code still had to be language, OS and platform specific. They nevertheless achieved limited source code reuse for high-value code, e.g. math libraries.

  • Late 1980’s - Early Object Oriented Languages and SQL DB -

The introduction of object oriented languages like the Common Lisp Object System (CLOS), C++ interpreter, etc. provided encapsulation and packaging technology. Packaging and common object models promoted the development of language “bindings”. Systematic and repeatable object oriented development processes were also introduced during this time. Although most portability constraints remained as they were, some reuse success was achieved in the form of high value data structure libraries (e.g. GRACE and Booch parts). The first successful reusable service, the SQL RDBMS, was developed during this period.

  • Early 1990’s - Mature OO Languages, Source Code Libraries -

This era saw the development of mature OO languages like Smalltalk, ANSI CLOS, and C++ compilers, GUI and event-driven programming libraries, and portable compiled software packages like shared libraries and dynamic link libraries. Significant reuse research was carried out during this period (e.g. SPC CoE for Reuse/ARPA contract), which gave birth to early software reuse driven processes and reuse libraries (management systems, not just content).

  • Mid 1990’s - Early DIAE Component Packaging -

Dynamically Integrable Autonomously Executable (DIAE) components, which are compiled and linked separately, integrate at run time, and run independently of each other, were developed during this period (e.g. COM and CORBA). Distributed systems, which formed the foundation for Web Services and component-based development processes, were introduced. Advances like these also gave rise to commercially successful code generators like Netron and Frame Technology and reusable component vendors like Infragistics.

  • Late 1990’s - Mature DIAE Components, Cross-Protocol Bridges -

Along with the introduction of cross-protocol bridges, which allow COM and CORBA components to participate in the same system, lightweight integration protocols like SOAP were introduced in this period. New reusable component packaging mechanisms were invented. Reuse-based development processes matured, and a specialized market for reusable components became established.

  • 2000-2005 - Early Service Oriented Architecture Products -

Service Oriented Architecture: reuse of function (service), not code or component. Rediscovery of reuse research: Software Reuse (Asset) Management Systems (SRMS), domain-specific reuse, and domain languages (Software Factories). Reuse must reuse more than code. Reuse requires drastic process changes. Patterns: systematic definition of large-scale reusable components. Reuse successes: Microsoft .NET Framework; SRMS: Flashline and LogicLibrary.

Techniques <ref name = techniques/>

In reusing code, we have several levels of granularity:

Single lines of code

For example, in Ruby, to print an array containing three elements we simply use the following:

[ 'Sam', 'John', 'Gita' ].each {|name| print name, " " }

This outputs: Sam John Gita (each name followed by a space).

Procedures, Macros

Procedures and macros are low-level code reuse techniques. A macro is a group of repetitive instructions in a program that is written only once and can be used as many times as necessary.

Position MACRO Row, Column 
PUSH AX 
PUSH BX 
PUSH DX 
MOV AH, 02H 
MOV DH, Row 
MOV DL, Column 
MOV BH, 0 
INT 10H 
POP DX 
POP BX 
POP AX 
ENDM

A procedure is a collection of instructions to which we can direct the flow of our program. Once these instructions have executed, control returns to the line that follows the call to the procedure.


Adding Proc Near ; Declaration of the procedure 
Mov Bx, 0 ; Content of the procedure 
Mov Bl, Ah 
Mov Ah, 00 
Add Bx, Ax 
Ret ; Return directive 
Adding Endp ; End of procedure declaration 

Functions/methods

For example, in Ruby a method can be written as follows

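def convert_to_fahrenheit(temperature_in_celsius)
  # Use 9.0 so Ruby performs floating-point division; the integer expression 9/5 evaluates to 1.
  result = temperature_in_celsius * (9.0 / 5) + 32
  return result
end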

Components

When you are building software, objects are created in code, and reusable objects that perform specific services in different circumstances are called components. When you use Microsoft® Office to build custom applications, you write code that leverages the power of Office components. Using an Office component means you not only do not have to write the code yourself, but you are using a component that has been tested and found reliable in different conditions.

Object orientation

High level languages also come with effective abstractions to build and share "abstract data types" and other "reusable libraries". Source code components are generally intended to be used as black boxes: a public interface is announced, formally or not (Ruby has no equivalent of Java interfaces, for example), while the realization of this interface is kept hidden (or at least intended to be). Think of C libraries, Java's .jars, Python's modules, Ruby's gems, and so on. In many languages, "find a component, download it, and go" has become a reality!
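The black-box idea can be sketched in Ruby as follows. The TemperatureLog class and its methods are hypothetical; the point is only that callers depend on the public methods while the helper below the private keyword stays hidden.

```ruby
# Hypothetical component: callers use record and average only.
class TemperatureLog
  def initialize
    @readings = []
  end

  # Public interface: add a reading (returns self so calls can be chained).
  def record(celsius)
    @readings << Float(celsius)
    self
  end

  # Public interface: mean of all readings, or nil when there are none.
  def average
    return nil if @readings.empty?
    sum / @readings.size
  end

  private

  # Implementation detail; free to change without affecting callers.
  def sum
    @readings.sum
  end
end
```

Because sum is private, its implementation can be replaced later (for example with a running total) without touching any code that uses the component.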

Packages

A Java package is a mechanism for organizing Java classes into namespaces similar to the modules of Modula. Java packages can be stored in compressed files called JAR files, allowing classes to download faster as a group rather than one at a time. Programmers also typically use packages to organize classes belonging to the same category or providing similar functionality. A package provides a unique namespace for the types it contains. Classes in the same package can access each other's package-access members.

Modules

Modules are a way of grouping together methods, classes, and constants. An example of a Ruby module is as follows

module Trig
  PI = 3.141592654
  def Trig.sin(x)
    # ..
  end
  def Trig.cos(x)
    # ..
  end
end
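Besides grouping functions under a namespace as above, a Ruby module can also be mixed into several classes so that they share one implementation. The sketch below is illustrative; Describable, Book and City are hypothetical names.

```ruby
# Shared behaviour: relies on the including class to provide kind and name.
module Describable
  def describe
    "#{kind}: #{name}"
  end
end

class Book
  include Describable
  attr_reader :name

  def initialize(name)
    @name = name
  end

  def kind
    "book"
  end
end

class City
  include Describable
  attr_reader :name

  def initialize(name)
    @name = name
  end

  def kind
    "city"
  end
end
```

Both classes reuse the single describe method without being related by inheritance.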

Inheritance

In object-oriented programming (OOP), inheritance is a way to reuse code of existing objects, or to establish a subtype from an existing object, or both, depending upon programming language support. In classical inheritance where objects are defined by classes, classes can inherit attributes and behavior from pre-existing classes called base classes, superclasses, parent classes or ancestor classes. The resulting classes are known as derived classes, subclasses or child classes. The relationships of classes through inheritance gives rise to a hierarchy.

For example, the following C++ code establishes an explicit inheritance relationship between classes B and A, where B is both a subclass and a subtype of A, and a B can be used wherever an A is specified (via a reference, a pointer or the object itself).
class A 
{ public:
   void DoSomethingALike() const {}
};
 
class B : public A 
{ public:
   void DoSomethingBLike() const {}
};
 
void UseAnA(A const& some_A)
{
   some_A.DoSomethingALike();
}
 
void SomeFunc()
{
   B b;
   UseAnA(b); // b can be substituted for an A.
}
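For comparison, here is a rough Ruby counterpart of the C++ example, written as a sketch with the same class names. The return strings are an addition made only so that the behaviour is visible.

```ruby
class A
  def do_something_a_like
    "A-like behaviour"
  end
end

# B inherits every method of A and adds its own.
class B < A
  def do_something_b_like
    "B-like behaviour"
  end
end

# Written against A, but works with any object that behaves like an A.
def use_an_a(some_a)
  some_a.do_something_a_like
end
```

Calling use_an_a(B.new) runs the method inherited from A, so none of A's code has to be repeated in B.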

Generators

Parser generators (lex & yacc, antlr, rats!, treetop, pyPEG, and so on) are probably the best-known examples of application generators. Other examples include wizards that one can find in Integrated Development Environments (to design and generate code of user interfaces, for example), tools that generate classes from UML class diagrams, generators of database schemas, report generators, etc. The Noe library, discussed by the author of the cited article, is also a kind of application generator.
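A toy generator can be sketched with ERB from Ruby's standard library. This is an illustration, not how any of the tools named above work: generate_class_source and its template are hypothetical, and the generator simply turns a class name and a list of field names into Ruby source text.

```ruby
require "erb"

# Hypothetical generator: the parameters chosen by the user are the class
# name and the list of field names; the result is Ruby source code.
def generate_class_source(class_name, fields)
  template = ERB.new(<<~'SRC')
    class <%= class_name %>
      attr_reader <%= fields.map { |f| ":" + f }.join(", ") %>

      def initialize(<%= fields.join(", ") %>)
        <%= fields.map { |f| "@" + f + " = " + f }.join("; ") %>
      end
    end
  SRC
  template.result_with_hash(class_name: class_name, fields: fields)
end
```

The returned string can be written to a file or evaluated; the same template is reused for every class that follows this shape.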

Software architectures

Software architectures are large-grain software frameworks. In contrast with source code components, which are often black boxes, software architectures are designed as grey boxes: they are intended to be extended and provide specific extension points for this. We have a lot of reusable frameworks nowadays. Probably the most common are web frameworks (Ruby on Rails, Django, ASP.NET, ...), integrated development environments (IDEs; Eclipse is worth mentioning due to its architecture), service oriented architectures (SOA), and so on.
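The grey-box idea of extension points can be sketched in a few lines. ReportFramework and CsvReport are hypothetical names: the framework fixes the overall flow in run, and an application only fills in the methods marked as extension points.

```ruby
# Hypothetical miniature "framework": run fixes the control flow.
class ReportFramework
  def run(rows)
    [header, *rows.map { |row| format_row(row) }, footer].join("\n")
  end

  # Extension points with default behaviour.
  def header
    "REPORT"
  end

  def footer
    "END"
  end

  # Extension point that every application must supply.
  def format_row(_row)
    raise NotImplementedError, "subclasses must implement format_row"
  end
end

# Application code: overrides only what it needs.
class CsvReport < ReportFramework
  def header
    "name,score"
  end

  def format_row(row)
    row.join(",")
  end
end
```

The application reuses the framework's flow and default footer, and the framework calls back into the application's code at the extension points.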

Design and code scavenging

Design and code scavenging is simply a form of "find, copy-paste, adapt". The well-known design patterns[3] provide an organized form of design scavenging, by providing catalogues for applying this kind of reuse. Although invented in the context of object-oriented programming, design patterns have had a great impact far beyond that programming paradigm.

Code scavenging is less organized, as few such catalogues exist. Every day, however, the web gives us better ways to apply this reuse technique: Google Code Search, GitHub's gist, pastie, and so on. The authors of a book about Eclipse[4] encourage applying the "monkey see, monkey do" rule, that is, copy-pasting code from other Eclipse plugins whose source code is available.
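As one example of reusing a design rather than code, here is a minimal sketch of the Strategy pattern from the design pattern catalogues. PriceCalculator and the two discount rules are hypothetical; amounts are integers (for instance cents) to keep the arithmetic exact.

```ruby
# The calculator reuses one algorithm skeleton; the varying part is passed in.
class PriceCalculator
  def initialize(discount_strategy)
    @discount_strategy = discount_strategy
  end

  def total(amount)
    amount - @discount_strategy.call(amount)
  end
end

# Interchangeable strategies: any object that responds to call will do.
NO_DISCOUNT = ->(_amount) { 0 }
TEN_PERCENT = ->(amount) { amount / 10 }
```

What is reused here is the shape of the solution: the same arrangement of a context object and pluggable strategies applies to sorting rules, pricing rules, validation rules, and so on.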

Transformational systems

Transformational systems are the holy grail of computer science, nothing less! In that paradigm, software developers describe the behavior of the software using a high-level specification language (related to the VHLLs described below). In a second phase, the specifications are transformed in order to produce an executable system. The transformations are meant to enhance efficiency without changing the semantics.

Transformational systems emphasize the what instead of the how. They bet on the concision of declarative statements over procedural ones to achieve effective reuse. While general purpose transformation systems remain mostly research topics, notable results have been achieved in some specific domains: relational systems come with effective query optimizers, some rapid prototyping approaches use transformations from high-level descriptions down to code, etc.

Very High Level Languages (VHLL)

Very High Level Languages, also known as executable specification languages, are languages with a very high level of abstraction. They are somewhat difficult to capture precisely and may lead to software reuse that is very specific to specialized domains. However, the recent advent of Domain Specific Languages (DSLs) and good support for them in dynamic languages (Ruby, Python, Clojure, ...) can be seen as promoting and helping to build VHLLs.
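A small internal DSL shows why dynamic languages are a convenient host for this style. The RouteTable class below is hypothetical and only loosely inspired by routing files in web frameworks; instance_eval runs the block with the table as self, so the block reads like a declarative specification.

```ruby
# Hypothetical DSL host: collects path-to-handler mappings.
class RouteTable
  attr_reader :routes

  def initialize
    @routes = {}
  end

  # DSL keyword: declares what should be served, not how.
  def get(path, to:)
    @routes[path] = to
  end

  # Evaluates the block in the context of a fresh table.
  def self.draw(&block)
    table = new
    table.instance_eval(&block)
    table
  end
end

table = RouteTable.draw do
  get "/users", to: "users#index"
  get "/about", to: "pages#about"
end
```

After the block has run, table.routes holds the two declared mappings, and the code that interprets them can be reused for any number of route specifications.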

Best practices

Here are some of the best practices to be followed to make sure that the code that is being written is reusable.

  • The first consideration is to write code that uses a consistent naming convention, that is formatted properly, and that contains useful comments.
  • Examine the code to make sure that all the procedures have a single, specific purpose. Make sure that the procedure can be described in a short, plain sentence, for example, "This procedure accepts two numbers and calculates the product of those two numbers." If a procedure cannot be described simply and clearly in a sentence, then it probably does too many things. Break down complicated procedures into smaller ones that do one thing each. Procedures should contain only code that clearly belongs together.
  • Avoid making specific reference to named application objects.
  • Try to minimize the number of arguments in a procedure and pass in only what is actually required by the procedure. In addition, make sure the procedures use all the arguments passed to them.
  • Group related procedures and the constants they use together in the same module, and where appropriate, consider grouping related procedures together in a class module with a clearly defined interface.
  • Keep procedures in standard modules and not in modules behind forms or documents. The code in form modules should be only the code that is tied directly to the form itself and the code required for calling general procedures stored in standard modules.
  • Communicate between procedures by passing data as arguments to the procedures. Persist data by writing it to disk. Avoid using a procedure to write to a global variable so another procedure can read data from that global variable. Avoid communicating with another procedure by passing data out of the application, for example, using one procedure to write data to a disk file so another procedure can read that data.
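Several of the practices above can be illustrated together in a short sketch. The method names are hypothetical: each method has a single purpose that fits in one sentence, takes only the arguments it needs, and communicates through arguments and return values rather than global variables.

```ruby
# "This procedure accepts two numbers and calculates their product."
def product(a, b)
  a * b
end

# "This procedure calculates the sales tax on an amount at a given rate."
def sales_tax(amount, rate)
  amount * rate
end

# "This procedure calculates the total price of an order line, tax included."
# Data flows in through arguments and out through the return value.
def order_line_total(unit_price, quantity, tax_rate)
  subtotal = product(unit_price, quantity)
  subtotal + sales_tax(subtotal, tax_rate)
end
```

Because none of these methods touches global state or a named application object, each can be moved into another program unchanged.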

Advantages <ref name = history_pros_cons/>

  • Efficiency - Software reuse increases software productivity and decreases the time required to develop software by reducing the time spent on design and coding.
  • Standardization - By using the technique of software reuse, a company can improve software system interoperability and needs fewer people for software development. This provides a competitive advantage for the company and helps to produce better quality, standardized software.
  • Consistency - Reuse of UI widgets in MacOS and Win32 leads to common “look-and-feel” between applications
  • Debugging -
  • Reliability - Systems that incorporate a high level of reusable components are more reliable and more easily constructed. Reused design/code is often tested design/code.
  • Profit! - Software reuse technique helps the company to reduce the costs involved in software development and maintenance.
  • Reduced Risk - Software reuse also reduces the risk involved in software development process.
  • Component Market - Reuse can lead to a market for component software. Real-world examples: ActiveX components, Hypercard stacks, Java packages, even software tools, such as xerces and xalan from xml.apache.org (they are often included in other software systems)

Trade-offs for code reuse at various levels of components

Different types of application code require varying levels of investment to achieve successful reuse.

  • Reusable GUI objects reduce development time and improve quality and consistency but provide only modest payback in terms of overall application development costs.
  • Server-side components that constitute reusable business logic can provide significant payback but require extensive up-front analysis and design. They also require an architectural foundation but may experience a short shelf life.
  • Infrastructure components and services frameworks are generic services for transactions, messaging, security, and database connectivity. They eliminate the need to repeatedly build infrastructure that all applications use, but require extensive analysis and design, and complex programming. These standards-based components can often be purchased off-the-shelf.
  • High-level patterns allow organizations to achieve design reuse and identify components with high reuse potential, but developers must build or acquire the components.
  • Packaged applications provide the only guaranteed form of reuse, letting user companies acquire functionality for significantly less than the cost of building it themselves. However, these applications may not offer the exact functionality an organization needs; the subsequent customization that's required will add to the cost.

Practical problems <ref name = history_pros_cons/>

Although software reuse provides many benefits for the developers, there are some disadvantages.

Disadvantages for developers who reuse the code

  • Mismatch - Reused code or design may not completely match the situation. In that case, reusing the code may incur additional time and effort.
  • Non-functional characteristics of code may not match the situation - Consider a database that can scale to 10,000s of items, but you need it to scale to 100,000s of items.
  • Expense - Some components may be too expensive for the project’s budget. For instance, SGML (a precursor to HTML and XML) tools sell for 5000 dollars a license!

Disadvantages for developers who develop reusable code

  • Building reusable objects requires extensive analysis and design. The developer needs to understand and take into consideration all the generalities while coding the reusable module.
  • The developer needs to invest extra time in testing and quality assurance, optimization, and documentation. All this takes time and labor, which increases the cost of the code.
  • IT departments must also add to the payback equation the cost of tools to support reuse, such as version control and repositories.
  • The cost of administering an ongoing reuse program must be considered.

With all these elements, it becomes apparent that reuse doesn't come cheap.

Conclusion

The only way you can avoid code depreciation is through reuse but you need to be disciplined and follow a strict process. Reuse enables you to do more with less.

References

<references>
<ref name = techniques> http://www.revision-zero.org/reuse </ref>
<ref name = definition_software_reuse> http://www.biglever.com/papers/Krueger_AcmReuseSurvey.pdf </ref>
<ref name = cde> http://www.codeproject.com/Articles/7746/Value-of-Code-Part-II-Reusing-Code </ref>
<ref name = frameworks> http://www.inf.ufsc.br/~vilain/framework-thiago/p39-johnson.pdf </ref>
<ref name = history_pros_cons> http://www.cs.colorado.edu/~kena/classes/3308/f04/lectures/lecture10.pdf </ref>
<ref name = fgh> Software Reuse History - Thomas Pole </ref>
</references>