CSC/ECE 517 Fall 2009/wiki2 8 rop: Difference between revisions
m (→Example) |
|||
(20 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
== Reflection-oriented programming == | == Reflection-oriented programming == | ||
Traditional programming has always separated data from the instructions. Even though the memory system that stores them doesn't make any distinction between them, the way they are handled is the only differentiator. Data is | Traditional programming has always separated data from the instructions. Even though the memory system that stores them doesn't make any distinction between them, the way they are handled is the only differentiator. Data is ''processed'' while instructions are ''executed''. Reflection-oriented programming is a technique in which the program is made intelligent enough to modify their behavior[[http://cs.indiana.edu/~jsobel/rop.html 1]]. | ||
== Overview == | == Overview == | ||
Refection-oriented programming is a | Refection-oriented programming is a different paradigm which deals with understanding the computation as a task rather than a program. Conceptually, it can be thought as the program setting up a ''"watchdog"'' to observe the changes and self-modify accordingly based on the observations. One thing to be mentioned here is that this paradigm goes well with dynamically typed languages such as Ruby, Lisp etc. The core behind this paradigm is the flexibility of treating the instructions as data so that all the policies of data modification can be applied onto them. | ||
== Reflective Paradigm == | == Reflective Paradigm == | ||
Line 12: | Line 12: | ||
* ''complex'' : this category of operations/tasks involve multiple steps (or atomic operations) and can be interrupted in the middle of computation | * ''complex'' : this category of operations/tasks involve multiple steps (or atomic operations) and can be interrupted in the middle of computation | ||
The reason why these categories are brought up here is that, atomic operations being single step do not have a specific structure associated with them. However, the complex tasks/blocks lose their structure when a program is compiled in a non-reflective language. | The reason why these categories are brought up here is that, atomic operations being single step do not have a specific structure associated with them. However, the complex tasks/blocks lose their structure when a program is compiled in a non-reflective language. But, it is very much necessary to preserve its structure in reflective programs. The block can be thought of as a state-machine which has pre-defined states data will be in or transition to in case of specific events. This whole package is the metadata that is saved for preserving the structure or architecture of the program[[http://biophp.net/index.php/Reflective 2]]. | ||
== Paradigm implementation == | == Paradigm implementation == | ||
Line 18: | Line 18: | ||
== Purpose and Uses == | == Purpose and Uses == | ||
The basic purpose of reflection oriented programming is to introduce flexibility. Broadly the uses of reflection-oriented programming can be classified as: | The basic purpose of reflection oriented programming is to introduce flexibility. Broadly, the uses of reflection-oriented programming can be classified as[[http://en.allexperts.com/e/s/se/self-modifying_code.htm 3]]: | ||
* Optimization of a state dependent loop | * Optimization of a state dependent loop | ||
* Application of a specialized algorithm to a data set depending on the runtime data information | * Application of a specialized algorithm to a data set depending on the runtime data information | ||
Line 26: | Line 27: | ||
* Universal system for different situations (adaptation and code simplicity) | * Universal system for different situations (adaptation and code simplicity) | ||
Apart from these, there are huge number of uses and many unexplored uses too. One interesting observation is the use of self-modifying code to hide copy-protection in DOS applications. It can also be used to hide the presence of a program. This technique may prove harmful if a malware made use of this paradigm. For such reason, some Operating Systems such as OpenBSD have explicit control over self-modifying code. | Apart from these, there are huge number of uses and many unexplored uses too. One interesting observation is the use of self-modifying code to hide copy-protection in DOS applications. It can also be used to hide the presence of a program. This technique may prove harmful if a malware made use of this paradigm. For such reason, some Operating Systems such as OpenBSD have explicit control over self-modifying code[[http://en.allexperts.com/e/s/se/self-modifying_code.htm 3]]. | ||
== List of | == List of reflective programming languages and platforms == | ||
Some of the languages that have inherent reflection oriented programming support are: | Some of the languages that have inherent reflection oriented programming support are[[http://en.allexperts.com/e/l/li/list_of_reflective_programming_languages_and_platforms.htm 4]]: | ||
* APL | * APL | ||
Line 55: | Line 56: | ||
== Example == | == Example == | ||
An example of a self-modifying pseudo-code for Optimization of a state dependent loop is shown: | An example of a self-modifying pseudo-code for Optimization of a state dependent loop is shown [[http://en.allexperts.com/e/s/se/self-modifying_code.htm 3]]: | ||
Non-reflective approach: | Non-reflective approach: | ||
<code> | |||
repeat N times | |||
{ | |||
if STATE is 1 | |||
increase A by one | |||
else | |||
decrease A by one | |||
do something with A | |||
} | |||
</code> | |||
The optimized version of the pseudo-code with a reflective approach is: | |||
<code> | |||
repeat N times | |||
{ | |||
increase A by one | |||
do something with A | |||
} | |||
when STATE has to switch { replace the opcode "increase" above with the opcode to decrease } | |||
</code> | |||
== | == Definitions == | ||
; atomic : An atomic operation in computer science refers to a set of operations that can be combined so that they appear to the rest of the system to be a single operation with only two possible outcomes: success or failure. | |||
; metadata : Metadata is "data about data", of any sort in any media. It aids in clarifying and finding the actual data. | |||
; malware : Malware, short for malicious software, is software designed to infiltrate a computer without the owner's informed consent. | |||
== References == | == References == | ||
# http://cs.indiana.edu/~jsobel/rop.html | # [http://cs.indiana.edu/~jsobel/rop.html An Introduction to Reflection-Oriented Programming] | ||
# [http://biophp.net/index.php/Reflective Reflective] | |||
# [http://en.allexperts.com/e/s/se/self-modifying_code.htm Self-modifying code] | |||
# [http://en.allexperts.com/e/l/li/list_of_reflective_programming_languages_and_platforms.htm List of reflective programming languages and platforms] |
Latest revision as of 20:17, 13 October 2009
Reflection-oriented programming
Traditional programming has always separated data from the instructions. Even though the memory system that stores them doesn't make any distinction between them, the way they are handled is the only differentiator. Data is processed while instructions are executed. Reflection-oriented programming is a technique in which the program is made intelligent enough to modify their behavior[1].
Overview
Refection-oriented programming is a different paradigm which deals with understanding the computation as a task rather than a program. Conceptually, it can be thought as the program setting up a "watchdog" to observe the changes and self-modify accordingly based on the observations. One thing to be mentioned here is that this paradigm goes well with dynamically typed languages such as Ruby, Lisp etc. The core behind this paradigm is the flexibility of treating the instructions as data so that all the policies of data modification can be applied onto them.
Reflective Paradigm
Reflected-oriented programming is the concept used to write Reflective programs. This allows the program architecture to be decided during the course of execution and is not "hard-wired" to the procedural structure. This paradigm supplements the already-existing object-oriented programming to enhance it with the self-modifying-code feature. The control flow is something which is decided at runtime; this is important since dynamically altered program can change course of execution of a particular event due to modification. Hence, this paradigm requires that the program contain sequence of decisions be implemented rather than the strictness of decision, based on the run-time data.
On a broad perspective, tasks can be classified into:
- atomic : this category of operations/tasks cannot be interrupted and usually consist of a single step
- complex : this category of operations/tasks involve multiple steps (or atomic operations) and can be interrupted in the middle of computation
The reason why these categories are brought up here is that, atomic operations being single step do not have a specific structure associated with them. However, the complex tasks/blocks lose their structure when a program is compiled in a non-reflective language. But, it is very much necessary to preserve its structure in reflective programs. The block can be thought of as a state-machine which has pre-defined states data will be in or transition to in case of specific events. This whole package is the metadata that is saved for preserving the structure or architecture of the program[2].
Paradigm implementation
In most of the languages as the code is compiled/assembled the structure of the program is lost. However, the key to implementing this programming paradigm is to preserve the structure so that the program can be analyzed and modified accordingly. To enable this, the program structure is preserved as metadata in the code. This ensures that even though the compiled/assembled code doesn't maintain the organization, in favor of optimization mostly, the code can be made self-modified. There are some languages like Lisp which do not separate compile-time and run-time. In such cases, there is no difference in the code.
Purpose and Uses
The basic purpose of reflection oriented programming is to introduce flexibility. Broadly, the uses of reflection-oriented programming can be classified as[3]:
- Optimization of a state dependent loop
- Application of a specialized algorithm to a data set depending on the runtime data information
- Obfuscating the code to prevent reverse-engineering
- Protection of code from malware
- Decompression of code at runtime
- Universal system for different situations (adaptation and code simplicity)
Apart from these, there are huge number of uses and many unexplored uses too. One interesting observation is the use of self-modifying code to hide copy-protection in DOS applications. It can also be used to hide the presence of a program. This technique may prove harmful if a malware made use of this paradigm. For such reason, some Operating Systems such as OpenBSD have explicit control over self-modifying code[3].
List of reflective programming languages and platforms
Some of the languages that have inherent reflection oriented programming support are[4]:
- APL
- ColdFusion MX
- Curl
- Delphi
- ECMAScript (ActionScript, DMDScript, JavaScript, JScript)
- Java
- Lisp
- Logo
- C#
- Visual Basic .NET
- Objective-C
- Perl
- PHP
- Pico
- Pliant
- POP-11
- Prolog
- Python
- Ruby
- Scheme
- Smalltalk
- Tcl
Example
An example of a self-modifying pseudo-code for Optimization of a state dependent loop is shown [3]:
Non-reflective approach:
repeat N times
{
if STATE is 1
increase A by one
else
decrease A by one
do something with A
}
The optimized version of the pseudo-code with a reflective approach is:
repeat N times
{
increase A by one
do something with A
}
when STATE has to switch { replace the opcode "increase" above with the opcode to decrease }
Definitions
- atomic
- An atomic operation in computer science refers to a set of operations that can be combined so that they appear to the rest of the system to be a single operation with only two possible outcomes: success or failure.
- metadata
- Metadata is "data about data", of any sort in any media. It aids in clarifying and finding the actual data.
- malware
- Malware, short for malicious software, is software designed to infiltrate a computer without the owner's informed consent.