Copyright (c) 1997 by the University of Washington. All rights reserved. This is a proprietary and confidential document. Under no circumstances may this document be used, modified, copied, distributed, or sold without the express written permission of the copyright holder.

Guide to the Etch Source Tree

Introduction
Major Components of Etch: Overview
The Etch Directory Hierarchy
Visual Etch Graphical User Interface
DLL Discovery (dllwatch)
Module Management
Executable File Interfaces
x86 Disassembler
Code Discovery
Etch Rewriting Engine
Import Table Patching (dllwalk)
Etch Runtime
Hardware Performance Counter Interface

1. Introduction

This document describes the organization of the Etch source tree. The intended audience is developers who are evaluating the feasibility of using, modifying, extending, and improving Etch. This document therefore provides answers to questions like:

What is the overall structure of the implementation of Etch?
What are the major interfaces and data structures used by Etch?
What directories and files contain the implementation of specific components of Etch?

This guide is only one part of the documentation of Etch. For a picture of Etch from the user's point of view, consult the Visual Etch User's guide. For more detailed information about the implementation of specific components of Etch than provided in this document, consult the documentation in the source files themselves.

2. Major Components of Etch: Overview

Etch is composed a number of major components, introduced in this section. To understand the role of these components, it is easiest to describe the typical steps that occur when a program is etched. Consider, for example, what happens when the user applies a tool to an executable and the DLLs that it uses, and runs the resulting etched program.

First, the user invokes vetch, bringing up the Visual Etch graphical user interface. Before instrumenting the program, the user uses DLL Discovery (the DLLwatch tool) to identify the full set of DLLs imported by the application.

After using Visual Etch to select the application, tool, and any special parameter settings, the user then clicks "Execute Action". This passes the user-selected settings to the Module Management routines, which perform three major tasks: instrumenting the binary and each of the selected DLLs, creating the environment in which to run the etched program, and presenting the results.

Each module (the binary and each DLL) goes through several phases as it is rewritten. All of the phases rely on the Executable File Interfaces to provide an abstract interface to the Windows32 executable file format, and on the Disassembler to provide an abstract interface to the x86 instruction set. Code Discovery analyzes the control flow of the program and separates code from data. Then the main rewriting loop of the Etch rewriting engine iterates through the source program, making appropriate callbacks to the tool instrumentation code. Finally, after all the modules are rewritten, the import tables are patched to reflect the names of the etched modules by the Import Table Patching (dllwalk) tool.

As the etched program runs, Etch Runtime code is invoked as necessary to emulate the environment of the original program and to support to the tool runtime code. The tool runtime code may also make use of the interface to the Hardware Performance Counter Interface.

3. The Etch Directory Hierarchy

etch/UI: Visual Basic files comprising Visual Etch.

etch/apps/dllwatch: Source for the dynamic DLL discovery tool, DLLwatch.
etch/apps/bin: Compiled tool instrumentation and runtime DLLs. (Populated when Etch is built.)
etch/apps/: Source code for Etch tools, and some support libraries for these tools. There is one directory per tool or library.

etch/auxbin: Binaries for some required 3rd-party utilities used by Etch, such as Perl.

etch/auxbin/Perl-License: Copy of the Perl license.

etch/bin: All of the executables and DLLs that comprise Etch, as well as some of the Perl and batch scripts, that implement module management (i.e., invocation of etch.exe on behalf of the user interface).

etch/bin/scripts: All the perl scripts used by Visual Etch to encode the tool-specific behavior.

etch/decoder

etch/include

etch/hwperf: User-level code to access P5 and P6 hardware performance counters.

etch/hwperf/drivers: NT and Windows 95 device drivers to access P5 and P6 hardware performance counters.

etch/hwperf/drivers/bin: Pre-built versions of those drivers.

etch/instrument: Sources for the Etch rewriting engine and runtime libraries and other support programs.

etch/util: Utility programs and libraries.

etch/util/perl/Etch-extensions: Extensions to Perl to allow scripts to use NT API calls that deal with events and file handles.

etch/util/pipe: Library for communication between Visual Etch and Perl.

etch/util/simpledialog: Stand-alone dialog box, used by etchwrap.dll to alert user to runtime failures.

etch/util/sysinfo: Utility program to report OS and CPU type.

4. Visual Etch Graphical User Interface

Visual Etch is the GUI front-end to Etch. It allows the user to select a program to transform, select the transformation ("tool") to apply, and customize set of DLLs to transform. It invokes Etch via the Module Management component.

Visual Etch provides several major services:

As a GUI, it lets the user specify experiment parameters:
- the application to transform,
- the tool to apply,
- tool options,
- a customized set of DLLs to transform,
- per DLL specialized etch options, and
- selectively picking the set of procedures to etch
It manages these parameters through projects
It provides a mechanism for the tool scripts to provide feedback to the user.
It takes the output of the tools and presents it to the user.
It manages the input and output of Etch and the tool scripts in a simulated console.

Main User Interface files

This module provides the basic components of what the user sees. The main UI screen is in UI/formSimpleUI.frm. The parameters file that affects this interface is bin/params.mdb. Tool specific options are handled by UI/formParams.frm, while per module etch options are handled by UI/formdllparams.frm. Finally, the UI for specifying selectively etched procedures is in UI/formProcOptions.frm. The options are kept track in projects through the Project Management module.

Directory: etch/UI, etch/bin
Files:
- formSimpleUI.frm: This is the main form. Aside from defining the actions associated with the form, we also define properties that require the tool name in this form since it is the one that knows about tool names.
- formStartup.frm: form that gets shown at the very start of Visual Etch.
- formParams.frm: Handles tool specific parameter passing. Each tool can ask for specific things from the user and this is the code that allows this flexibility. An example is the machine options dialog box.
- formProcOptions.frm: This form handles the interface to selective instrumentation. It is a little bit tricky because we want changes to the text file modulename.options to be reflected to the interface.
- formdllparams.frm: This is the Module Parameter dialog box. It takes per module etch parameters and updates the project file with these parameters. It's set up to be general so that various parameters can be put into params.mdb and the form would be able to handle it correctly.
- formList.frm: This is a form that just displays a list of names
- params.mdb: contains several tables that control Visual Etch. These are described in more detail in the Visual Etch API document. The tables included are: 1) toolsparam - contain parameters for the different tools that show up in the tools list box in the main UI window. 2) Etch - the rows give the etch flags that show up in the module windows 3) Module EtchFlags - the default etch flags for "special" modules in NT. These are usually system modules that should be etched in a special way. 4) Module EtchFlags 95 - the same a Module EtchFlags but for Win95. 5) CPU - default CPU parameters given in "Computer Parameters" dialog box

Project management

This module keeps track of user specified options through project files.

Directory: etch/UI, etch/bin
Files:
- classProject.cls: Encapsulates the project file concept. It keeps track of the state associated with the project and makes sure that what is visible to the user is reflected in the underlying .vpr file and vice-versa.
- classModuleParams.cls: This class encapsulates the things needed to keep a per module list of etch parameters
- ProjectTemplate.mdb: A template for a project file. Contains three tables: 1) project - contains project parameters 2) etchflags - contains per module etch options (e.g. shouldetch, fastcontextswitch, etc..) 3) debuginfo - debug info for the different modules and tools

Communication with tool scripts

This module sets up the communication channel to talk with the tool scripts. It also takes care of parsing and acknowledging the communication with the scripts and carrying out the commands that the tool scripts give the UI. When the tool scripts ask to show their output, this module passes it over to the "Showing tool output" module.

Directory: etch/UI
Files:
- formMeasureCheckList.frm: This is the "running experiment" dialog box. It handles reflecting the status of the run back to the user.
- RPC.bas: This module contains the routines that parses the messages sent by the tool scripts and converts these into actions taken by Visual Etch.

Showing tool output

This module takes care of showing the output of tool scripts.

Directory: etch/UI
Files:
- formDisplayFile.frm: Displays a text file and allows the user to copy it to a different file.
- formGraph.frm: This form shows a graph given an input file. You can choose between the different kinds of graphs provided by Visual Basic.
- Results.frm: This form controls the dialog box with the table and the bar graph.

Console Management

This module takes care of taking console input and output when an etch experiment runs. This allows the user to view the diagnostic messages that etch and the tool scripts print out occasionally. It also allows the user to interact with console applications like Perl.

Directory: etch/UI, util/pipe
Files:
- classPipe.cls: This class encapsulates the mechanism by which the tool script and Visual Etch communicate. That is the two anonymous pipes through which the tools send messages to Visual Etch and through which Visual Etch sends acks to the tool scripts. It also takes care of synchronizing the script and Visual Etch through events.
- pipe.c: pipe.c - implements PipedShell command which emulate popen on the Unix platform
- sync.c: This module is meant to convert any read into a signalable event, since NT does not support the select system call
- formConsole.frm: This form handles emulating a console Window. It takes the output to stdin and stdout of a console processes and shows these in a window.

Command line parsing

This module takes care of parsing the command line sent to Visual Etch and of running the commands as specified from the command line interface.

Directory: etch/UI
Files:
- CommandLine.bas: Module that takes care of parsing and interpreting the command line arguments that gets passed to Visual Etch.
- classMacro.cls: class that implements a very simple Macro language for the UI. This allows an OLE call into a classMacro object which can control how the UI works. This is also how we run things from the etch command line --- CommandLine.bas calls into the routines defined in this class module.

Auxiliary routines

This module just contains auxiliary routines that the other modules require to perform their functions. Visual Etch calls getmodules.pl to get the closure of the modules that are implicitly imported by the list of known modules.

Directory: etch/UI, etch/bin
Files:
- Main.bas: - This module contains the routines that are called when Visual Etch starts up and when Visual Etch is shut down. i.e. initialization and cleanup routines - It also contains the routines that take care of global variables that don't require the tool name.
- module2.bas: This module contains utility routines that the other modules use to do small tasks.
- formDebug.frm: Just a plain old text box where we print debugging messages.
- formDirectory.frm: This is a form that lets you select a directory.
- formCopy.frm: This form shows the "View and Copy Output and Log Files" dialog box and handles actions associated with this.
- getmodules.pl: - script for traversing through the dependencies of different DLLs given a list of "root" dlls - starts with the set of "root" dlls and calls dllwalk on each of these

5. DLL Discovery

DLLwatch reports the set of DLLs loaded by a running application. It simply runs the target application as a debuggee process, and is notified by the operating system for each DLL loaded by the application. It logs the resulting DLL names to a file, which is post-processed by the Perl script dllwatch.pl.

Directory: etch/apps/dllwatch
Files:
- dllwatch.cxx: Run the program specified on the command line, and report all DLLs loaded.
- dllwatch.pl: Driver script for the DllWatch tool.
- etch-dllwatch-filter.bat: Front end to etch-dllwatch-filter.pl, for filtering DllWatch output when run from the command line.
- etch-dllwatch-filter.pl: Filters dllwatch output -- invoked from command-line after user runs dllwatch outside the UI. Simply runs the dllwatch postprocessing code in dllwatch.pl against the raw output of dllwatch, and leaves it the right place

6. Module Management

This component manages the instrumentation of an application and its DLLS given a tool name, an application, a set of modules to be transformed, and any special options. It is invoked by Visual Etch, and invokes etch.exe (multiple times, if necessary).

Visual Etch calls coordinator.pl, which parses the project files that Visual Etch provides and calls tool specific scripts that are mentioned in the file list. The tool scripts in turn can invoke Perl library routines provided by this module to:

parse the actions required of it (Etch only, Execute Action, or Run Only) through action.pl,
interact with Visual Etch through the routines in message.pl, and
run etch.exe and the application through routines in the Running Etch and the Etched Application module.

At the end of the run, coordinator.pl calls routines in log.pl to create a log file.

Interacting with Visual Etch

These files manage the interaction with Visual Etch.

Directories: etch/bin/scripts, etch/bin
Files:
- coordinator.pl: Perl script for coordinating processes between the Visual Etch UI and the per tool scripts. coordinator.pl sets up the environment and global variables that the tools require and calls the tool scripts. After the tool scripts are done running, it copies over the project files to the experiment folders and dumps etch.log into it
- message.pl: perl script for sending messages to the UI
- action.pl: contains routines that tell the tools what kind of actions are expected from it
- list: contains a newline separated list of perl scripts that contain routines to implement tool actions. coordinator.pl reads this file and "require"'s each of the perl file listed.

Running Etch and the Etched Application

This module contains support routines needed to run etch.exe with the appropriate arguments for the options that the user specified.

The user specifies options through Visual Etch. Visual Etch puts these options into text files which coordinator.pl parses into perl arrays. The routines in this module use these perl arrays to determine the appropriate flags to Etch.

Directories: etch/bin/scripts
Files:
- cvdump.pl: Contains routines for managing debug files and selective etching option files
- etch.pl: Contains routines that help in etching applications
- run.pl: contains routines for running the etched executable

Tool specific scripts

These are tool specific scripts which implement the actions required to implement the tool including pre- and post-processing. simple.pl contains simple scripts that are good to use as templates for simple tools. cgprof.pl contains an example of more complicated tool that requires extensive pre- and post-processing. These scripts are included by coordinator.pl through the file list.

Directories: etch/bin/scripts
Files:
- simple.pl: Contains scripts for tools that use etch in a straight forward manner: i.e. call cvdump (if debug info is used), call etch, run program.
- cache.pl: Script for cache simulator and the cache animation tools. Actions for both tools are identical, so both routines just call cache_common which does all the work.
- fasticount.pl: Subroutines for invoking and running the fasticount tool. The fasticount tool is an instruction counting tool that uses basic block boundaries to do the counting. It can also be configured to use procedure callbacks or inlined instructions. This is the script module of the tool.
- cgprof.pl: This file contains scripts used to invoke the two CGProf actions: Call Graph Profile - cycle counts Call Graph Profile - instruction counts
- xprof.pl: Contains scripts for optimization-related tools: Profile For Code Layout Optimize for Code Layout Measure Optimized Performance
- dllwatch.pl: Driver script for the DllWatch tool.
- monitor.pl: Subroutines for invoking and running the monitor tool. The monitor tool counts instrumented and uninstrumented instructions in the etched application. It also captures all DLL load events. This is the script module of the tool.
- instrcheck.pl: Script to do the "actions" with the coverage tool (fast, approx) is selected from within Vetch.
- perfutil.pl: Contains scripts to measure program performance using the hardware performance counters.
- sanity.pl: Code discovery sanity check tool. Uses instrumentation information and runtime information to count instruction, basic block, and procedure entry/exit points. These counts are performed in numerous independent ways to validate the various instrumentation points. This is the script module of the tool.

Auxiliary routines

Directories: etch/bin/scripts
Files:
- dirs.pl: contains routines associated with cache directories -- $CACHEDIR and $WORKINGDIR
- log.pl: Routines for dumping out etch.log to disk
- util.pl: utility routines

7. Executable File Interfaces

The executable file interfaces consist of C++ classes that represent a Win32/x86 executable file, including file headers, import and export tables, relocation information, and debugging information.

Executable File Header

These classes provide an interface to the Windows 32-bit executable file header. They specify both the position of sections in the executable file itself, as well as the position of the sections as mapped into the address space of Etch during instrumention. (As a result, all translations between file offsets and virtual addresses in the input executable file are mediated by the ExeFile class.) They allow manipulation of the file header, including methods to add sections; change the entry point; change the location of the relocation, import, and export tables; and write the header to a file.

Directories: etch/instrument
Files:
- exefile.cxx: Classes that describe executable files that are read-only or modified in place. Writable executable files are implemented in outexefile.cxx
- exefile.h: Routines to read, modify, and write PE (32-bit executable) file headers.
- outexefile.cxx: Routines to modify the executable file header and to write it to a stream.

Import Tables

These classes represent Windows 32-bit executable import tables. In addition to allowing the import tables to be read, these classes support "wrapping" (replacing selected functions with calls to similarly named functions in a wrapper DLL, a la etchwrap.dll); and adding the export table for a specified DLL to the import table (for example, adding the routines exported by a tool runtime DLL to the etched program's import table).

Directories: etch/instrument
Files:
- import-inst.cxx: contains functions which determine if an instruction is a DLL call statement
- import-inst.h: functions which determine if an instruction is a DLL call statement
- import.cxx: Classes describing PE executable file import tables and how to read, query, modify, and write them. Based on the data structures defined in WINNT.H.
- import.h: Classes describing PE executable file import tables and how to read, query, modify, and write them.
- oldimport.cxx: This is an ancient implementation of import tables that we keep around provide a way of rewriting import tables for dllwalk when it is patching import tables.
- outimport.cxx: Implements the part of import tables that have to do with modifying or writing the table.

Export Tables

These classes represent a Windows 32-bit executable's export table, allowing the export table to be scanned for exported functions and functions "forwarded" to other DLLs.

Directories: etch/instrument
Files:
- outexport.cxx: Implements the part of export tables that have to do with modifying or writing the table.
- export.cxx: Functions to manipulate export tables.
- export.h: Representation of executable file export table.

Relocation Information

These classes represent the relocation records in the executable file as a simple array of relative virtual addresses. Etch uses relocation records in order to identify pointers in the original program. For example, this is used in code discovery to identify possible procedure pointers. In addition, Etch must update the relocation records correctly and augment the relocation records to reflect any new pointers added in order for the new program to run correctly if it is relocated at load time.

Directories: etch/instrument
Files:
- reloc.cxx: Classes representing relocation records.
- reloc.h: Classes representing relocation records.

CodeView Debug Information

This code provides an interface to the NB09 debugging information in a binary, if present. Used to associate names with procedures and as a source of additional information for code discovery.

Directory: etch/instrument
Files:
- cv.h: Constants and data structures used in the parsing of CodeView debugging information (version NB09).
- cvdump.cxx: This program is used to locate and dump the contents of the Codeview debugging information contained in an executable or dll. It knows how to read Codeview version NB09.
- debuginfo.cxx: The DebugInfo class is used to store and describe information extracted from the debug section of each module about the procedures contained withing that module - the name, the start and end address, the unique procedure number assigned by Etch, etc...
- debuginfo.h: Data structures for storing the extracted debug information.

8. x86 Disassembler

Standalone x86 decoding library. The instruction information is used by code discovery, the main rewriting loop, and tool writer instrumentation code. The decoding of x86 instructions is driven by tables. The instruction opcodes index into these tables. The main parsing tables are defined in maps.c. An instruction eventually maps to an entry that gives the mnemonic, number of operands, and operand encodings. For example, an instruction that starts with 0x01 maps to the entry {ADD,2,"Ev","Gv",0}. The operand encodings have the same meanings as the addressing and operand type encodings of the opcode maps in the Intel manuals.

To determine the operand read/write type, dcinfo_map (maps.c) has an entry for each instruction mnemonic (DCOpcode_t in instruction.h) indicating whether the operand is read, written, or both.

Directory: etch/decoder
Files:
- decode.h: Call to return a structure with a decoded x86 instruction.
- decode.c: Call to return a structure (DCInstruction_t) with a decoded x86 instruction. How do we do it? The instruction bytes that we are munging index our opcode maps (in maps.c). These maps are from the Intel manuals. Each entry in the table gives us a opmap_t: {mnemonic, # of operands, operand strings...} The operand strings are operand addressing mode encodings, similar to those in the manuals. They are defined in maps.h.
- instruction.h: Define the decoded instruction structure (DCInstruction_t) and its associated fields, the addressing modes, our instruction mnemonics, and some macros for parsing DCInstruction_t
- instruction_api.h: Define macros to access a DCInstruction_t structure.
- machine.h: Define x86-specific enumerations and register set structure.
- maps.h: Define x86 instruction operand structures and maps/tables to decode x86 instructions.
- maps.c: Define x86 instruction decoding maps/tables.

Testing Support

We provide a partial disassembler, disasm.exe, to help test the decoder. Its input is the raw text section of a program. It will print to stdout how we decoded the instructions. The output format is very similar to "dumpbin /disasm".

What is missing to make this program truly standalone and disassemble executables/dlls is to provide a simple wrapper inside of disasm.exe that reads in an executable and points the print routine to the start and end of various text sections.

Directory: etch/decoder
Files:
- disasm.c: Our version of the disassembler. It uses the decoder to decode and this just prints that structure to stdout. It takes as input raw text. What is missing is the code to walk an exe and point it to text sections. The main use of this is to see if we are decoding correctly.

Adding instructions

To add instructions to the decoder, three modifications have to be made. The first is to add the appropriate entries in the opcode maps in maps.c. If the mnemonic does not already exist for it in the DCOpcode_t enumeration (instruction.h), add it. If a mnemonic was added, an entry must be added (in the same order as DCOpcode) for the instruction in dcinfo_map (maps.c). The function check_dcinfo() checks to see that dcinfo_map is in the same order as DCOpcode_t.

Directory: etch/decoder
Files:
- decode.h
- decode.c
- instruction.h
- maps.c

9. Code Discovery

Distinguishing Between Code and Data

The primary purpose of code discovery is to distinguish between code and data in the text sections of a binary. Code discovery is coordinated by the routine discover.cxx:DiscoverCode. Functionally, DiscoverCode initializes code discovery using traverse.cxx:TraverseInit, and then invokes code discovery on various entry points in the binary. These entry points include the entry point of the binary (TraverseAll), export entries (TraverseExportEntries), relocations in the text and data sections (TraverseCodePointersInText and TraverseCodePointersInData), and procedure entries from debugging info (TraverseDebugInfo). Once all entry points have been processed, the main Etch engine finalizes code discovery using TraverseCollapse, and the results of code discovery can now be used by the rest of the Etch engine.

If code discovery is set to its most aggressive mode (which it is by default), then, just before TraverseCollapse is invoked, DiscoverCode also invokes TraverseUnknownByteSequences. TraverseUnknownByteSequences examines the remaining unknown byte sequences in the binary's text section, and heuristically tries to determine whether those unknown byte sequences contain code or data.

Internally, the code discovery engine maintains a data structure that keeps track of the state of every byte in all text sections of the binary. The state of a byte can be kCode, kData, or kUnknown, corresponding to code discovery's estimate of whether the byte is code, data, or status still unknown. This data structure is an array of packed longs stored in gDataBitArray, with two bits used to represent a byte in a text section. Accessing and setting the state of a byte are handled by two procedures, ExtractBits and InstallBits. A set of macros, AddrIsCode, AddrIsData, AddrIsUnknown, AddrIsKnown, and MarkAddrIsCode, MarkAddrIsData, MarkAddrIsUnknown, is used as a set of convenience functions for manipulating the entries of the array.

Basic Blocks and Procedures

Another purpose of the code discovery engine is to determine the boundaries of all basic blocks and procedures in a binary's text section, in particular when no debugging information is available. Code discovery accomplishes this in two steps. First, as the code discovery engine discovers instructions, it records information from particular instructions that helps it reconstruct basic block and procedures. For basic blocks, code discovery records the targets of all control flow instructions in an array named gControlTargets. For procedures, code discovery records the targets of all call instructions in an array named gCallTargets.

Once code discovery is finished, it then uses the recorded information to reconstruct basic block and procedure boundaries. The boundaries are reconstructed using the two procedures DetermineBasicBlockBoundaries and DetermineProcedureBoundaries, both of which are invoked from TraverseCollapse.

Directory: etch/instrument
Files:
- discover.h: Top level interface to code discovery.
- discover.cxx: Driver routines for code discovery.
- traverse.cxx: The code discovery engine. Exports routines for traversing the text sections of a module given various entry points (module entry point, exports, relocations, etc.) and classifying the bytes in the text sections as code or data. Also determines basic block and procedure boundaries in discovered code.
- traverse.h: The code discovery engine. Exports routines for traversing the text sections of a module given various entry points (module entry point, exports, relocations, etc.) and classifying the bytes in the text sections as code or data. Also determines basic block and procedure boundaries in discovered code.
- finddata.cxx: Keep track of embedded pointers to code used to refer to data.
- finddata.h: Keep track of embedded pointers to code used to refer to data.

10. Etch Rewriting Engine

The main engine for Etch takes as input a binary program, tool-specific callbacks (as described in the Etch tool-writers API). It produces as output a transformed binary. The engine is divided into the following subcomponents:

a top level driver, which calls into other modules as needed to generate each of section in the output binary;
the text section rewriter, which iterates over the instructions in the input binary's text section(s), transforming them and writing them to the output binary, but leaving data unmodified;
instruction transformations, which applies transformations to a single instruction;
the output stream, which writes the results of transformations to the output file while invoking label management and pointer management as needed to allow resolution of forward branches and generation of relocation information in a second pass.
label management, which manages unresolved forward references;
pointer management, which manages the mapping between pointers in the original and transformed binaries.

Top Level Driver

After argument parsing and initial setup, the rewriting process is directed by the driver routine program.cxx:PatchProgram. PatchProgram creates InProgram and OutProgram objects, which contain all the components of the input and output binaries (file headers, relocations, imports, exports, and so on). PatchProgram also invokes code discovery, if necessary.

Finally, PatchProgram calls OutProgram::WriteFile, which writes the file header directly to the output stream, and then writes each section ("EtchSection" or a subclass) of the output file. When the EtchedTextSection is written out, it calls TransformAllInstructions to invoke the text section rewriting subcomponent. OutProgram::WriteFile then rewrites the file header to account the actual section sizes, and finishes by calling into the label management subcomponent to resolve any unresolved labels.

(The top-level also calls program.cxx:PatchRuntime to create a custom version of etchrt.dll that includes calls to code inserted by the tool's InstrumentProgram callbacks; and if necessary, calls program.cxx:CreateHusk if the -husk option was specified on the command line.)

Directory: etch/instrument
Files:
- etchmain.cxx: Simply invokes etch top-level routine. We split it out into a separate routine so we have just a single entry point into etchlib.dll, namely EtchTopLevel
- etch.cxx: The main program for etch. Calls the command line processing routines, sets up some global state, and then calls the top-level program rewriting routines in program.cxx to actually perform the transformations.
- etch.h: Definitions of a few string constants -- output section names and tags to leave in the output binary to help out monitor.
- program.cxx: Top-level classes that read and write PE format binaries, and top-level routines to rewrite a binary, including all of the sections of the output binary.
- program.h: Top-level routines to rewrite a binary.
- husk.cxx: Implementation of "husks" (see husk.h for details). Experimental.
- husk.h: Experimental classes that support creating a 'husk', a shadow of the original binary that redirects upcalls to hardwired entry points from the shadow to the etched binary.
- program-private.h: Classes that describe a binary being transformed. Implements top level routines exported by program.h.
- parseargs.cxx: Routines to process the command line and set global variables.
- parseargs.h: Routines to parse the command line

Text Section Rewriter The rewriting loop for instructions is driven by instrument.cxx:TransformAllInstructions. This routine walks through the input text sections (in the order specified by the code layout file, if provided). For each instruction found, it makes the appropriate calls into the tool instrumentation code (InstrumentInstruction, InstrumentBasicBlock, and so on.) Data is left in place, although embedded pointers will be updated appropriately. Transformed instructions are represented by an Instruction object, described in the subsection below on instruction transformations. After all modifications to the instruction are complete, it is written to the output stream. A notation is made in the PCMap describing the mapping from the original location of the instruction to its new location.

Directory: etch/instrument
Files:
instrument.cxx: The main text instrumentation/rewriting loop -- applies transformations to each (selected) instruction in the input binary.
instrument.h: The main instrumentation/rewriting loop.

Instruction Transformations The instruction transformation routines manage the details of modifying a single instruction and writing out the modified instruction. Modifying the instruction is driven by calls from the tool instrumentation code through the etch instrumentation API, typically to insert calls to the tool runtime code. These transformations are reflected by promoting the original instruction from a simple 'MachineInstruction' (unstructured bytes) to an 'InstrumentedInstruction' (the original instruction with linked lists of inserted code before and after). Other variations on the Instruction class reflect transformations with specific properties; for example, transformations to provide a callback on each memory reference, which in some cases requires simulating the effect of the REP prefix. Once the instruction is completely transformed it is written out, making use of a simple code generator for a subset of the x86 instructions set. As the instruction is written, placeholders are left behind for unresolved pointers and forward branches.

Directory: etch/instrument
Files:
jump.cxx: Class to patch up direct jump instructions -- expand 8 to 32 bit jumps, and when writing out, arrange to jump the correct target in the etched program.
inst.cxx: Classes that represeent instructions, starting with raw machine instructions (MachineInstruction) and proceeding to InstrumentedInstruction (the original instruction as transformed by the tool instrumentation code) and variants.
insttype.cxx: contains functions which determine the type of instruction given the instruction bytes
instrument_rep.cxx: Special kind of instruction class to support simulating REP prefix and making per-memory reference runtime callbacks.
codegen.cxx: Routines for generating the low-level x86 instructions that implement the higher-level operations used during instrumentation.
etch-api.cxx: This file contains the support routines for the etch instrumentation API. This means that all of the API calls made by a given tool's instrumentation dll are implemented in this file.
etch-api.h: Instrumentation support header file. Defines prototypes for all routines in the Etch API manual.

Output Stream

All output data is written through a FileStream object. (There will be one FileStream per output file section.) Stream classes provide methods for writing data, as well as special methods for writing pointers, branch offsets, and various special cases for pointers (according to whether they should be updated reflect the movement of code in the etched binary, whether to generate a relocation record, and so on.) Pointers and branch offsets whose new value cannot be determined use the label management subcomponent to leave a placeholder behind to be fixed on the second pass.

Directory: etch/instrument
Files:
- streams.cxx: Streams provide methods for writing data, as well as special methods for writing pointers, branch offsets, and so on. Pointers and branch offsets whose new value cannot be determined use the label management subcomponent to leave a placeholder behind to be fixed on the second pass.
- streams.h: Streams provide methods for writing data, as well as special methods for writing pointers, branch offsets, and so on. Pointers and branch offsets whose new value cannot be determined use the label management subcomponent to leave a placeholder behind to be fixed on the second pass.

Label Management

The label management routines provide placeholders for a variety of values that cannot always be determined on a single pass: forward branch offset, forward pointers, and so on. As the output file is generated a set of unresolved branches grows, and then are resolved on a second pass.

The Buffer class simply mediates between RVAs and file offsets, and consists of an array of (file position, RVA base address) pairs, one per output section.

To eliminate label management overhead for "nearby" forward references, the BufferedStream object buffers the output stream in memory briefly before writing to the "real" Stream.

Directory: etch/instrument
Files:
- labels.cxx: Placeholders for values that cannot be determined on the first pass: forward branch offset, forward pointers, etc.
- labels.h: Placeholders for values that cannot be determined on the first pass: forward branch offset, forward pointers, etc.
- bufstream.cxx: To eliminate label management overhead for "nearby" forward references, the BufferedStream object buffers the output stream in memory briefly before writing to the "real" Stream.
- bufstream.h: A wrapper around a stream that buffers the output in hopes of resolving "nearby" forward references.
- buffer.h: Maps between file stream offsets and virtual addresses in the output binary. Primarily used by labels.{h,cxx} when filling in 'holes' in the output file left on the first pass by unresolved forward branches.
- buffer.cxx: Maps between stream offset (actually, file stream) and virtual addresses in the output binary. Primarily used to fill in 'holes' in the output file left on the first pass by unresolved forward branches.

Pointer Management

The PCMap is a collection of mappings from original address to etched address, one mapping per section. It is updated to reflect the new position of instructions in the output binary, and the new location of the import table in the output binary. It is consulted as needed to determine the correct value of pointers and branch targets.

Directory: etch/instrument
Files:
- simplemap.c: A data structure for representing mappings from original PCs to new PCs. The mappings are maintained in an array indexed by PC, with a magic value for unknown mappings.
- simplemap.h: A data structure for representing mappings from original PCs to new PCs. The mappings are maintained in an array indexed by PC, with a magic value for unknown mappings.
- pcmaps.h: Inline procedures for handling arrays of PCMaps. See also simplemap.h.

11. Import Table Patching (dllwalk)

Dllwalk is used for two purposes. The first is to report the complete set of DLLs that a binary depends upon (the import closure of the binary). This set includes the DLLs explicitly listed in the import table of the binary, as well as the DLLs imported by the imported DLLs, etc. Dllwalk determines the import closure by mapping the binary, walking the import table, and then recursively repeating the process on the DLLs listed in the import table until all DLLs have been checked and listed.

The second purpose is to update the import table of a binary so that it imports etched or patched DLLs rather than the originals. The updating is done according to entries in a map file, which is typically generated by Visual Etch according to the configuration of the Modules dialog. Dllwalk updates a binary's import table by mapping the binary and iteratively checking the original imports against the entries in the map file. For those imports that match entries in the map file, the imports are changed to the new values specified by the map file entries. Since the new import names are typically larger than the original names, dllwalk relies upon Etch to provide sufficient storage in the import table for the new names when Etch transforms the binary.

Directory: etch/instrument
Files:
- dllwalk.cxx: The dllwalk utility for manipulating the import tables of executable and DLLs. During the etching process, dllwalk is used to both list the imports used by an application, as well as to patch an application's import table. Given an executable file, dllwalk will list all of the static imports that the file depends on, recursively walking imported DLLs until closure is reached. Given an executable file and a import mapping file, dllwalk will change references in the import table of the executable file from the names of original imports into new names specified in the map file.
- dllwalk-oldimport.cxx: Dllwalk routines that need to use the old import table classes that allow import table entries to be modified in place. These routines are in a separate file because of type name conflicts between the new and old import headers.
- dllwalk.h: The dllwalk utility for manipulating the import tables of executable and DLLs. During the etching process, dllwalk is used to both list the imports used by an application, as well as to patch an application's import table. Given an executable file, dllwalk will list all of the static imports that the file depends on, recursively walking imported DLLs until closure is reached. Given an executable file and a import mapping file, dllwalk will change references in the import table of the executable file from the names of original imports into new names specified in the map file.

12. Etch Runtime

The Etch Runtime provides two primary services. First, etchwrap.dll mimics the Windows execution environment of the original program (primarily, the directory and file names of the original executable and DLLs).

Second, etchrt.dll provides support to tool run-time code: in particular, support for code inserted during the ModuleBefore, ModuleAfter, ProgramBefore, and ProgramAfter callbacks, and support for the ArgBranchTarget argument type.

etchwrap.dll

etchwrap.dll mimics the Windows execution environment by intercepting application calls to the Windows32 API calls LoadLibrary, GetModuleHandle, and GetModuleFileName, and modifying the arguments to refer to etched and patched DLLs rather than the original DLLs.

etchwrap.dll is linked into the etched module as part of the rewriting (etching or patching process), as specified by the '-wrap' arguments on the Etch command line. All statically identified calls to the relevant calls to LoadLibrary, etc. in kernel32.dll are replaced with calls to EtchLoadLibrary, etch. in etchwrap.dll. The code for replacing the calls lives in outimport.cxx:OutImportTable::ReplaceImports.

At runtime, etchwrap.dll reads a configuration file (etch.config) generated by the Module Management component that provides the directory of the original executable and the names of any DLLs that were neither etched or patched (and hence should not be replaced with etched or patched DLLs.) For each attempt to refer to an unmodified DLL X.dll, etchwrap.dll searches for X-patch.dll and X-etch.dll. If neither is found it alerts the user that a problem exists (using the utility program simpledialog.exe), and asks the user whether to proceed with the experiment. As a side-effect, it generates a file, etch-orphans.log, that lists all DLLs for which no etched or patched version was found. This logfile can then be incorporated into the Visual Etch modules window for the next experiment.

Directory: etch/instrument
Files:
- etchwrap.c: This module takes care to implement LoadLibrary and friends from kernel32.dll. Everytime a program tries to load a library, we find the right etched or patched version of the library and load it instead. If no etched or patched version is found, we complain to the user and record the missing name in etch-orphans.log. Someone else is responsible for later ensuring that the log entries have been etched or patched.

Tool runtime support

etchrt.dll provides runtime support to Etch tools. This includes runtime code inserted during the ModuleBefore, ModuleAfter, ProgramBefore, and ProgramAfter instrumention-time callbacks (etchrt*), and to support the ArgBranchTarget argument type for indirect calls (backmap*).

ModuleBefore, ModuleAfter, ProgramBefore, ProgramAfter

At instrumentation time, any code inserting during the ModuleBefore and ModuleAfter callbacks is inserted into the etched binary. For DLLs, the entry point of the DLL is replaced with a call to etchrt.c:EtchHandleDllEntry. (For executables, the entry point is simply replaced with code that calls the ModuleBefore code, if present, and then transfers control to the etched version of the original entry point.) In addition, etchrt.dll is rewritten to include stubs that call any code inserted during the ProgramBefore and ProgramAfter callbacks. This code is implemented in instrument.c:{InstallModuleCallbacks,InstallProgramCallbacks}.

At runtime, etchrt.c:EtchHandleDllEntry makes the appropriate calls to the ProgramBefore/ProgramAfter/ModuleBefore/ModuleAfter routines, and then transfers controls to the etched version of the original entry point.

Directory: etch/instrument
Files:
- etchrt.c: Support for code inserted during the ModuleBefore, ModuleAfter, ProgramBefore, and ProgramAfter callbacks,

ArgBranchTarget support

Instructions whose branch target is determined dynamically (indirect branches and returns) need special support when tools use the ArgBranchTarget argument type to tool runtime code. At runtime these instructions will generate the branch target in the context of the etched program, while typically the toolwriter will be interested in the branch target in the context of the original program. To support this functionality, a map from etched pc to original pc is inserted into the etched module, if needed. At instrumentation time, calldll.cxx:PushBranchTarget inserts code into the etched binary to consult this table and pass it along to the tool runtime code.

Directory: etch/instrument
Files:
- backmap-plus.cxx: Write a backmap to an output stream.
- backmap-plus.h: Function to write a backmap to an output stream.
- backmap-private.h: Declarations for internals of backmaps, which map 'new' (etched) to 'old' (unetched) instruction addresses. We store RVAs in the backmap array, sorted by new RVA, and also keep track of the old and new image bases for the module. We also have a simple direct-mapped cache to accelerate lookups.
- backmap-rt.c: Runtime code for initializing and accessing backmaps. Used to compute the original target of etched indirect jumps (i.e. ArgBranchTarget). Linked into etchrt.dll.
- backmap.c: Functions to construct a backmap from etched to original PCs.
- backmap.h: Maintain a table mapping new target -> original PC. This is used to generate PCs of indirect branches. We say target because it is the place you would branch to in order to execute any instrumentation stubs that have been inserted before the original instruction (or its patched replacement). Implemented in C rather than C++ to make it easier if we ever want to install into an existing program.

13. P5/P6 Performance Counter Interface

Device drivers and user-level routines to control the hardware performance counters. These routines are provided to assist creators of etch tools.

Both the NT and the Win95 drivers use the same ioctl based interface, so the user-level code that interacts with the drivers is almost identical. This code is in the hwcounter dll, and the source is in hwcounter.c. On NT, the drivers are installed as an NT service, and we use a command line program to perform the installation. The source code for this lives in driverinst.c. The perfselect utility in this directory is a window based interface used by Visual Etch to allow the user to select which hardware counters should be active for measuring a given application. Finally, the perfutil program is a command line interface used to activate counters and measure programs using the counters. Perfutil is a client of the hwcounter dll.

Directory: etch/hwperf
Files:
- hwperf.c: Windows NT driver to access the hardware performance counters. Since the P5 and P6 drivers are very similar, this file contains the source for both drivers, with ifdefs in the places where they differ. This driver borrows bits and pieces from the NT DDK sample code. The interaction between a user-level program and the device driver is mainly through the use of ioctl's. The P5 supports the following ioctl's: IOCTL_P5WNT_READ_ALL - reads the values of both active counters and the cycle counter (TSC) into a buffer. IOCTL_P5WNT_READ_CTRLWD - reads the control word, used to determine which counters are active. IOCTL_P5WNT_WRITE_CTRLWD - sets the control word, used to select which counters are active. IOCTL_P5WNT_RESET_ALL - resets the values of both active counters and the cycle counter (TSC) to zero. The P6 supports similar ioctl's to those above, and two additional ioctls: IOCTL_P6WNT_USERENABLE - enables the RDPMC instruction to work in a user-level process on the P6 processor. This allows for fine-grained performance measurements without the overhead of entering the device driver. IOCTL_P6WNT_USERDISABLE - disables use of RDPMC from user-level.
- driverinst.c: Simple application that installs a device driver as a service. This code was based on some sample code from the NT Device Driver Development Kit (NT DDK).
- etcherr.h: Routines for printing error messages to one of - stderr - a file specified by the ETCHERRORLOG environment variable. - a file specified by ESetAltOut() if ETCHERRORLOG isn't set
- hwcounter.c: This module provides an interface to the hardware counters. It currently supports both the P5 and the P6, on Windows NT and 95 (assuming the appropriate drivers are available).
- hwcounter.h: Interface to the hardware counters. Procedures Definitions: InitHardwareCounters() - initialization routine that must be called before any other routines are called. ReadActiveCounters() - read the values of the currently active counters. ResetActiveCounters() - set the values of the currently active counters to zero. SetActiveCounters() - choose which hardware counters to activate. Only two counters can be active at any given time. The available counters are enumerated in p5defs.h and p6defs.h. CloseCounters() - clean up routine. Closes the device. GetCycleCounter() - routine to read the hardware timestamp counter. GetSystemType() - returns the current system CPU and OS type. GetCounterName() - given the system type and a counter value, return a string that describes the counter. GetCounterVal() - given the system type and a counter name, return the value needed to activate the counter. GetNumCounters() - for a given system type, how many counters are supported. GetCounterByIndex() - for a given system type, return the value of the Nth counter. Can be used with GetNumCounters to enumerate all of the supported counters.
- math64.h: Set of routines for dealing with 64 bit values. This code is for backward compatibility only, new code should be written to use the MSVC built-in support for 64 bit values, since the built-in code is more efficient.
- p5defs.h: This header file contains the Pentium (P5) specific definitions that help use the P5 hardware performance counters.
- p6defs.h: This header file contains the Pentium Pro (P6) specific definitions that help use the P6 hardware performance counters.
- perfselect.c: A graphical utility for performing the selection of active hardware performance counters. This program uses the hwcounter library to determine the platform type and the names of the available counters, and then presents that info to the user is selectable listboxes. Some of the code in this file is based on examples from the Charles Petzold Programming Windows 95 book.
- perfutil.c: Command line interface to the hardware performance counters. Uses the hwcounter library to interact with the hardware counters.

Guide to the Etch Source Tree

Contents

1. Introduction

2. Major Components of Etch: Overview

3. The Etch Directory Hierarchy

4. Visual Etch Graphical User Interface

Main User Interface files

Project management

Communication with tool scripts

Showing tool output

Console Management

Command line parsing

Auxiliary routines

5. DLL Discovery

6. Module Management

Interacting with Visual Etch

Running Etch and the Etched Application

Tool specific scripts

Auxiliary routines

7. Executable File Interfaces

Executable File Header

Import Tables

Export Tables

Relocation Information

CodeView Debug Information

8. x86 Disassembler

Testing Support

Adding instructions

9. Code Discovery

Distinguishing Between Code and Data

Basic Blocks and Procedures

10. Etch Rewriting Engine

11. Import Table Patching (dllwalk)

12. Etch Runtime

etchwrap.dll

Tool runtime support

ModuleBefore, ModuleAfter, ProgramBefore, ProgramAfter

ArgBranchTarget support

13. P5/P6 Performance Counter Interface