Versions Compared
Version | Old Version 3 | New Version 4 |
---|---|---|
Changes made by | ||
Saved on |
Key
- This line was added.
- This line was removed.
- Formatting was changed.
Overview
Large mature systems often contain symbols that are no longer used. These may be methods that are no longer called or variables and data structures that are no longer referenced by active logic. These unused symbols are often referred to as "dead code". Dead code is a form of technical debt, but it can difficult to identify dead code and risky to remove requiring a lot of testing. gmStudio can help with this problem.An Furthermore, an Upgrade Project is an optimal time to remove dead code because this "cleaning house" can make it makes the overall Upgrade effort proceed more efficiently. A And, a big part of the cost of removing dead code is the need to retest the system after making the changes; enter the Upgrade to .NET. Migrating to a new software platform . A platform migration is a comprehensive maintenance effort – everything is changing. An Upgrade deserves an extensive regression testing effort. Consequently, a Upgrade Project , so it is an ideal opportunity to remove dead code because you will be doing extensive regression testing anyway.
gmStudio's Refactor/Remove command directs the removal of symbol declarations, and optionally, symbol references as well. The Refactor/Remove command requires you to specify the identifier of the symbol to remove. In some cases, the Upgrade Team may already know the identifiers of dead code, but typically many cases of dead code are not known and they are difficult to find and verify. Teams can benefit an automated analysis identifying the dead code. This document describes how gmStudio can help automate the process of identifying and removing dead code.
Identifying Dead Code
gmStudio is distributed with gmGlobal.exe, a tool that can help you identify and report unused symbols in various ways. gmGlobal is a .NET executable built using gmAPI it is a console application that takes a gmPL script on its command line. The script tells gmGlobal what code to analyse, how to analyse it, and how to report the results. gmGlobal recognizes and internally processes the following statements:
- InformationFiles,
- FindCallByName,
- RemoveUnused, and
- ReportRemovals.
gmGlobal can analyse a symbol usage across collection of inter-related components, and also identifies dead code as both symbols that are not referenced at all, and also symbols that are referenced only from dead code.
Warning |
---|
CAUTION: The Unused symbol analysis depends on the detailed symbol reference information gathered by gmBasic: unreferenced symbols are unused symbols. However, this approach fails to identify references made through late calls. You should complete type inference optimization and other techniques to reduce late calls prior to doing the Unused analysis. You should also plan to take advantage of the FindCallByName and DoNotRemove features to fine tune the removals. Always take time to consider these limitations and review the removal scripts generated by gmGlobal as well as the codes that have had symbols removed. |
Info |
---|
Event handlers (e.g. Form and Control event handlers, Class_Terminate, Class_InItialize, Sub Main, etc.) and symbols declaring or implementing interfaces are excluded from removals because they are needed even if they are not referenced. |
Using gmGlobal in gmStudio
You may integrate gmGlobal into your gmStudio project as a special gmGlobal task. Typically this will be inserted into the task list to run after the VBPs are translated to create the VBI files to be analysed. When you Translate this special task, gmStudio will run prepare an actual script and run gmGlobal.exe passing the actual script as the first argument.
GMTOOL:C:\Program Files (x86)\GreatMigrations\gmStudio\gmGlobal.exe
Here are some attributes of a sample gmGlobal task:
Source Name = [RemoveUnused1] Source Location = [C:\gmSpec\Util\UnUsedTest\proj\usr] Source File Name = [RemoveUnused1.xml] Translation Script = [GMTOOL:C:\Program Files (x86)\GreatMigrations\gmStudio\gmGlobal.exe] Task Command Script = [UserCmds.cmd] Code Bundle Path = [C:\gmSpec\Util\UnUsedTest\proj\log\UnUsedTest-RemoveUnused1-std-csh.bnd]
A sample RemoveUnused Script
I will demonstrate the Unused Symbol Analysis and Reporting with a small demo project. Project has default translations setup for two related VBPs:
- projUnusedDll.vbp
- UnUsedTestEXE.vbp
The first example script is RemoveUnused1.xml:
<gmGlobal> <Select Progress="1" /> <InformationFiles> <Load id="UnUsedTest-projUnusedDll-std-csh.vbi" /> <Load id="UnUsedTest-UnUsedTestEXE-std-csh.vbi" /> </InformationFiles> <Output Status="New" FileName="%bndPath%" /> <RemoveUnused /> <Output Status="Close" /> </gmGlobal>This script directs gmGlobal.exe to load the VBIs for the two VBP translations and generate a RemoveUnused report. The report is placed in the BndFile associated with the task to facilitate integration with gmStudio progress tracking and reporting.The InformationFiles statement supplies the list of information files to be processed by down stream statements like FindCallByName or RemoveUnused . The list is initiated either by the statement <InformationFiles site="folder"> or by a statement <InformationFiles> with no site attribute specified. When "site" is specified the list consists of all files with the "vbi" extension in the specified folder. When not, each file is introduced by a statement <Load id="pathname">.
gmGlobal also produces a log of its operations with verbosity corresponding to select.Progress.
Select.Progress=1
Code Block |
---|
gmGlobal V40.34x86(09/22/22) System Build(09/22/22 14:48:10) The InformationFiles list contains 2 files. The Local Removal Scan of <UnUsedTest-projUnusedDll-std-csh.vbi> required 2 passes. The Local Removal Scan of <UnUsedTest-UnUsedTestEXE-std-csh.vbi> required 3 passes. Performing Global Removal pass 1 Performing Global Removal pass 2 Performing Global Removal pass 3 Performing Global Removal pass 4 |
Select.Progress=2
Code Block |
---|
gmGlobal V40.34x86(09/22/22) System Build(09/22/22 14:48:10) The InformationFiles list contains 2 files as follows: Information File(1): UnUsedTest-projUnusedDll-std-csh.vbi Information File(2): UnUsedTest-UnUsedTestEXE-std-csh.vbi The Local Removal Scan of <UnUsedTest-projUnusedDll-std-csh.vbi> required 2 passes. The Local Removal Scan of <UnUsedTest-UnUsedTestEXE-std-csh.vbi> required 3 passes. Performing Global Removal pass 1 The information file <UnUsedTest-projUnusedDll-std-csh.vbi> has 3 unUsed members projUnusedDll.Class1.DLLexposedUsedbyClient:42265 has 0 global references projUnusedDll.Class1.dllPropUsedbyClient:42327 has 0 global references projUnusedDll.Class1.dllPropNotUsedbyClient:42584 has 0 global references Performing Global Removal pass 2 The information file <UnUsedTest-projUnusedDll-std-csh.vbi> has 1 unUsed members projUnusedDll.Class1.DLLexposedNotUsedbyClient:42202 has 0 global references Performing Global Removal pass 3 The information file <UnUsedTest-projUnusedDll-std-csh.vbi> has 1 unUsed members projUnusedDll.Class1.pubOnlyUsedinDLL:42140 has 0 global references Performing Global Removal pass 4 |
The output of the run produces this file with refactoring commands.
Code Block |
---|
<Registry type="RefactorFile" Source="...\UnusedTestDLL.vbp"> <Refactor errorStatus="warn"> <Remove identifier="projUnusedDll.Class1.privNotUsedinDLL"/> <Remove identifier="projUnusedDll.Class1.pubUsedOnlyFromPrivate"/> <Remove identifier="projUnusedDll.Class1.dllPropUsedbyClient.Let.val"/> <Remove identifier="projUnusedDll.Class1.dllOnlyGetUsedbyClient.Let.val"/> </Refactor> </Registry> <Registry type="RefactorFile" Source="...\UnUsedTest.vbp"> <Refactor errorStatus="warn"> <Remove identifier="UnUsedTestEXE.modUnUsedTest.NotUsedDecl"/> <Remove identifier="UnUsedTestEXE.modUnUsedTest.UnreachableDecl"/> <Remove identifier="UnUsedTestEXE.notUsedSub"/> <Remove identifier="UnUsedTestEXE.notUsedFunc"/> <Remove identifier="UnUsedTestEXE.UnreachableSub"/> <Remove identifier="UnUsedTestEXE.UnreachableFunc"/> </Refactor> </Registry> |
The commands above may be used as a starting point for implementing rules that will remove unused code from you translations. The commands would be integrated with a GlobalSettings script.
Reporting
Following the application of <RemoveUnused /> to a set of VBI files, the list of unused symbols may be reported using code such as the following:
<Output Status="New" FileName="%MigName%-ReportUnused.tab" syntax="Tabbed" /> <ReportRemovals /> <Output Status="Close" />
The resulting report is written to the file named in the FileName attribute and looks like this:
Controlling the Types of symbols in the Unused Analysis
The "RemoveUnused" command itself has two mutually exclusive attributes: Include and Exclude. These specify what types of members – methods, fields, constants, properties, declarations, events, enumerations, or structures – are eligible for removal. By default, all of the above types are eligible for removal. Automatically removing all of these unused member types from the application code may be going too far in many cases. The user may be mainly looking for help with identifying and removing unused subprograms since they typically reference each other creating a complex web of dependencies that are tedious to unravel. Or a user might want to retain all events rather than singling out particular ones via a DoNotRemove list.
The types of symbols included in the Unused Symbols analysis may be set the Include attribute of the RemoveUnused command; for example:
<RemoveUnused Include="method, declare, property" />
Alternatively, the types of symbols excluded from the Unused Symbols analysis may be set the Exclude attribute of the RemoveUnused command; for example:
<RemoveUnused Exclude="field, constant, structure, enumeration, event" />
The following type names are recognized
method field constant property declare structure enumeration event
Preventing symbols from being marked as Unused
In addition to its Include and Exclude attributes the RemoveUnused statement also supports a set of DoNotRemove subcommands. The DoNotRemove statements store a series of DoNotRemove identifiers classified by their scope and then scans a set of loaded information files to locate and mark any members that are specified in the list. There are three scope levels: Global = 0, Library = 1, and member = 2+. The level is simply computed by counting the periods in the identifier. Each subcommand has a single "id" attribute. The example below shows using a DoNotRemove element to suppress removal of a specific symbol.
<gmGlobal> <InformationFiles> <Load id="UnUsedTest-projUnusedDll-std-csh.vbi" /> <Load id="UnUsedTest-UnUsedTestEXE-std-csh.vbi" /> </InformationFiles> <Output Status="New" FileName="%bndPath%" /> <RemoveUnused> <DoNotRemove id="projUnusedDll.Class1.privNotUsedinDLL"/> </RemoveUnused> <Output Status="Close" /> </gmGlobal>
FindCallByName
As mentioned above, the Unused Analysis risks marking late called symbols as unused. Typically the user will have to make adjustments for this by adding DoNotRemove commands and carefully reviewing the reported removals for symbols known to be accessed by a late call. This can be a difficult process, and gmGlobal includes an additional analyzer to assist you: the FindCallByName command illustrated below:
gmAPI_CallByName.xml
Code Block |
---|
<gmGlobal> <Storage Action="Create" Identifier="gmAPI_CallByName" /> <LoadEnvironment /> <Select MaxOutputWidth="2048" /> <Output Status="New" Filename="gmAPI_CallByName.out" /> <InformationFiles> <Load id="UnUsedTest-projUnusedDll-std-csh.vbi" /> <Load id="UnUsedTest-UnUsedTestEXE-std-csh.vbi" /> </InformationFiles> <FindCallByName ShowDetails="on" /> <Storage Action="Close" /> </gmGlobal> |
To be continued...
Table of Contents |
---|