Improving Data-flow Analysis with Path Profiles - Notes | CS 6463 | Papers Computer Science

Improving Data-flow Analysis with Path Profiles*

Glenn Ammons James R. Larust

[email protected] [email protected]

Department of Computer Sciences

University of Wisconsin-Madison

1210 West Dayton St.

Madison, WI 53706

Abstract

Data-flow analysis computes its solutions over the paths in

a control-flow graph. These paths-whether feasible or in-

feasible, heavily or rarely executed-contribute equally to a

solution. However, programs execute only a small fraction

of their potential paths and, moreover, programs’ execution

time and cost is concentrated in a far smaller subset of hot

paths.

This paper describes a new approach to analyzing and

optimizing programs, which improves the precision of data

flow analysis along hot paths. Our technique identifies

and duplicates hot paths, creating a hot path graph in

which these paths are isolated. After flow analysis, the

graph is reduced to eliminate unnecessary duplicates of un-

profitable paths. In experiments on SPEC95 benchmarks,

path qualification identified 2-112 times more non-local con-

stants (weighted dynamically) than the Wegman-Zadek con-

ditional constant algorithm, which translated into l-7%

more dynamic instructions with constant results.

1 Introduction

Data-flow analysis computes its solutions over the paths in

a control-flow graph. The well-known, meet-over-all-paths

formulation produces safe, precise solutions for general data-

flow problems. All paths-whether feasible or infeasible,

heavily or rarely executed-contribute equally to a solution.

This egalitarian approach, unfortunately, is at odds with

the realities of program behavior. Even moderately large

programs execute only a few tens of thousands of paths (out

of a universe of billions of acyclic paths) and, moreover,

programs’ execution time and cost is concentrated in a far

smaller subset of hot paths [BL96, ABL97].

This paper presents a new data-flow analysis technique

that attempts to compute more precise solutions along the

hot paths in a program. Improved analysis along these paths

“This research supported by: NSF NY1 Award CCR-9357779, with

support

from

Sun Microsystems

and Intel, and NSF Grant MIP-

9625558.

‘On sabbatical at Microsoft Research.

Permission 10 make digital or hard copies of all or pan of this wcrk for

personal or classroom use is granted without ka provided that

copies are not made or distributed for profit or ccmmwcial advan-

tage and that copies bear this notice and the full citation on the first page.

To copy otherwise, 10 republish, 10 pcsf on sawen w 10

redistributa 10 lists, requires prior specific psrmiasion and/w a fw.

SIGPLAN ‘98 Montrasl, Canada

@ 1998 ACM 0-89791~987.4/98/0006...$5.00

can aid a compiler in optimizing these heavily executed por-

tions of a program. Path-qualified data-flow analysis con-

sists of the following steps:

1.

2.

3.

4.

5.

Identify hot paths by profiling a program. We use a

Ba&Larus path profile [BL96] to determine how often

acyclic paths in a program execute.

Identify and isolate the hot paths in the program’s

control-flow graph (CFG). This step produces a new

CFG in which each hot path is duplicated. Since a

hot path is separated from other paths, data-flow facts

along the path do not merge with facts from other,

overlapping paths. Moreover, as programs do not exe-

cute many hot paths, this hot-path graph (HPG) is not

much larger than the original graph.

Perform data-flow analysis on the HPG. The solutions

found by this technique are conservative in the hot

path graph-not in the original control-flow graph.

Reduce the graph to preserve only valuable solutions.

The HPG duplicates code for paths whose solutions

did not improve. Extra code both increases the cost

of subsequent compiler analyses and adversely affects

a processor’s instruction cache and branch predictor.

Reduction uses results from the data-flow analysis

and

frequencies from the path profile to decide which paths

to preserve in the

TI&K~

hot-path graph (THPG).

Translate the original path profile into a path pro-

file for the rHPG, so profiling information is avail-

able for subsequent analyses and optimizations. Ball-

Lams path profiles are determined by a set of recording

edges, which start and end paths. The algorithm that

produces an HPG also identifies recording edges in the

HPG, which allows interpretation of the original path

profile as a path profile of the HPG. The reduction

step properly maintains these recording edges.

The technique can be applied to any data-flow prob-

lem, although this paper focuses on constant propagation.

In experiments on SPEC95 benchmarks, path qualification

identified 2-112 times more non-local constants (weighted

dynamically) than the Wegman-Zadek conditional constant

algorithm, which translated into l-7% more dynamic in-

structions with constant results. Moreover, the technique is

practical. With the exception of the go benchmark, the hot-

path graphs were 3-32% larger and the reduced hot-path

graphs were only l-7% larger than the original CFG. On

72

Improving Data-flow Analysis with Path Profiles - Notes | CS 6463, Papers of Computer Science

Related documents

Partial preview of the text

Download Improving Data-flow Analysis with Path Profiles - Notes | CS 6463 and more Papers Computer Science in PDF only on Docsity!