Refactoring legacy code is difficult, whether you do it manually or try to automate it. To understand why, we need to start with the four major components of a programming language: its syntax, its grammar, its native capabilities (such as data types and functions), and its implementation (how it handles variables, memory, and control flow).
While programming languages share many fundamental concepts, there are also critical differences. Legacy languages often implement these components in simpler, procedural ways, whereas modern languages emphasize modularity, abstraction, and advanced capabilities. These differences form the foundation of the challenges of refactoring, and they fall into three categories: syntactical and grammatical differences, native capability differences, and implementation differences.
Let’s go through a few specific examples of native capability differences and implementation differences that illustrate the complexities of automated refactoring.
Data type mismatches: COBOL has packed decimals (COMP-3) for precise fixed-point arithmetic. Modern languages like Java have no native equivalent and rely on the BigDecimal class instead. This mismatch requires additional handling, such as defining precision and scale explicitly, which can introduce errors if done incorrectly.
COBOL: The value 12345.67 is stored in a compact binary-coded decimal format with two implied decimal places.
Java: To handle precise decimal arithmetic, Java relies on the BigDecimal class, which is not a native data type but part of the java.math package.
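Here is a minimal Java sketch of what that extra handling looks like; the COBOL picture clause in the comment and the 7% multiplier are illustrative only:

```java
import java.math.BigDecimal;
import java.math.RoundingMode;

public class PackedDecimalExample {
    public static void main(String[] args) {
        // COBOL: 01 WS-AMOUNT PIC S9(5)V99 COMP-3.
        // The declaration itself fixes the precision and the two implied decimal places.
        BigDecimal amount = new BigDecimal("12345.67");

        // In Java, scale and rounding must be managed explicitly by the developer.
        BigDecimal adjusted = amount.multiply(new BigDecimal("1.07"))
                                    .setScale(2, RoundingMode.HALF_UP);

        System.out.println(adjusted); // 13209.87
    }
}
```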
Function mismatches: COBOL has a native SORT statement to handle file-based sorting directly within the language, while Java requires developers to implement sorting manually or use library-based solutions.
COBOL directly sorts a file based on a specified key without additional code.
Java requires significantly more boilerplate code to achieve the same result. This gap makes direct mapping impossible and often leads to increased complexity when refactoring.
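As a rough illustration, here is a hedged Java sketch of a keyed file sort; the file names and the fixed-width key position are invented for the example:

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;

public class FileSortExample {
    public static void main(String[] args) throws IOException {
        // COBOL's SORT statement reads, sorts by key, and writes in a single construct.
        // In Java, each of those steps is explicit.
        Path input = Path.of("customers.dat");      // hypothetical input file
        Path output = Path.of("customers.sorted");  // hypothetical output file

        List<String> sorted = Files.readAllLines(input).stream()
                // Assume the sort key occupies the first 10 characters of each fixed-width record.
                .sorted(Comparator.comparing((String line) -> line.substring(0, 10)))
                .collect(Collectors.toList());

        Files.write(output, sorted);
    }
}
```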
Global variable handling: In COBOL, global variables are often shared across multiple procedures, creating hidden dependencies. During refactoring, you must identify every place where a variable is read or written to ensure equivalent functionality in Java. This requires extensive analysis. Additionally, in COBOL, variables persist throughout the program’s execution unless explicitly reset. In Java, variables are typically created and destroyed dynamically, requiring careful management of their lifecycle to avoid memory leaks or unintended data loss.
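One common way to make those hidden dependencies explicit is to gather the former globals into a state object that is passed to whatever needs it. A hedged sketch, with invented field names standing in for COBOL WORKING-STORAGE items:

```java
import java.math.BigDecimal;

// Invented stand-ins for COBOL WORKING-STORAGE fields such as WS-TOTAL and WS-STATUS,
// which would otherwise be implicitly visible to every paragraph in the program.
public class BatchState {
    private BigDecimal total = BigDecimal.ZERO;
    private String status = "";

    public BigDecimal getTotal() { return total; }
    public void addToTotal(BigDecimal amount) { total = total.add(amount); }

    public String getStatus() { return status; }
    public void setStatus(String status) { this.status = status; }
}

class ReportWriter {
    // The dependency on shared state is now visible in the method signature
    // instead of hidden behind a global variable.
    void writeSummary(BatchState state) {
        System.out.println(state.getStatus() + ": " + state.getTotal());
    }
}
```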
Procedural constructs: COBOL’s control flow relies heavily on procedural constructs like GO TO, which can make the program flow less structured and harder to trace. Modern languages enforce structured programming principles, which discourage goto-style jumps and rely on clearly scoped loops and conditionals. Refactoring COBOL’s GO TO and procedure-driven logic into Java’s structured approach requires analyzing the entire program flow to avoid introducing unintended side effects.
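One common pattern, sketched below with invented paragraph names, is to re-express GO TO-driven flow as an explicit state machine so that every jump becomes a visible transition:

```java
public class GotoRefactorSketch {

    // Invented stand-ins for COBOL paragraphs that were reached via GO TO.
    private enum Step { READ_RECORD, VALIDATE, WRITE_OUTPUT, DONE }

    public static void main(String[] args) {
        Step step = Step.READ_RECORD;
        while (step != Step.DONE) {
            switch (step) {
                case READ_RECORD  -> step = Step.VALIDATE;
                case VALIDATE     -> step = isValid() ? Step.WRITE_OUTPUT : Step.READ_RECORD;
                case WRITE_OUTPUT -> step = Step.DONE;
                default           -> step = Step.DONE;
            }
        }
    }

    private static boolean isValid() {
        // Placeholder for the validation logic of the original paragraph.
        return true;
    }
}
```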
The good news is that most syntactical, grammatical, and capability (data type, function) differences can be addressed with static, rule-based mappings. The bad news is that implementation differences are significantly harder to resolve. These differences are deeply tied to how a language handles variables, memory, and control flow, all of which are interdependent. Refactoring across implementation differences properly requires understanding the entire logical structure of a program and inferring its intent to ensure functionality is preserved.
So is the automation of refactoring a lost cause? The short answer is “No!” Automation of refactoring is possible if you have the ability to analyze the entire logical structure of a program and to infer the intent behind it, so that functionality is preserved in the target language.
The even better news: these capabilities already exist today. They’re called transpilers, but probably not the transpilers you’re thinking of.
When most developers think of transpilers, they think of static, rules-based transpilers, which struggle for two reasons: (1) implementation differences create interdependencies that aren’t linear, so the volume of rules needed to resolve them becomes exorbitant, and (2) modern languages change faster than static rule sets can keep up with.
But there is another class of transpilers: those that use a tertiary language as an intermediary. Tertiary languages abstract logical constructs and decouple the source and target languages, creating a bridge that allows for more generalized mappings and dynamic handling of implementation differences. Transpilers built this way not only avoid the issues caused by static rules but, by the very nature of their structure, translate more accurately and improve more quickly. It is for this reason that many researchers in the Generative AI space are adopting this technique to enhance their models (which still don’t outperform transpilers). See here, here and here.
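Purely as an illustration of the idea (the node types and method names below are invented, not any particular transpiler’s API), a front end lowers source constructs into neutral nodes and a back end renders those nodes in the target language:

```java
import java.util.List;

// Illustrative sketch of the tertiary-language idea. All names are invented.
interface IrNode {
    String toJava();
}

// A neutral "sort records by key" construct, independent of COBOL SORT or Java streams.
record SortByKey(String dataset, int keyStart, int keyLength) implements IrNode {
    @Override
    public String toJava() {
        return "records.sort(Comparator.comparing(r -> r.substring("
                + keyStart + ", " + (keyStart + keyLength) + ")));";
    }
}

public class TertiaryTranspilerSketch {
    public static void main(String[] args) {
        // A COBOL front end would emit nodes like this from a statement such as
        // "SORT CUSTOMER-FILE ON ASCENDING KEY CUST-ID".
        List<IrNode> program = List.of(new SortByKey("CUSTOMER-FILE", 0, 10));

        // A Java back end walks the neutral representation and emits target code,
        // so mappings are written against the intermediary rather than pairwise.
        for (IrNode node : program) {
            System.out.println(node.toJava());
        }
    }
}
```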
If you’d like to learn more about how tertiary language-based transpilers work, click here.