Notice of Disclaimer
The information disclosed to you hereunder (the “Materials”) is provided solely for the selection and use of Xilinx products. To the maximum extent permitted by applicable law: (1) Materials are made available “AS IS” and with all faults, Xilinx hereby DISCLAIMS ALL WARRANTIES AND CONDITIONS, EXPRESS, IMPLIED, OR STATUTORY, INCLUDING BUT NOT LIMITED TO WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, OR FITNESS FOR ANY PARTICULAR PURPOSE; and (2) Xilinx shall not be liable (whether in contract or tort, including negligence, or under any other theory of liability) for any loss or damage of any kind or nature related to, arising under, or in connection with, the Materials (including your use of the Materials), including for any direct, indirect, special, incidental, or consequential loss or damage (including loss of data, profits, goodwill, or any type of loss or damage suffered as a result of any action brought by a third party) even if such damage or loss was reasonably foreseeable or Xilinx had been advised of the possibility of the same. Xilinx assumes no obligation to correct any errors contained in the Materials or to notify you of updates to the Materials or to product specifications. You may not reproduce, modify, distribute, or publicly display the Materials without prior written consent. Certain products are subject to the terms and conditions of the Limited Warranties which can be viewed at http://www.xilinx.com/warranty.htm; IP cores may be subject to warranty and support terms contained in a license issued to you by Xilinx. Xilinx products are not designed or intended to be fail-safe or for use in any application requiring fail-safe performance; you assume sole risk and liability for use of Xilinx products in Critical Applications: http://www.xilinx.com/warranty.htm#critapps.

Automotive Applications Disclaimer
XILINX PRODUCTS ARE NOT DESIGNED OR INTENDED TO BE FAIL-SAFE, OR FOR USE IN ANY APPLICATION REQUIRING FAIL-SAFE PERFORMANCE, SUCH AS APPLICATIONS RELATED TO: (I) THE DEPLOYMENT OF AIRBAGS, (II) CONTROL OF A VEHICLE, UNLESS THERE IS A FAIL-SAFE OR REDUNDANCY FEATURE (WHICH DOES NOT INCLUDE USE OF SOFTWARE IN THE XILINX DEVICE TO IMPLEMENT THE REDUNDANCY) AND A WARNING SIGNAL UPON FAILURE TO THE OPERATOR, OR (III) USES THAT COULD LEAD TO DEATH OR PERSONAL INJURY. CUSTOMER ASSUMES THE SOLE RISK AND LIABILITY OF ANY USE OF XILINX PRODUCTS IN SUCH APPLICATIONS.

© 2002 – 2012 Xilinx, Inc. Xilinx, the Xilinx logo, Artix, ISE, Kintex, Spartan, Virtex, Zynq, and other designated brands included herein are trademarks of Xilinx in the United States and other countries. All other trademarks are the property of their respective owners. AMBA, AMBA Designer, ARM, ARM1176JZ-S, CoreSight, Cortex, and PrimeCell are trademarks of ARM in the EU and other countries.

Revision History
The following table shows the revision history for this document.

<table>
<thead>
<tr>
<th>Date</th>
<th>Version</th>
<th>Revision</th>
</tr>
</thead>
<tbody>
<tr>
<td>10/01/02</td>
<td>1.0</td>
<td>Xilinx EDK 3.1 release</td>
</tr>
<tr>
<td>03/11/03</td>
<td>2.0</td>
<td>Xilinx EDK 3.2 release</td>
</tr>
<tr>
<td>09/24/03</td>
<td>3.0</td>
<td>Xilinx EDK 6.1 release</td>
</tr>
<tr>
<td>02/20/04</td>
<td>3.1</td>
<td>Xilinx EDK 6.2 release</td>
</tr>
<tr>
<td>08/24/04</td>
<td>4.0</td>
<td>Xilinx EDK 6.3 release</td>
</tr>
<tr>
<td>09/21/04</td>
<td>4.1</td>
<td>Minor corrections for EDK 6.3 SP1 release</td>
</tr>
<tr>
<td>11/18/04</td>
<td>4.2</td>
<td>Minor corrections for EDK 6.3 SP2 release</td>
</tr>
<tr>
<td>01/20/05</td>
<td>5.0</td>
<td>Xilinx EDK 7.1 release</td>
</tr>
<tr>
<td>04/02/05</td>
<td>5.1</td>
<td>Minor corrections for EDK 7.1 SP1 release</td>
</tr>
<tr>
<td>05/09/05</td>
<td>5.2</td>
<td>Minor corrections for EDK 7.1 SP2 release</td>
</tr>
<tr>
<td>10/05/05</td>
<td>5.3</td>
<td>Minor corrections for EDK 8.1 release</td>
</tr>
<tr>
<td>02/21/06</td>
<td>5.4</td>
<td>Corrections for EDK 8.1 SP2 release</td>
</tr>
<tr>
<td>06/01/06</td>
<td>6.0</td>
<td>Xilinx EDK 8.2 release</td>
</tr>
<tr>
<td>Date</td>
<td>Version</td>
<td>Revision</td>
</tr>
<tr>
<td>-----------</td>
<td>---------</td>
<td>-----------------------------------------------</td>
</tr>
<tr>
<td>07/24/06</td>
<td>6.1</td>
<td>Minor corrections for EDK 8.2 SP1 release</td>
</tr>
<tr>
<td>08/21/06</td>
<td>6.2</td>
<td>Minor corrections for EDK 8.2 SP2 release</td>
</tr>
<tr>
<td>08/29/06</td>
<td>6.3</td>
<td>Minor corrections for EDK 8.2 SP2 release</td>
</tr>
<tr>
<td>09/15/06</td>
<td>7.0</td>
<td>Xilinx EDK 9.1 release</td>
</tr>
<tr>
<td>02/22/07</td>
<td>7.1</td>
<td>Minor corrections for EDK 9.1 SP1 release</td>
</tr>
<tr>
<td>03/27/07</td>
<td>7.2</td>
<td>Minor corrections for EDK 9.1 SP2 release</td>
</tr>
<tr>
<td>06/25/07</td>
<td>8.0</td>
<td>Xilinx EDK 9.2 release</td>
</tr>
<tr>
<td>10/12/07</td>
<td>8.1</td>
<td>Minor corrections for EDK 9.2 SP2 release</td>
</tr>
<tr>
<td>01/17/08</td>
<td>9.0</td>
<td>Xilinx EDK 10.1 release</td>
</tr>
<tr>
<td>03/04/08</td>
<td>9.1</td>
<td>Minor corrections for EDK 10.1 SP1 release</td>
</tr>
<tr>
<td>05/14/08</td>
<td>9.2</td>
<td>Minor corrections for EDK 10.1 SP2 release</td>
</tr>
<tr>
<td>07/14/08</td>
<td>9.3</td>
<td>Minor corrections for EDK 10.1 SP3 release</td>
</tr>
<tr>
<td>02/04/09</td>
<td>10.0</td>
<td>Xilinx EDK 11.1 release</td>
</tr>
<tr>
<td>04/15/09</td>
<td>10.1</td>
<td>Xilinx EDK 11.2 release</td>
</tr>
<tr>
<td>05/28/09</td>
<td>10.2</td>
<td>Xilinx EDK 11.3 release</td>
</tr>
<tr>
<td>10/26/09</td>
<td>10.3</td>
<td>Xilinx EDK 11.4 release</td>
</tr>
<tr>
<td>04/19/10</td>
<td>11.0</td>
<td>Xilinx EDK 12.1 release</td>
</tr>
<tr>
<td>07/23/10</td>
<td>11.1</td>
<td>Xilinx EDK 12.2 release</td>
</tr>
<tr>
<td>09/21/10</td>
<td>11.2</td>
<td>Xilinx EDK 12.3 release</td>
</tr>
<tr>
<td>11/15/10</td>
<td>11.3</td>
<td>Minor corrections for EDK 12.4 release</td>
</tr>
<tr>
<td>11/15/10</td>
<td>11.4</td>
<td>Xilinx EDK 12.4 release</td>
</tr>
<tr>
<td>03/01/11</td>
<td>12.0</td>
<td>Xilinx EDK 13.1 release</td>
</tr>
<tr>
<td>06/22/11</td>
<td>13.2</td>
<td>Xilinx EDK 13.2 release</td>
</tr>
<tr>
<td>10/19/11</td>
<td>13.3</td>
<td>Xilinx EDK 13.3 release</td>
</tr>
<tr>
<td>01/18/12</td>
<td>13.4</td>
<td>Xilinx EDK 13.4 release</td>
</tr>
<tr>
<td>04/24/12</td>
<td>14.1</td>
<td>Xilinx EDK 14.1 release</td>
</tr>
<tr>
<td>07/25/12</td>
<td>14.2</td>
<td>Xilinx EDK 14.2 release</td>
</tr>
<tr>
<td>10/16/12</td>
<td>14.3</td>
<td>Xilinx EDK 14.3 release</td>
</tr>
</tbody>
</table>
# Table of Contents

Revision History .................................................. 2

**Chapter 1: Introduction**

Guide Contents .................................................. 7

**Chapter 2: MicroBlaze Architecture**

Overview ......................................................... 9
Data Types and Endianness ..................................... 13
Instructions ....................................................... 14
Registers .......................................................... 25
Pipeline Architecture ........................................... 50
Memory Architecture ............................................. 52
Privileged Instructions .......................................... 53
Virtual-Memory Management ................................ 54
Reset, Interrupts, Exceptions, and Break ................. 64
Instruction Cache ................................................. 71
Data Cache ........................................................ 73
Floating Point Unit (FPU) ........................................ 77
Stream Link Interfaces ......................................... 81
Debug and Trace .................................................. 82
Fault Tolerance .................................................... 83
Lockstep Operation .............................................. 89

**Chapter 3: MicroBlaze Signal Interface Description**

Overview ......................................................... 93
MicroBlaze I/O Overview ........................................ 94
AXI4 Interface Description .................................... 104
Processor Local Bus (PLB) Interface Description ........ 107
Local Memory Bus (LMB) Interface Description .......... 107
Fast Simplex Link (FSL) Interface Description .......... 117
Xilinx CacheLink (XCL) Interface Description .......... 119
Lockstep Interface Description ................................ 124
Debug Interface Description ................................... 130
Trace Interface Description ................................... 130
MicroBlaze Core Configurability ............................. 133

**Chapter 4: MicroBlaze Application Binary Interface**

Data Types ......................................................... 145
Register Usage Conventions .................................................. 146
Stack Convention ..................................................................... 147
Memory Model .......................................................................... 149
Interrupt and Exception Handling .......................................... 150

Chapter 5: MicroBlaze Instruction Set Architecture
Notation ..................................................................................... 153
Formats ...................................................................................... 155
Instructions ............................................................................... 155

Appendix A: Additional Resources
EDK Documentation .................................................................... 255
Additional Resources ................................................................. 255
Chapter 1

Introduction

The MicroBlaze™ Processor Reference Guide provides information about the 32-bit soft processor, MicroBlaze, which is part of the Embedded Processor Development Kit (EDK). The document is intended as a guide to the MicroBlaze hardware architecture.

Guide Contents

This guide contains the following chapters:

- **Chapter 2, MicroBlaze Architecture**, contains an overview of MicroBlaze features as well as information on Big-Endian and Little-Endian bit-reversed format, 32-bit general purpose registers, cache software support, and Fast Simplex Link interfaces.

- **Chapter 3, MicroBlaze Signal Interface Description**, describes the types of signal interfaces that can be used to connect MicroBlaze.

- **Chapter 4, MicroBlaze Application Binary Interface**, describes the Application Binary Interface important for developing software in assembly language for the soft processor.

- **Chapter 5, MicroBlaze Instruction Set Architecture**, provides notation, formats, and instructions for the Instruction Set Architecture of MicroBlaze.

- **Appendix A, Additional Resources**, provides links to EDK documentation and additional resources.
Chapter 2

MicroBlaze Architecture

This chapter contains an overview of MicroBlaze™ features and detailed information on MicroBlaze architecture including Big-Endian or Little-Endian bit-reversed format, 32-bit general purpose registers, virtual-memory management, cache software support, and Fast Simplex Link (FSL) or AXI4-Stream interfaces.

Overview

The MicroBlaze™ embedded processor soft core is a reduced instruction set computer (RISC) optimized for implementation in Xilinx® Field Programmable Gate Arrays (FPGAs). Figure 2-1 shows a functional block diagram of the MicroBlaze core.

Figure 2-1: MicroBlaze Core Block Diagram
Chapter 2: MicroBlaze Architecture

Features

The MicroBlaze soft core processor is highly configurable, allowing you to select a specific set of features required by your design.

The fixed feature set of the processor includes:

- Thirty-two 32-bit general purpose registers
- 32-bit instruction word with three operands and two addressing modes
- 32-bit address bus
- Single issue pipeline

In addition to these fixed features, the MicroBlaze processor is parameterized to allow selective enabling of additional functionality. Older (deprecated) versions of MicroBlaze support a subset of the optional features described in this manual. Only the latest (preferred) version of MicroBlaze (v8.00) supports all options.

Xilinx recommends that all new designs use the latest preferred version of the MicroBlaze processor.

Table 2-1, page 10 provides an overview of the configurable features by MicroBlaze versions.

<table>
<thead>
<tr>
<th>Feature</th>
<th>v7.30</th>
<th>v8.00</th>
<th>v8.10</th>
<th>v8.20</th>
<th>v8.30</th>
<th>v8.40</th>
</tr>
</thead>
<tbody>
<tr>
<td>Version Status</td>
<td>obsolete</td>
<td>deprecated</td>
<td>deprecated</td>
<td>deprecated</td>
<td>deprecated</td>
<td>preferred</td>
</tr>
<tr>
<td>Processor pipeline depth</td>
<td>3/5</td>
<td>3/5</td>
<td>3/5</td>
<td>3/5</td>
<td>3/5</td>
<td>3/5</td>
</tr>
<tr>
<td>On-chip Peripheral Bus (OPB) data side interface</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td>On-chip Peripheral Bus (OPB) instruction side interface</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td>Local Memory Bus (LMB) data side interface</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Local Memory Bus (LMB) instruction side interface</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Hardware barrel shifter</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Hardware divider</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Hardware debug logic</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Stream link interfaces</td>
<td>0-15 FSL</td>
<td>0-15 FSL/AXI</td>
<td>0-15 FSL/AXI</td>
<td>0-15 FSL/AXI</td>
<td>0-15 FSL/AXI</td>
<td>0-15 FSL/AXI</td>
</tr>
<tr>
<td>Machine status set and clear instructions</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Instruction cache over IOPB interface</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td>Data cache over DOPB interface</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td>Instruction cache over Cache Link (IXCL) interface</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Data cache over Cache Link (DXCL) interface</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>4 or 8-word cache line</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
</tbody>
</table>
Table 2-1: Configurable Feature Overview by MicroBlaze Version

<table>
<thead>
<tr>
<th>Feature</th>
<th>v7.30</th>
<th>v8.00</th>
<th>v8.10</th>
<th>v8.20</th>
<th>v8.30</th>
<th>v8.40</th>
</tr>
</thead>
<tbody>
<tr>
<td>Hardware exception support</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Pattern compare instructions</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Floating point unit (FPU)</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Disable hardware multiplier(^1)</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Hardware debug readable ESR and EAR</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr>
<td>Processor Version Register (PVR)</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Area or speed optimized</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Hardware multiplier 64-bit result</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>LUT cache memory</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Processor Local Bus (PLB) data side interface</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Processor Local Bus (PLB) instruction side interface</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Floating point conversion and square root instructions</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Memory Management Unit (MMU)</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Extended stream instructions</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Use Xilinx Cache Link for All I-Cache Memory Accesses</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Use Xilinx Cache Link for All D-Cache Memory Accesses</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Use Write-back Caching Policy for D-Cache</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Cache Link (DXCL) protocol for D-Cache</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Cache Link (IXCL) protocol for I-Cache</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Branch Target Cache (BTC)</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Streams for I-Cache</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Victim handling for I-Cache</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Victim handling for D-Cache</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>AXI4 (M_AXI_DP) data side interface</td>
<td>-</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>AXI4 (M_AXI_IP) instruction side interface</td>
<td>-</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>AXI4 (M_AXI_DC) protocol for D-Cache</td>
<td>-</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>AXI4 (M_AXI_IC) protocol for I-Cache</td>
<td>-</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>AXI4 protocol for stream accesses</td>
<td>-</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Fault tolerant features</td>
<td>-</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Tool selectable endianness</td>
<td>-</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
<tr>
<td>Force distributed RAM for cache tags</td>
<td>-</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
<td>option</td>
</tr>
</tbody>
</table>
**Table 2-1: Configurable Feature Overview by MicroBlaze Version**

<table>
<thead>
<tr>
<th>Feature</th>
<th>MicroBlaze Versions</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>v7.30</td>
</tr>
<tr>
<td>Configurable cache data widths</td>
<td>-</td>
</tr>
<tr>
<td>Count Leading Zeros instruction</td>
<td>-</td>
</tr>
<tr>
<td>Memory Barrier instruction</td>
<td>-</td>
</tr>
<tr>
<td>Stack overflow and underflow detection</td>
<td>-</td>
</tr>
<tr>
<td>Allow stream instructions in user mode</td>
<td>-</td>
</tr>
<tr>
<td>Lockstep support</td>
<td></td>
</tr>
<tr>
<td>Configurable use of FPGA primitives</td>
<td></td>
</tr>
<tr>
<td>Low-latency interrupt mode</td>
<td></td>
</tr>
<tr>
<td>Swap instructions</td>
<td></td>
</tr>
<tr>
<td>Sleep mode and sleep instruction</td>
<td></td>
</tr>
<tr>
<td>Relocatable base vectors</td>
<td></td>
</tr>
</tbody>
</table>

1. Used in Virtex®-4 and subsequent families, for saving MUL18 and DSP48 primitives.
Data Types and Endianness

MicroBlaze uses Big-Endian or Little-Endian format to represent data, depending on the parameter \texttt{C\_ENDIANNESS}. The hardware supported data types for MicroBlaze are word, half word, and byte. When using the reversed load and store instructions LHUR, LWR, SHR and SWR, the bytes in the data are reversed, as indicated by the byte-reversed order.

The bit and byte organization for each type is shown in the following tables.

\textit{Table 2-2: Word Data Type}

<table>
<thead>
<tr>
<th>Big-Endian Byte Address</th>
<th>n</th>
<th>n+1</th>
<th>n+2</th>
<th>n+3</th>
</tr>
</thead>
<tbody>
<tr>
<td>Big-Endian Byte Significance</td>
<td>MSByte</td>
<td></td>
<td>LSByte</td>
<td></td>
</tr>
<tr>
<td>Big-Endian Byte Order</td>
<td>n</td>
<td>n+1</td>
<td>n+2</td>
<td>n+3</td>
</tr>
<tr>
<td>Big-Endian Byte-Reversed Order</td>
<td>n+3</td>
<td>n+2</td>
<td>n+1</td>
<td>n</td>
</tr>
<tr>
<td>Little-Endian Byte Address</td>
<td>n+3</td>
<td>n+2</td>
<td>n+1</td>
<td>n</td>
</tr>
<tr>
<td>Little-Endian Byte Significance</td>
<td>MSByte</td>
<td></td>
<td>LSByte</td>
<td></td>
</tr>
<tr>
<td>Little-Endian Byte Order</td>
<td>n+3</td>
<td>n+2</td>
<td>n+1</td>
<td>n</td>
</tr>
<tr>
<td>Little-Endian Byte-Reversed Order</td>
<td>n</td>
<td>n+1</td>
<td>n+2</td>
<td>n+3</td>
</tr>
<tr>
<td>Bit Label</td>
<td>0</td>
<td></td>
<td>31</td>
<td></td>
</tr>
<tr>
<td>Bit Significance</td>
<td>MSBit</td>
<td></td>
<td>LSBit</td>
<td></td>
</tr>
</tbody>
</table>

\textit{Table 2-3: Half Word Data Type}

<table>
<thead>
<tr>
<th>Big-Endian Byte Address</th>
<th>n</th>
<th>n+1</th>
</tr>
</thead>
<tbody>
<tr>
<td>Big-Endian Byte Significance</td>
<td>MSByte</td>
<td>LSByte</td>
</tr>
<tr>
<td>Big-Endian Byte Order</td>
<td>n</td>
<td>n+1</td>
</tr>
<tr>
<td>Big-Endian Byte-Reversed Order</td>
<td>n+1</td>
<td>n</td>
</tr>
<tr>
<td>Little-Endian Byte Address</td>
<td>n+1</td>
<td>n</td>
</tr>
<tr>
<td>Little-Endian Byte Significance</td>
<td>MSByte</td>
<td>LSByte</td>
</tr>
<tr>
<td>Little-Endian Byte Order</td>
<td>n+1</td>
<td>n</td>
</tr>
<tr>
<td>Little-Endian Byte-Reversed Order</td>
<td>n</td>
<td>n+1</td>
</tr>
<tr>
<td>Bit Label</td>
<td>0</td>
<td>15</td>
</tr>
<tr>
<td>Bit Significance</td>
<td>MSBit</td>
<td>LSBit</td>
</tr>
</tbody>
</table>

\textit{Table 2-4: Byte Data Type}

<table>
<thead>
<tr>
<th>Byte Address</th>
<th>n</th>
</tr>
</thead>
<tbody>
<tr>
<td>Bit Label</td>
<td>0</td>
</tr>
<tr>
<td>Bit Significance</td>
<td>MSBit</td>
</tr>
</tbody>
</table>
Instructions

Instruction Summary

All MicroBlaze instructions are 32 bits and are defined as either Type A or Type B. Type A instructions have up to two source register operands and one destination register operand. Type B instructions have one source register and a 16-bit immediate operand (which can be extended to 32 bits by preceding the Type B instruction with an imm instruction). Type B instructions have a single destination register operand. Instructions are provided in the following functional categories: arithmetic, logical, branch, load/store, and special. Table 2-6 lists the MicroBlaze instruction set. Refer to Chapter 5, MicroBlaze Instruction Set Architecture for more information on these instructions. Table 2-5 describes the instruction set nomenclature used in the semantics of each instruction.

Table 2-5: Instruction Set Nomenclature

<table>
<thead>
<tr>
<th>Symbol</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Ra</td>
<td>R0 - R31, General Purpose Register, source operand a</td>
</tr>
<tr>
<td>Rb</td>
<td>R0 - R31, General Purpose Register, source operand b</td>
</tr>
<tr>
<td>Rd</td>
<td>R0 - R31, General Purpose Register, destination operand</td>
</tr>
<tr>
<td>SPR[x]</td>
<td>Special Purpose Register number x</td>
</tr>
<tr>
<td>MSR</td>
<td>Machine Status Register = SPR[1]</td>
</tr>
<tr>
<td>ESR</td>
<td>Exception Status Register = SPR[5]</td>
</tr>
<tr>
<td>EAR</td>
<td>Exception Address Register = SPR[3]</td>
</tr>
<tr>
<td>FSR</td>
<td>Floating Point Unit Status Register = SPR[7]</td>
</tr>
<tr>
<td>PVRx</td>
<td>Processor Version Register, where x is the register number = SPR[8192 + x]</td>
</tr>
<tr>
<td>BTR</td>
<td>Branch Target Register = SPR[11]</td>
</tr>
<tr>
<td>PC</td>
<td>Execute stage Program Counter = SPR[0]</td>
</tr>
<tr>
<td>x[y]</td>
<td>Bit y of register x</td>
</tr>
<tr>
<td>x[y:z]</td>
<td>Bit range y to z of register x</td>
</tr>
<tr>
<td>x̄</td>
<td>Bit inverted value of register x</td>
</tr>
<tr>
<td>Imm</td>
<td>16 bit immediate value</td>
</tr>
<tr>
<td>Immx</td>
<td>x bit immediate value</td>
</tr>
<tr>
<td>FSLx</td>
<td>4 bit Fast Simplex Link (FSL) or AXI4-Stream port designator, where x is the port number</td>
</tr>
<tr>
<td>C</td>
<td>Carry flag, MSR[29]</td>
</tr>
<tr>
<td>Sa</td>
<td>Special Purpose Register, source operand</td>
</tr>
<tr>
<td>Sd</td>
<td>Special Purpose Register, destination operand</td>
</tr>
<tr>
<td>s(x)</td>
<td>Sign extend argument x to 32-bit value</td>
</tr>
<tr>
<td>*Addr</td>
<td>Memory contents at location Addr (data-size aligned)</td>
</tr>
<tr>
<td>:=</td>
<td>Assignment operator</td>
</tr>
</tbody>
</table>
Table 2-5:  **Instruction Set Nomenclature (Continued)**

<table>
<thead>
<tr>
<th>Symbol</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>=</td>
<td>Equality comparison</td>
</tr>
<tr>
<td>!=</td>
<td>Inequality comparison</td>
</tr>
<tr>
<td>&gt;</td>
<td>Greater than comparison</td>
</tr>
<tr>
<td>&gt;=</td>
<td>Greater than or equal comparison</td>
</tr>
<tr>
<td>&lt;</td>
<td>Less than comparison</td>
</tr>
<tr>
<td>&lt;=</td>
<td>Less than or equal comparison</td>
</tr>
<tr>
<td>+</td>
<td>Arithmetic add</td>
</tr>
<tr>
<td>*</td>
<td>Arithmetic multiply</td>
</tr>
<tr>
<td>/</td>
<td>Arithmetic divide</td>
</tr>
<tr>
<td>&gt;&gt; x</td>
<td>Bit shift right x bits</td>
</tr>
<tr>
<td>&lt;&lt; x</td>
<td>Bit shift left x bits</td>
</tr>
<tr>
<td>and</td>
<td>Logic AND</td>
</tr>
<tr>
<td>or</td>
<td>Logic OR</td>
</tr>
<tr>
<td>xor</td>
<td>Logic exclusive OR</td>
</tr>
<tr>
<td>op1 if cond else op2</td>
<td>Perform op1 if condition cond is true, else perform op2</td>
</tr>
<tr>
<td>&amp;</td>
<td>Concatenate. E.g. “0000100 &amp; Imm7” is the concatenation of the fixed field “0000100” and a 7 bit immediate value.</td>
</tr>
<tr>
<td>signed</td>
<td>Operation performed on signed integer data type. All arithmetic operations are performed on signed word operands, unless otherwise specified</td>
</tr>
<tr>
<td>unsigned</td>
<td>Operation performed on unsigned integer data type</td>
</tr>
<tr>
<td>float</td>
<td>Operation performed on floating point data type</td>
</tr>
<tr>
<td>clz(r)</td>
<td>Count leading zeros</td>
</tr>
</tbody>
</table>

Table 2-6:  **MicroBlaze Instruction Set Summary**

<table>
<thead>
<tr>
<th>Type A</th>
<th>0-5</th>
<th>6-10</th>
<th>11-15</th>
<th>16-20</th>
<th>21-31</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADD Rd,Ra,Rb</td>
<td>000000</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Rb + Ra</td>
</tr>
<tr>
<td>RSUB Rd,Ra,Rb</td>
<td>000001</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Rb + Ra + 1</td>
</tr>
<tr>
<td>ADDC Rd,Ra,Rb</td>
<td>000010</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Rb + Ra + C</td>
</tr>
<tr>
<td>RSUBC Rd,Ra,Rb</td>
<td>000011</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Rb + Ra + C</td>
</tr>
<tr>
<td>ADDK Rd,Ra,Rb</td>
<td>000100</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Rb + Ra</td>
</tr>
<tr>
<td>RSUBK Rd,Ra,Rb</td>
<td>000101</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Rb + Ra + 1</td>
</tr>
<tr>
<td>ADDKC Rd,Ra,Rb</td>
<td>000110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Rb + Ra + C</td>
</tr>
</tbody>
</table>
### MicroBlaze Instruction Set Summary (Continued)

<table>
<thead>
<tr>
<th>Type</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>0-5</td>
<td>6-10</td>
</tr>
<tr>
<td>Type A</td>
<td></td>
</tr>
<tr>
<td>RSUBKC Rd, Ra, Rb</td>
<td>Rb := Ra + C</td>
</tr>
<tr>
<td>CMP Rd, Ra, Rb</td>
<td>Rd[0] := 0 if (Rb &gt;= Ra) else Rd[0] := 1</td>
</tr>
<tr>
<td>CMPU Rd, Ra, Rb</td>
<td>Rd := Rb + Ra + 1 (unsigned)</td>
</tr>
<tr>
<td>ADDI Rd, Ra, Imm</td>
<td>Rd := s(Imm) + Ra</td>
</tr>
<tr>
<td>RSUBI Rd, Ra, Imm</td>
<td>Rd := s(Imm) + Ra</td>
</tr>
<tr>
<td>ADDIC Rd, Ra, Imm</td>
<td>Rd := s(Imm) + Ra + C</td>
</tr>
<tr>
<td>RSUBIC Rd, Ra, Imm</td>
<td>Rd := s(Imm) + Ra + C</td>
</tr>
<tr>
<td>ADDIK Rd, Ra, Imm</td>
<td>Rd := s(Imm) + Ra</td>
</tr>
<tr>
<td>RSUBIK Rd, Ra, Imm</td>
<td>Rd := s(Imm) + Ra + 1</td>
</tr>
<tr>
<td>ADDIKC Rd, Ra, Imm</td>
<td>Rd := s(Imm) + Ra + C</td>
</tr>
<tr>
<td>MUL Rd, Ra, Rb</td>
<td>Rd := Ra * Rb</td>
</tr>
<tr>
<td>MULH Rd, Ra, Rb</td>
<td>Rd := (Ra * Rb) &gt;&gt; 32 (signed)</td>
</tr>
<tr>
<td>MULHU Rd, Ra, Rb</td>
<td>Rd := (Ra * Rb) &gt;&gt; 32 (unsigned)</td>
</tr>
<tr>
<td>MULHSU Rd, Ra, Rb</td>
<td>Rd := (Ra, signed * Rb, unsigned) &gt;&gt; 32 (signed)</td>
</tr>
<tr>
<td>BSRA Rd, Ra, Rb</td>
<td>Rd := s(Ra &gt;&gt; Rb)</td>
</tr>
<tr>
<td>BSLL Rd, Ra, Rb</td>
<td>Rd := 0 &amp; (Ra &gt;&gt; Imm5)</td>
</tr>
<tr>
<td>MULI Rd, Ra, Imm</td>
<td>Rd := Ra * s(Imm)</td>
</tr>
<tr>
<td>BSRLI Rd, Ra, Imm</td>
<td>Rd := = 0 &amp; (Ra &gt;&gt; Imm5)</td>
</tr>
<tr>
<td>BSRAI Rd, Ra, Imm</td>
<td>Rd := s(Ra &gt;&gt; Imm5)</td>
</tr>
<tr>
<td>BSLLI Rd, Ra, Imm</td>
<td>Rd := (Ra &lt;&lt; Imm5) &amp; 0</td>
</tr>
<tr>
<td>IDIV Rd, Ra, Rb</td>
<td>Rd := Rb/Ra</td>
</tr>
<tr>
<td>IDIVU Rd, Ra, Rb</td>
<td>Rd := Rb/Ra, unsigned</td>
</tr>
<tr>
<td>TNEAGETD Rd, Rb</td>
<td>Rd := FSL Rb[28:31] (data read)</td>
</tr>
</tbody>
</table>

### Table 2-6: MicroBlaze Instruction Set Summary

<table>
<thead>
<tr>
<th>Type</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>0-5</td>
<td>6-10</td>
</tr>
<tr>
<td>Type B</td>
<td></td>
</tr>
<tr>
<td>MUL Rd, Ra, Rb</td>
<td>Rd := Ra * Rb</td>
</tr>
<tr>
<td>MULH Rd, Ra, Rb</td>
<td>Rd := (Ra * Rb) &gt;&gt; 32 (signed)</td>
</tr>
<tr>
<td>MULHU Rd, Ra, Rb</td>
<td>Rd := (Ra * Rb) &gt;&gt; 32 (unsigned)</td>
</tr>
<tr>
<td>MULHSU Rd, Ra, Rb</td>
<td>Rd := (Ra, signed * Rb, unsigned) &gt;&gt; 32 (signed)</td>
</tr>
<tr>
<td>BSRA Rd, Ra, Rb</td>
<td>Rd := s(Ra &gt;&gt; Rb)</td>
</tr>
<tr>
<td>BSLL Rd, Ra, Rb</td>
<td>Rd := 0 &amp; (Ra &gt;&gt; Imm5)</td>
</tr>
<tr>
<td>MULI Rd, Ra, Imm</td>
<td>Rd := Ra * s(Imm)</td>
</tr>
<tr>
<td>BSRLI Rd, Ra, Imm</td>
<td>Rd := = 0 &amp; (Ra &gt;&gt; Imm5)</td>
</tr>
<tr>
<td>BSRAI Rd, Ra, Imm</td>
<td>Rd := s(Ra &gt;&gt; Imm5)</td>
</tr>
<tr>
<td>BSLLI Rd, Ra, Imm</td>
<td>Rd := (Ra &lt;&lt; Imm5) &amp; 0</td>
</tr>
<tr>
<td>IDIV Rd, Ra, Rb</td>
<td>Rd := Rb/Ra</td>
</tr>
<tr>
<td>IDIVU Rd, Ra, Rb</td>
<td>Rd := Rb/Ra, unsigned</td>
</tr>
<tr>
<td>TNEAGETD Rd, Rb</td>
<td>Rd := FSL Rb[28:31] (data read)</td>
</tr>
</tbody>
</table>

MSR[FSL] := 1 if (FSL_S_Control = 1) MSR[C] := not FSL_S_Exists if N = 1
Instructions

Table 2-6: MicroBlaze Instruction Set Summary (Continued)

<table>
<thead>
<tr>
<th>Type A</th>
<th>Type B</th>
<th>0-5</th>
<th>6-10</th>
<th>11-15</th>
<th>16-20</th>
<th>21-31</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>TNA/PUTD Ra,Rb</td>
<td>010011</td>
<td>00000</td>
<td>Ra</td>
<td>Rb</td>
<td>0N0740 0000</td>
<td>FSL Rb[28:31] := Ra (data write) MSR[C] := FSL_M_Full if N = 1</td>
<td></td>
</tr>
<tr>
<td>TNECA/GETD Rd,Rb</td>
<td>010011</td>
<td>Rd</td>
<td>00000</td>
<td>Rb</td>
<td>0N1TA0 0000</td>
<td>Rd := FSL Rb[28:31] (control read) MSR[FSL] := 1 if (FSL_S_Control = 0) MSR[C] := not FSL_S_Exists if N = 1</td>
<td></td>
</tr>
<tr>
<td>TNA/PUTD Ra,Rb</td>
<td>010011</td>
<td>00000</td>
<td>Ra</td>
<td>Rb</td>
<td>0N1TA0 0000</td>
<td>FSL Rb[28:31] := Ra (control write) MSR[C] := FSL_M_Full if N = 1</td>
<td></td>
</tr>
<tr>
<td>FADD Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0000000000</td>
<td>Rd := Rb+Ra, float1</td>
<td></td>
</tr>
<tr>
<td>FRSUB Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0000000000</td>
<td>Rd := Rb-Ra, float1</td>
<td></td>
</tr>
<tr>
<td>FMUL Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0000000000</td>
<td>Rd := Rb*Ra, float1</td>
<td></td>
</tr>
<tr>
<td>FDIV Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0011000000</td>
<td>Rd := Rb/Ra, float1</td>
<td></td>
</tr>
<tr>
<td>FCMP.UN Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0100000000</td>
<td>Rd := 1 if (Rb = NaN or Ra = NaN, float1) else Rd := 0</td>
<td></td>
</tr>
<tr>
<td>FCMP.LT Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0100010000</td>
<td>Rd := 1 if (Rb &lt; Ra, float1) else Rd := 0</td>
<td></td>
</tr>
<tr>
<td>FCMP.EQ Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0100010000</td>
<td>Rd := 1 if (Rb = Ra, float1) else Rd := 0</td>
<td></td>
</tr>
<tr>
<td>FCMP.LE Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0100000000</td>
<td>Rd := 1 if (Rb &lt;= Ra, float1) else Rd := 0</td>
<td></td>
</tr>
<tr>
<td>FCMP.GT Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0100100000</td>
<td>Rd := 1 if (Rb &gt; Ra, float1) else Rd := 0</td>
<td></td>
</tr>
<tr>
<td>FCMP.NE Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0100100000</td>
<td>Rd := 1 if (Rb != Ra, float1) else Rd := 0</td>
<td></td>
</tr>
<tr>
<td>FCMP.GE Rd,Ra,Rb</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>0100100000</td>
<td>Rd := 1 if (Rb &gt;= Ra, float1) else Rd := 0</td>
<td></td>
</tr>
<tr>
<td>FLT Rd,Ra</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>0</td>
<td>0101000000</td>
<td>Rd := float (Ra)1</td>
<td></td>
</tr>
<tr>
<td>FINT Rd,Ra</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>0</td>
<td>0110000000</td>
<td>Rd := int (Ra)1</td>
<td></td>
</tr>
<tr>
<td>FSQRT Rd,Ra</td>
<td>010110</td>
<td>Rd</td>
<td>Ra</td>
<td>0</td>
<td>0111000000</td>
<td>Rd := sqrt (Ra)1</td>
<td></td>
</tr>
<tr>
<td>TNEA/GETd Rd,FSLx</td>
<td>011011</td>
<td>Rd</td>
<td>00000</td>
<td>0N07AE000000 &amp; FSLx</td>
<td>FSLx := FSLx (data read, blocking if N = 0) MSR[FSLx] := 1 if (FSLx_S_Control = 1) MSR[C] := not FSLx_S_Exists if N = 1</td>
<td></td>
<td></td>
</tr>
<tr>
<td>TNA/PUTd Ra,FSLx</td>
<td>011011</td>
<td>00000</td>
<td>Ra</td>
<td>1N0740000000 &amp; FSLx</td>
<td>FSLx := Ra (data write, blocking if N = 0) MSR[C] := FSLx_M_Full if N = 1</td>
<td></td>
<td></td>
</tr>
<tr>
<td>TNECA/GETd Rd,FSLx</td>
<td>011011</td>
<td>Rd</td>
<td>00000</td>
<td>0N1TA000000 &amp; FSLx</td>
<td>FSLx := FSLx (control read, blocking if N = 0) MSR[FSLx] := 1 if (FSLx_S_Control = 0) MSR[C] := not FSLx_S_Exists if N = 1</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>
Table 2-6: MicroBlaze Instruction Set Summary (Continued)

<table>
<thead>
<tr>
<th>Type A</th>
<th>0-5</th>
<th>6-10</th>
<th>11-15</th>
<th>16-20</th>
<th>21-31</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>Type B</td>
<td>0-5</td>
<td>6-10</td>
<td>11-15</td>
<td>16-15</td>
<td>16-31</td>
<td></td>
</tr>
<tr>
<td>TNCAPUT Ra,FSLx</td>
<td>011011</td>
<td>00000</td>
<td>Ra</td>
<td>11110000000000 &amp; FSLx</td>
<td>FSLx := Ra (control write, blocking if (N = 0)) MSR[C] := FSLx.M_Full if (N = 1)</td>
<td></td>
</tr>
<tr>
<td>OR Rd,Ra,Rb</td>
<td>100000</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Ra or Rb</td>
</tr>
<tr>
<td>AND Rd,Ra,Rb</td>
<td>100001</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Ra and Rb</td>
</tr>
<tr>
<td>XOR Rd,Ra,Rb</td>
<td>100010</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Ra xor Rb</td>
</tr>
<tr>
<td>ANDN Rd,Ra,Rb</td>
<td>100011</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>Rd := Ra and Rb</td>
</tr>
<tr>
<td>PCMPEQ Rd,Ra,Rb</td>
<td>100010</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>10000000000</td>
<td>Rd := 1 if ((Rd = Ra)) else Rd := 0</td>
</tr>
<tr>
<td>PCMPNE Rd,Ra,Rb</td>
<td>100011</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>10000000000</td>
<td>Rd := 1 if ((Rd != Ra)) else Rd := 0</td>
</tr>
<tr>
<td>SRA Rd,Ra</td>
<td>100100</td>
<td>Rd</td>
<td>Ra</td>
<td>000000000000001</td>
<td>Rd := s(Ra &gt;&gt; 1) C := Ra[31]</td>
<td></td>
</tr>
<tr>
<td>SRC Rd,Ra</td>
<td>100100</td>
<td>Rd</td>
<td>Ra</td>
<td>000000000000001000001</td>
<td>Rd := C &amp; (Ra &gt;&gt; 1) C := Ra[31]</td>
<td></td>
</tr>
<tr>
<td>SRL Rd,Ra</td>
<td>100100</td>
<td>Rd</td>
<td>Ra</td>
<td>000000000000001000001</td>
<td>Rd := 0 &amp; (Ra &gt;&gt; 1) C := Ra[31]</td>
<td></td>
</tr>
<tr>
<td>SEXT8 Rd,Ra</td>
<td>100100</td>
<td>Rd</td>
<td>Ra</td>
<td>00000000001100000</td>
<td>Rd := s(Ra[24:31])</td>
<td></td>
</tr>
<tr>
<td>SEXT16 Rd,Ra</td>
<td>100100</td>
<td>Rd</td>
<td>Ra</td>
<td>00000000001100000</td>
<td>Rd := s(Ra[16:31])</td>
<td></td>
</tr>
<tr>
<td>CLZ Rd,Ra</td>
<td>100100</td>
<td>Rd</td>
<td>Ra</td>
<td>00000000011100000</td>
<td>Rd = clz(Ra)</td>
<td></td>
</tr>
<tr>
<td>SWAPB Rd,Ra</td>
<td>100100</td>
<td>Rd</td>
<td>Ra</td>
<td>00000001111000000</td>
<td>Rd = (Ra)[24:31, 16:23, 8:15, 0:7]</td>
<td></td>
</tr>
<tr>
<td>SWAPH Rd,Ra</td>
<td>100100</td>
<td>Rd</td>
<td>Ra</td>
<td>00000001111000010</td>
<td>Rd = (Ra)[16:31, 0:15]</td>
<td></td>
</tr>
<tr>
<td>WIC Ra,Rb</td>
<td>100100</td>
<td>00000</td>
<td>Ra</td>
<td>Rb</td>
<td>00001101000</td>
<td>ICache_Line[Ra &gt;&gt; 4].Tag := 0 if (C.ICACHE_LINE_LEN = 4) ICache_Line[Ra &gt;&gt; 5].Tag := 0 if (C.ICACHE_LINE_LEN = 8)</td>
</tr>
<tr>
<td>WDC Ra,Rb</td>
<td>100100</td>
<td>00000</td>
<td>Ra</td>
<td>Rb</td>
<td>00001100100</td>
<td>Cache line is cleared, discarding stored data. DCache_Line[Ra &gt;&gt; 4].Tag := 0 if (C.DCACHE_LINE_LEN = 4) DCache_Line[Ra &gt;&gt; 5].Tag := 0 if (C.DCACHE_LINE_LEN = 8)</td>
</tr>
</tbody>
</table>
### Table 2-6: MicroBlaze Instruction Set Summary (Continued)

<table>
<thead>
<tr>
<th>Type</th>
<th>0-5</th>
<th>6-10</th>
<th>11-15</th>
<th>16-20</th>
<th>21-31</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td><strong>Type B</strong></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>WDC.FLUSH Ra,Rb</td>
<td>100100</td>
<td>00000</td>
<td>Ra</td>
<td>Rb</td>
<td>00001110100</td>
<td>Cache line is flushed, writing stored data to memory, and then cleared. Used when \texttt{C_DCACHE_USE_WRITEBACK} = 1.</td>
</tr>
<tr>
<td>WDC.CLEAR Ra,Rb</td>
<td>100100</td>
<td>00000</td>
<td>Ra</td>
<td>Rb</td>
<td>00001110110</td>
<td>Cache line with matching address is cleared, discarding stored data. Used when \texttt{C_DCACHE_USE_WRITEBACK} = 1.</td>
</tr>
<tr>
<td>MBAR Imm</td>
<td>101110</td>
<td>Imm</td>
<td>00010</td>
<td>0000000000000100</td>
<td>PC := PC + 4; Wait for memory accesses.</td>
<td></td>
</tr>
<tr>
<td>MTS Sd,Ra</td>
<td>100101</td>
<td>00000</td>
<td>Ra</td>
<td>11 &amp; Sd</td>
<td></td>
<td>SPR[Sd] := Ra, where:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x0001] is MSR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x0007] is FSR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x0080] is SLR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x0082] is SHR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x1000] is PID</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x1001] is ZPR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x1002] is TLBX</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x1003] is TLBLO</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x1004] is TLBHI</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x1005] is TLBSX</td>
</tr>
<tr>
<td>MFS Rd,Sa</td>
<td>100101</td>
<td>Rd</td>
<td>00000</td>
<td>10 &amp; Sa</td>
<td></td>
<td>Rd := SPR[Sa], where:</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x0000] is PC</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x0001] is MSR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x0003] is EAR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x0005] is ESR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x0007] is FSR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x0008] is SLR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x000A] is SHR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x1000] is PID</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x1001] is ZPR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x1002] is TLBX</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x1003] is TLBLO</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• SPR[0x2000 to 0x200B] is PVR[0 to 11]</td>
</tr>
<tr>
<td>MSRCLR Rd,Imm</td>
<td>100101</td>
<td>Rd</td>
<td>00001</td>
<td>00 &amp; Imm14</td>
<td></td>
<td>Rd := MSR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• MSR := MSR and \texttt{Imm14}</td>
</tr>
<tr>
<td>MSRSET Rd,Imm</td>
<td>100101</td>
<td>Rd</td>
<td>00000</td>
<td>00 &amp; Imm14</td>
<td></td>
<td>Rd := MSR</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>• MSR := MSR or \texttt{Imm14}</td>
</tr>
<tr>
<td>BR Rb</td>
<td>100110</td>
<td>00000</td>
<td>00000</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb</td>
</tr>
<tr>
<td>BRD Rb</td>
<td>100110</td>
<td>00000</td>
<td>10000</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb</td>
</tr>
</tbody>
</table>
Chapter 2: MicroBlaze Architecture

Table 2-6: MicroBlaze Instruction Set Summary (Continued)

<table>
<thead>
<tr>
<th>Type A</th>
<th>0-5</th>
<th>6-10</th>
<th>11-15</th>
<th>16-20</th>
<th>21-31</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>BRLD Rd,Rb</td>
<td>100110</td>
<td>Rd</td>
<td>10100</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb</td>
</tr>
<tr>
<td>BRA Rb</td>
<td>100110</td>
<td>00000</td>
<td>01000</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := Rb</td>
</tr>
<tr>
<td>BRAD Rb</td>
<td>100110</td>
<td>00000</td>
<td>11000</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := Rb</td>
</tr>
<tr>
<td>BRALD Rd,Rb</td>
<td>100110</td>
<td>Rd</td>
<td>11100</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := Rb</td>
</tr>
<tr>
<td>BRK Rd,Rb</td>
<td>100110</td>
<td>Rd</td>
<td>01100</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := Rb</td>
</tr>
<tr>
<td>BEQ Ra,Rb</td>
<td>100111</td>
<td>00000</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra = 0</td>
</tr>
<tr>
<td>BNE Ra,Rb</td>
<td>100111</td>
<td>00001</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra != 0</td>
</tr>
<tr>
<td>BLT Ra,Rb</td>
<td>100111</td>
<td>00010</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra &lt; 0</td>
</tr>
<tr>
<td>BLE Ra,Rb</td>
<td>100111</td>
<td>00011</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra &lt;= 0</td>
</tr>
<tr>
<td>BGT Ra,Rb</td>
<td>100111</td>
<td>00100</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra &gt; 0</td>
</tr>
<tr>
<td>BGE Ra,Rb</td>
<td>100111</td>
<td>00101</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra &gt;= 0</td>
</tr>
<tr>
<td>BEQD Ra,Rb</td>
<td>100111</td>
<td>10000</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra = 0</td>
</tr>
<tr>
<td>BNED Ra,Rb</td>
<td>100111</td>
<td>10001</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra != 0</td>
</tr>
<tr>
<td>BLTD Ra,Rb</td>
<td>100111</td>
<td>10010</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra &lt; 0</td>
</tr>
<tr>
<td>BLED Ra,Rb</td>
<td>100111</td>
<td>10011</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra &lt;= 0</td>
</tr>
<tr>
<td>BGT D Ra,Rb</td>
<td>100111</td>
<td>10100</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra &gt; 0</td>
</tr>
<tr>
<td>BGED Ra,Rb</td>
<td>100111</td>
<td>10101</td>
<td>Ra</td>
<td>Rb</td>
<td>00000000000</td>
<td>PC := PC + Rb if Ra &gt;= 0</td>
</tr>
<tr>
<td>ORI Rd,Ra,Imm</td>
<td>101000</td>
<td>Rd</td>
<td>Ra</td>
<td>Imm</td>
<td>Rd := Ra or s(Imm)</td>
<td></td>
</tr>
<tr>
<td>ANDI Rd,Ra,Imm</td>
<td>101001</td>
<td>Rd</td>
<td>Ra</td>
<td>Imm</td>
<td>Rd := Ra and s(Imm)</td>
<td></td>
</tr>
<tr>
<td>XORI Rd,Ra,Imm</td>
<td>101010</td>
<td>Rd</td>
<td>Ra</td>
<td>Imm</td>
<td>Rd := Ra xor s(Imm)</td>
<td></td>
</tr>
<tr>
<td>ANDNI Rd,Ra,Imm</td>
<td>101011</td>
<td>Rd</td>
<td>Ra</td>
<td>Imm</td>
<td>Rd := Ra and s(Imm)</td>
<td></td>
</tr>
<tr>
<td>IMM Imm</td>
<td>101100</td>
<td>00000</td>
<td>00000</td>
<td>Imm</td>
<td>Imm[0:15] := Imm</td>
<td></td>
</tr>
<tr>
<td>RTSD Ra,Imm</td>
<td>101101</td>
<td>10000</td>
<td>Ra</td>
<td>Imm</td>
<td>PC := Ra + s(Imm)</td>
<td></td>
</tr>
<tr>
<td>RTID Ra,Imm</td>
<td>101101</td>
<td>10001</td>
<td>Ra</td>
<td>Imm</td>
<td>PC := Ra + s(Imm)</td>
<td></td>
</tr>
<tr>
<td>RTBD Ra,Imm</td>
<td>101101</td>
<td>10010</td>
<td>Ra</td>
<td>Imm</td>
<td>PC := Ra + s(Imm)</td>
<td></td>
</tr>
<tr>
<td>RTED Ra,Imm</td>
<td>101101</td>
<td>10100</td>
<td>Ra</td>
<td>Imm</td>
<td>PC := Ra + s(Imm)</td>
<td></td>
</tr>
</tbody>
</table>
### Instructions

#### Table 2-6: MicroBlaze Instruction Set Summary (Continued)

<table>
<thead>
<tr>
<th>Type A</th>
<th>0-5</th>
<th>6-10</th>
<th>11-15</th>
<th>16-20</th>
<th>21-31</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>BRI Imm</td>
<td>101110 00000 00000</td>
<td>Imm</td>
<td>PC := PC + s(Imm)</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BRID Imm</td>
<td>101110 00000 10000</td>
<td>Imm</td>
<td>PC := PC + s(Imm)</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BRLID Rd,Imm</td>
<td>101110 Rd 10100</td>
<td>Imm</td>
<td>PC := PC + s(Imm)</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BRAI Imm</td>
<td>101110 00000 01000</td>
<td>Imm</td>
<td>PC := s(Imm)</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BRAID Imm</td>
<td>101110 00000 11000</td>
<td>Imm</td>
<td>PC := s(Imm)</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BRALID Rd,Imm</td>
<td>101110 Rd 11100</td>
<td>Imm</td>
<td>PC := s(Imm)</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BRKI Rd,Imm</td>
<td>101110 Rd 01100</td>
<td>Imm</td>
<td>PC := s(Imm)</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BEQI Ra,Imm</td>
<td>101111 00000 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra = 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BNEI Ra,Imm</td>
<td>101111 00001 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra != 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BLTI Ra,Imm</td>
<td>101111 00010 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra &lt; 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BLEI Ra,Imm</td>
<td>101111 00011 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra &lt;= 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BGTI Ra,Imm</td>
<td>101111 00100 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra &gt; 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BGEI Ra,Imm</td>
<td>101111 00101 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra &gt;= 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BEQID Ra,Imm</td>
<td>101111 10000 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra = 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BNEID Ra,Imm</td>
<td>101111 10001 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra != 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BLTID Ra,Imm</td>
<td>101111 10010 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra &lt; 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BLEID Ra,Imm</td>
<td>101111 10011 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra &lt;= 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BGTD Ra,Imm</td>
<td>101111 10100 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra &gt; 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>BGEID Ra,Imm</td>
<td>101111 10101 Ra</td>
<td>Imm</td>
<td>PC := PC + s(Imm) if Ra &gt;= 0</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>LBU Rd,Ra,Rb</td>
<td>110000 Rd Ra Rb</td>
<td>Addr := Ra + Rb</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>LBUR Rd,Ra,Rb</td>
<td>110000 Rd Ra Rb</td>
<td>Rd[0:23] := 0</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>LHU Rd,Ra,Rb</td>
<td>110001 Rd Ra Rb</td>
<td>Addr := Ra + Rb</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>LHUR Rd,Ra,Rb</td>
<td>110001 Rd Ra Rb</td>
<td>Rd[0:15] := 0</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>LW Rd,Ra,Rb</td>
<td>110010 Rd Ra Rb</td>
<td>Addr := Ra + Rb</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>LWR Rd,Ra,Rb</td>
<td>110010 Rd Ra Rb</td>
<td>Rd := *Addr</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>LWX Rd,Ra,Rb</td>
<td>110010 Rd Ra Rb</td>
<td>Addr := Ra + Rb</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>Reservation := 1</td>
<td></td>
</tr>
</tbody>
</table>
Semaphore Synchronization

The LWX and SWX instructions are used to implement common semaphore operations, including test and set, compare and swap, exchange memory, and fetch and add. They are also used to implement spinlocks.

These instructions are typically used by system programs and are called by application programs as needed. Generally, a program uses LWX to load a semaphore from memory, causing the reservation to be set (the processor maintains the reservation internally). The program can compute a result based on the semaphore value and conditionally store the result back to the same memory location using the SWX instruction. The conditional store is performed based on the existence of the reservation established by the preceding LWX instruction. If the reservation exists when the store is executed, the store is performed and MSR\[C\] is cleared to 0. If the reservation does not exist when the store is executed, the target memory location is not modified and MSR\[C\] is set to 1.

If the store is successful, the sequence of instructions from the semaphore load to the semaphore store appear to be executed atomically—no other device modified the semaphore location between

---

### Table 2-6: MicroBlaze Instruction Set Summary (Continued)

<table>
<thead>
<tr>
<th>Type A</th>
<th>0-5</th>
<th>6-10</th>
<th>11-15</th>
<th>16-20</th>
<th>21-31</th>
<th>Semantics</th>
</tr>
</thead>
<tbody>
<tr>
<td>Type B</td>
<td>0-5</td>
<td>6-10</td>
<td>11-15</td>
<td>16-31</td>
<td></td>
<td></td>
</tr>
<tr>
<td>---------</td>
<td>-----</td>
<td>------</td>
<td>-------</td>
<td>-------</td>
<td></td>
<td></td>
</tr>
<tr>
<td>SB Rd,Ra,Rb</td>
<td>110100</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>Addr := Ra + Rb</td>
<td></td>
</tr>
<tr>
<td>SBR Rd,Ra,Rb</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>*Addr[0:8] := Rd[24:31]</td>
<td></td>
</tr>
<tr>
<td>SH Rd,Ra,Rb</td>
<td>110101</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>Addr := Ra + Rb</td>
<td></td>
</tr>
<tr>
<td>SHR Rd,Ra,Rb</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>*Addr[0:16] := Rd[16:31]</td>
<td></td>
</tr>
<tr>
<td>SW Rd,Ra,Rb</td>
<td>110110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>Addr := Ra + Rb</td>
<td></td>
</tr>
<tr>
<td>SWR Rd,Ra,Rb</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>*Addr := Rd</td>
<td></td>
</tr>
<tr>
<td>SWX Rd,Ra,Rb</td>
<td>110110</td>
<td>Rd</td>
<td>Ra</td>
<td>Rb</td>
<td>Addr := Ra + Rb</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>*Addr := Rd if Reservation = 1</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>Reservation := 0</td>
<td></td>
</tr>
<tr>
<td>LBUI Rd,Ra,Imm</td>
<td>111000</td>
<td>Rd</td>
<td>Ra</td>
<td>Imm</td>
<td>Addr := Ra + s(Imm)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>Rd[0:23] := 0</td>
<td></td>
</tr>
<tr>
<td>LHUI Rd,Ra,Imm</td>
<td>111001</td>
<td>Rd</td>
<td>Ra</td>
<td>Imm</td>
<td>Addr := Ra + s(Imm)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>Rd[0:15] := 0</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>Rd[16:31] := *Addr[0:15]</td>
<td></td>
</tr>
<tr>
<td>LWI Rd,Ra,Imm</td>
<td>111010</td>
<td>Rd</td>
<td>Ra</td>
<td>Imm</td>
<td>Addr := Ra + s(Imm)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>Rd := *Addr</td>
<td></td>
</tr>
<tr>
<td>SBI Rd,Ra,Imm</td>
<td>111100</td>
<td>Rd</td>
<td>Ra</td>
<td>Imm</td>
<td>Addr := Ra + s(Imm)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>*Addr[0:7] := Rd[24:31]</td>
<td></td>
</tr>
<tr>
<td>SHI Rd,Ra,Imm</td>
<td>111101</td>
<td>Rd</td>
<td>Ra</td>
<td>Imm</td>
<td>Addr := Ra + s(Imm)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>*Addr[0:15] := Rd[16:31]</td>
<td></td>
</tr>
<tr>
<td>SWI Rd,Ra,Imm</td>
<td>111110</td>
<td>Rd</td>
<td>Ra</td>
<td>Imm</td>
<td>Addr := Ra + s(Imm)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>*Addr := Rd</td>
<td></td>
</tr>
</tbody>
</table>

1. Due to the many different corner cases involved in floating point arithmetic, only the normal behavior is described. A full description of the behavior can be found in Chapter 5, “MicroBlaze Instruction Set Architecture.”
the read and the update. Other devices can read from the semaphore location during the operation. For a semaphore operation to work properly, the LWX instruction must be paired with an SWX instruction, and both must specify identical addresses. The reservation granularity in MicroBlaze is a word. For both instructions, the address must be word aligned. No unaligned exceptions are generated for these instructions.

The conditional store is always performed when a reservation exists, even if the store address does not match the load address that set the reservation.

Only one reservation can be maintained at a time. The address associated with the reservation can be changed by executing a subsequent LWX instruction. The conditional store is performed based upon the reservation established by the last LWX instruction executed. Executing an SWX instruction always clears a reservation held by the processor, whether the address matches that established by the LWX or not.

Reset, interrupts, exceptions, and breaks (including the BRK and BRKI instructions) all clear the reservation.

The following provides general guidelines for using the LWX and SWX instructions:

- The LWX and SWX instructions should be paired and use the same address.
- An unpaired SWX instruction to an arbitrary address can be used to clear any reservation held by the processor.
- A conditional sequence begins with an LWX instruction. It can be followed by memory accesses and/or computations on the loaded value. The sequence ends with an SWX instruction. In most cases, failure of the SWX instruction should cause a branch back to the LWX for a repeated attempt.
- An LWX instruction can be left unpaired when executing certain synchronization primitives if the value loaded by the LWX is not zero. An implementation of Test and Set exemplifies this:

  ```
  loop: lwx r5,r3,r0 ; load and reserve
  bnei r5,next ; branch if not equal to zero
  addik r5,r5,1 ; increment value
  swx r5,r3,r0 ; try to store non-zero value
  addic r5,r0,0 ; check reservation
  bnei r5,loop ; loop if reservation lost
  next:
  ```

- Performance can be improved by minimizing looping on an LWX instruction that fails to return a desired value. Performance can also be improved by using an ordinary load instruction to do the initial value check. An implementation of a spinlock exemplifies this:

  ```
  loop: lw r5,r3,r0 ; load the word
  bnei r5,loop ; loop back if word not equal to 0
  lwx r5,r3,r0 ; try reserving again
  bnei r5,loop ; likely that no branch is needed
  addik r5,r5,1 ; increment value
  swx r5,r3,r0 ; try to store non-zero value
  addic r5,r0,0 ; check reservation
  bnei r5,loop ; loop if reservation lost
  ```

- Minimizing the looping on an LWX/SWX instruction pair increases the likelihood that forward progress is made. The old value should be tested before attempting the store. If the order is reversed (store before load), more SWX instructions are executed and reservations are more likely to be lost between the LWX and SWX instructions.
Self-modifying Code

When using self-modifying code software must ensure that the modified instructions have been written to memory prior to fetching them for execution. There are several aspects to consider:

- The instructions to be modified may already have been fetched prior to modification:
  - into the instruction prefetch buffer,
  - into the instruction cache, if it is enabled,
  - into a stream buffer, if instruction cache stream buffers are used,
  - into the instruction cache, and then saved in a victim buffer, if victim buffers are used.

To ensure that the modified code is always executed instead of the old unmodified code, software must handle all these cases.

- If one or more of the instructions to be modified is a branch, and the branch target cache is used, the branch target address may have been cached.

To avoid using the cached branch target address, software must ensure that the branch target cache is cleared prior to executing the modified code.

- The modified instructions may not have been written to memory prior to execution:
  - they may be en route to memory, in temporary storage in the interconnect or the memory controller,
  - they may be stored in the data cache, if write-back cache is used,
  - they may be saved in a victim buffer, if write-back cache and victim buffers are used.

Software must ensure that the modified instructions have been written to memory before being fetched by the processor.

The annotated code below shows how each of the above issues can be addressed. This code assumes that both instruction cache and write-back data cache is used. If not, the corresponding instructions can be omitted.

The following code exemplifies storing a modified instruction, when using AXI interconnect:

```
swi r5,r6,0 ; r5 = new instruction
            ; r6 = physical instruction address
wdc.flush r6,r0 ; flush write-back data cache line
mbar 1 ; ensure new instruction is written to memory
wic r7,r0 ; invalidate line, empty stream & victim buffers
            ; r7 = virtual instruction address
mbar 2 ; empty prefetch buffer, clear branch target cache
```

The following code exemplifies storing a modified instruction, when using XCL:

```
swi r5,r6,0 ; r5 = new instruction
            ; r6 = physical instruction address
wdc.flush r6,r0 ; flush write-back data cache line
lwi r0,r6,0 ; read back new instruction from memory to ensure it
            ; has been written to memory
wic r7,r0 ; invalidate line, empty stream & victim buffers
            ; r7 = virtual instruction address
mbar 2 ; empty prefetch buffer, clear branch target cache
```

The physical and virtual addresses above are identical, unless MMU virtual mode is used. If the MMU is enabled, the code sequences must be executed in real mode, since WIC and WDC are privileged instructions.

The first instruction after the code sequences above must not be modified, since it may have been prefetched.
MicroBlaze has an orthogonal instruction set architecture. It has thirty-two 32-bit general purpose registers and up to eighteen 32-bit special purpose registers, depending on configured options.

**General Purpose Registers**

The thirty-two 32-bit General Purpose Registers are numbered R0 through R31. The register file is reset on bit stream download (reset value is 0x00000000). Figure 2-2 is a representation of a General Purpose Register and Table 2-7 provides a description of each register and the register reset value (if existing).

*Note:* The register file is not reset by the external reset inputs: Reset, MB_Reset and Debug_Rst.

![Figure 2-2: R0-R31](image)

**Table 2-7: General Purpose Registers (R0-R31)**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>R0</td>
<td>Always has a value of zero. Anything written to R0 is discarded</td>
<td>0x00000000</td>
</tr>
<tr>
<td>0:31</td>
<td>R1 through R13</td>
<td>32-bit general purpose registers</td>
<td>-</td>
</tr>
<tr>
<td>0:31</td>
<td>R14</td>
<td>32-bit register used to store return addresses for interrupts.</td>
<td>-</td>
</tr>
<tr>
<td>0:31</td>
<td>R15</td>
<td>32-bit general purpose register. Recommended for storing return addresses for user vectors.</td>
<td>-</td>
</tr>
<tr>
<td>0:31</td>
<td>R16</td>
<td>32-bit register used to store return addresses for breaks.</td>
<td>-</td>
</tr>
<tr>
<td>0:31</td>
<td>R17</td>
<td>If MicroBlaze is configured to support hardware exceptions, this register is loaded with the address of the instruction following the instruction causing the HW exception, except for exceptions in delay slots that use BTR instead (see “Branch Target Register (BTR)’’); if not, it is a general purpose register.</td>
<td>-</td>
</tr>
<tr>
<td>0:31</td>
<td>R18 through R31</td>
<td>R18 through R31 are 32-bit general purpose registers.</td>
<td>-</td>
</tr>
</tbody>
</table>

Refer to Table 4-2 for software conventions on general purpose register usage.
Special Purpose Registers

Program Counter (PC)

The Program Counter (PC) is the 32-bit address of the execution instruction. It can be read with an MFS instruction, but it cannot be written with an MTS instruction. When used with the MFS instruction the PC register is specified by setting Sa = 0x0000. Figure 2-3 illustrates the PC and Table 2-8 provides a description and reset value.

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>PC</td>
<td>Address of executing instruction, that is, “mfs r2 0” stores the address of the mfs instruction itself in R2.</td>
<td>0x00000000</td>
</tr>
</tbody>
</table>

Machine Status Register (MSR)

The Machine Status Register contains control and status bits for the processor. It can be read with an MFS instruction. When reading the MSR, bit 29 is replicated in bit 0 as the carry copy. MSR can be written using either an MTS instruction or the dedicated MSRSET and MSRCLR instructions.

When writing to the MSR using MSRSET or MSRCLR, the Carry bit takes effect immediately and the remaining bits take effect one clock cycle later. When writing using MTS, all bits take effect one clock cycle later. Any value written to bit 0 is discarded.

When used with an MTS or MFS instruction, the MSR is specified by setting Sx = 0x0001. Figure 2-4 illustrates the MSR register and Table 2-9 provides the bit description and reset values.

<table>
<thead>
<tr>
<th></th>
<th></th>
<th>VMS VM UMS UM PVR EIP EE DCE DZO ICE FSL BIP C IE RES</th>
</tr>
</thead>
<tbody>
<tr>
<td>CC</td>
<td>RESERVED</td>
<td></td>
</tr>
</tbody>
</table>
### Table 2-9: Machine Status Register (MSR)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>CC</td>
<td>Arithmetic Carry Copy</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Copy of the Arithmetic Carry (bit 29). CC is always the same as bit C.</td>
<td></td>
</tr>
<tr>
<td>1:16</td>
<td>Reserved</td>
<td></td>
<td></td>
</tr>
<tr>
<td>17</td>
<td>VMS</td>
<td>Virtual Protected Mode Save</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Only available when configured with an MMU</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>(if C_USE_MMU &gt; 1 and C_AREA_OPTIMIZED = 0)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
<tr>
<td>18</td>
<td>VM</td>
<td>Virtual Protected Mode</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = MMU address translation and access protection disabled, with C_USE_MMU = 3 (Virtual). Access protection disabled with C_USE_MMU = 2 (Protection)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = MMU address translation and access protection enabled, with C_USE_MMU = 3 (Virtual). Access protection enabled, with C_USE_MMU = 2 (Protection).</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Only available when configured with an MMU</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>(if C_USE_MMU &gt; 1 and C_AREA_OPTIMIZED = 0)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
<tr>
<td>19</td>
<td>UMS</td>
<td>User Mode Save</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Only available when configured with an MMU</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>(if C_USE_MMU &gt; 0 and C_AREA_OPTIMIZED = 0)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
<tr>
<td>20</td>
<td>UM</td>
<td>User Mode</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = Privileged Mode, all instructions are allowed</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = User Mode, certain instructions are not allowed</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Only available when configured with an MMU</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>(if C_USE_MMU &gt; 0 and C_AREA_OPTIMIZED = 0)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
<tr>
<td>21</td>
<td>PVR</td>
<td>Processor Version Register exists</td>
<td>Based on parameter C_PVR</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = No Processor Version Register</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Processor Version Register exists</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read only</td>
<td></td>
</tr>
<tr>
<td>22</td>
<td>EIP</td>
<td>Exception In Progress</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = No hardware exception in progress</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Hardware exception in progress</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Only available if configured with exception support (C_*EXCEPTION or C_USE_MMU &gt; 0)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
</tbody>
</table>
Table 2-9: Machine Status Register (MSR) (Continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>23</td>
<td>EE</td>
<td>Exception Enable</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = Hardware exceptions disabled¹</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Hardware exceptions enabled</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Only available if configured with exception support (C_*_EXCEPTION or C_USE_MMU &gt; 0)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
<tr>
<td>24</td>
<td>DCE</td>
<td>Data Cache Enable</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = Data Cache disabled</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Data Cache enabled</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Only available if configured to use data cache (C_USE_DCACHE = 1)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
<tr>
<td>25</td>
<td>DZO</td>
<td>Division by Zero or Division Overflow²</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = No division by zero or division overflow has occurred</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Division by zero or division overflow has occurred</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Only available if configured to use hardware divider (C_USE_DIV = 1)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
<tr>
<td>26</td>
<td>ICE</td>
<td>Instruction Cache Enable</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = Instruction Cache disabled</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Instruction Cache enabled</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Only available if configured to use instruction cache (C_USE_ICACHE = 1)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
<tr>
<td>27</td>
<td>FSL</td>
<td>Stream (FSL or AXI) Error</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = get or getd had no error</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = get or getd control type mismatch</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>This bit is sticky, i.e. it is set by a get or getd instruction when a control bit mismatch occurs. To clear it an mts or msrclr instruction must be used.</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Only available if configured to use stream links (C_FSL_LINKS &gt; 0)</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
<tr>
<td>28</td>
<td>BIP</td>
<td>Break in Progress</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = No Break in Progress</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Break in Progress</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Break Sources can be software break instruction or hardware break from Ext_Brk or Ext_NM_Brk pin.</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
</tbody>
</table>
### Machine Status Register (MSR) (Continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
</table>
| 29   | C    | Arithmetic Carry  
                 0 = No Carry (Borrow)  
                 1 = Carry (No Borrow)  
                 Read/Write | 0 |
| 30   | IE   | Interrupt Enable  
                 0 = Interrupts disabled  
                 1 = Interrupts enabled  
                 Read/Write | 0 |
| 31   | -    | Reserved       | 0 |

1. The MMU exceptions (Data Storage Exception, Instruction Storage Exception, Data TLB Miss Exception, Instruction TLB Miss Exception) cannot be disabled, and are not affected by this bit.
2. This bit is only used for integer divide-by-zero or divide overflow signaling. There is a floating point equivalent in the FSR. The DZO-bit flags divide by zero or divide overflow conditions regardless if the processor is configured with exception handling or not.
Chapter 2: MicroBlaze Architecture

Exception Address Register (EAR)

The Exception Address Register stores the full load/store address that caused the exception for the following:

- An unaligned access exception that means the unaligned access address
- A DPLB or M_AXI_DP exception that specifies the failing PLB or AXI4 data access address
- A data storage exception that specifies the (virtual) effective address accessed
- An instruction storage exception that specifies the (virtual) effective address read
- A data TLB miss exception that specifies the (virtual) effective address accessed
- An instruction TLB miss exception that specifies the (virtual) effective address read

The contents of this register is undefined for all other exceptions. When read with the MFS instruction, the EAR is specified by setting $a = 0x0003$. The EAR register is illustrated in Figure 2-5 and Table 2-10 provides bit descriptions and reset values.

![Figure 2-5: EAR](image)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>EAR</td>
<td>Exception Address Register</td>
<td>0x00000000</td>
</tr>
</tbody>
</table>
Exception Status Register (ESR)

The Exception Status Register contains status bits for the processor. When read with the MFS instruction, the ESR is specified by setting Sa = 0x0005. The ESR register is illustrated in Figure 2-6, Table 2-11 provides bit descriptions and reset values, and Table 2-12 provides the Exception Specific Status (ESS).

![ESR Diagram]

Table 2-11: Exception Status Register (ESR)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:18</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
</tbody>
</table>
| 19    | DS      | Delay Slot Exception.  
0 = not caused by delay slot instruction  
1 = caused by delay slot instruction  
Read-only | 0          |
| 20:26 | ESS     | Exception Specific Status  
For details refer to Table 2-12.  
Read-only | See Table 2-12 |
| 27:31 | EC      | Exception Cause  
00000 = Stream exception  
00001 = Unaligned data access exception  
00010 = Illegal op-code exception  
00011 = Instruction bus error exception  
00100 = Data bus error exception  
00101 = Divide exception  
00110 = Floating point unit exception  
00111 = Privileged instruction exception  
00111 = Stack protection violation exception  
10000 = Data storage exception  
10001 = Instruction storage exception  
10010 = Data TLB miss exception  
10011 = Instruction TLB miss exception  
Read-only | 0          |
Table 2-12: Exception Specific Status (ESS)

<table>
<thead>
<tr>
<th>Exception Cause</th>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>Unaligned Data Access</td>
<td>20</td>
<td>W</td>
<td>Word Access Exception</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>$0 = $unaligned halfword access $1 = $unaligned word access</td>
<td></td>
</tr>
<tr>
<td></td>
<td>21</td>
<td>S</td>
<td>Store Access Exception</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>$0 = $unaligned load access $1 = $unaligned store access</td>
<td></td>
</tr>
<tr>
<td></td>
<td>22:26</td>
<td>Rx</td>
<td>Source/Destination Register</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>General purpose register used as source or destination</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>in unaligned access</td>
<td></td>
</tr>
<tr>
<td>Illegal Instruction</td>
<td>20:26</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>Instruction bus error</td>
<td>20</td>
<td>ECC</td>
<td>Exception caused by ILMB</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>correctable or uncorrectable error</td>
<td></td>
</tr>
<tr>
<td></td>
<td>21:26</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>Data bus error</td>
<td>20</td>
<td>ECC</td>
<td>Exception caused by DLMB</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>correctable or uncorrectable error</td>
<td></td>
</tr>
<tr>
<td></td>
<td>21:26</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>Divide</td>
<td>20</td>
<td>DEC</td>
<td>Divide - Division exception cause</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>$0 = $Divide-By-Zero $1 = $Division Overflow</td>
<td></td>
</tr>
<tr>
<td></td>
<td>21:26</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>Floating point unit</td>
<td>20:26</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>Privileged instruction</td>
<td>20:26</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>Stack protection violation</td>
<td>20:26</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>Stream</td>
<td>20:22</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td></td>
<td>23:26</td>
<td>FSL</td>
<td>Stream (FSL or AXI) index that caused the exception</td>
<td>0</td>
</tr>
<tr>
<td>Data storage</td>
<td>20</td>
<td>DIZ</td>
<td>Data storage - Zone protection</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>$0 = $Did not occur $1 = Occurred</td>
<td></td>
</tr>
<tr>
<td></td>
<td>21</td>
<td>S</td>
<td>Data storage - Store instruction</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td>$0 = $Did not occur $1 = Occurred</td>
<td></td>
</tr>
<tr>
<td></td>
<td>22:26</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
</tbody>
</table>
Branch Target Register (BTR)

The Branch Target Register only exists if the MicroBlaze processor is configured to use exceptions. The register stores the branch target address for all delay slot branch instructions executed while MSR[EIP] = 0. If an exception is caused by an instruction in a delay slot (that is, ESR[DS]=1), the exception handler should return execution to the address stored in BTR instead of the normal exception return address stored in R17. When read with the MFS instruction, the BTR is specified by setting Sa = 0x000B. The BTR register is illustrated in Figure 2-7 and Table 2-13 provides bit descriptions and reset values.

### Table 2-12: Exception Specific Status (ESS) (Continued)

<table>
<thead>
<tr>
<th>Exception Cause</th>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>Instruction storage</td>
<td>20</td>
<td>DIZ</td>
<td>Instruction storage - Zone protection</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td>21:26</td>
<td>Reserved</td>
<td>0 = Did not occur 1 = Occurred</td>
<td>0</td>
</tr>
<tr>
<td>Data TLB miss</td>
<td>20</td>
<td>Reserved</td>
<td>0</td>
<td></td>
</tr>
<tr>
<td></td>
<td>21</td>
<td>S</td>
<td>Data TLB miss - Store instruction 0 = Did not occur</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td>22:26</td>
<td>Reserved</td>
<td>1 = Occurred</td>
<td>0</td>
</tr>
<tr>
<td>Instruction TLB miss</td>
<td>20:26</td>
<td>Reserved</td>
<td>0</td>
<td></td>
</tr>
</tbody>
</table>

### Table 2-13: Branch Target Register (BTR)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>BTR</td>
<td>Branch target address used by handler when returning from an exception caused by an instruction in a delay slot. Read-only</td>
<td>0x00000000</td>
</tr>
</tbody>
</table>
Floating Point Status Register (FSR)

The Floating Point Status Register contains status bits for the floating point unit. It can be read with an MFS, and written with an MTS instruction. When read or written, the register is specified by setting Sa = 0x0007. The bits in this register are sticky – floating point instructions can only set bits in the register, and the only way to clear the register is by using the MTS instruction. Figure 2-8 illustrates the FSR register and Table 2-14 provides bit descriptions and reset values.

![Figure 2-8: FSR](image)

**Table 2-14: Floating Point Status Register (FSR)**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:26</td>
<td>Reserved</td>
<td>undefined</td>
<td>undefined</td>
</tr>
<tr>
<td>27</td>
<td>IO</td>
<td>Invalid operation</td>
<td>0</td>
</tr>
<tr>
<td>28</td>
<td>DZ</td>
<td>Divide-by-zero</td>
<td>0</td>
</tr>
<tr>
<td>29</td>
<td>OF</td>
<td>Overflow</td>
<td>0</td>
</tr>
<tr>
<td>30</td>
<td>UF</td>
<td>Underflow</td>
<td>0</td>
</tr>
<tr>
<td>31</td>
<td>DO</td>
<td>Denormalized operand error</td>
<td>0</td>
</tr>
</tbody>
</table>

Exception Data Register (EDR)

The Exception Data Register stores data read on a stream link (FSL or AXI) that caused a stream exception.

The contents of this register is undefined for all other exceptions. When read with the MFS instruction, the EDR is specified by setting Sa = 0x000D. Figure 2-9 illustrates the EDR register and Table 2-15 provides bit descriptions and reset values.

**Note:** The register is only implemented if \texttt{C\_FSL\_LINKS} is greater than 0 and \texttt{C\_FSL\_EXCEPTION} is set to 1.

![Figure 2-9: EDR](image)

**Table 2-15: Exception Data Register (EDR)**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>EDR</td>
<td>Exception Data Register</td>
<td>0x00000000</td>
</tr>
</tbody>
</table>
Stack Low Register (SLR)

The Stack Low Register stores the stack low limit to detect stack overflow. When the address of a load or store instruction using the stack pointer (register R1) as rA is less than the Stack Low Register, a stack overflow occurs, causing a Stack Protection Violation exception if exceptions are enabled in MSR.

When read with the MFS instruction, the SLR is specified by setting Sa = 0x0800. Figure 2-10 illustrates the SLR register and Table 2-16 provides bit descriptions and reset values.

**Note:** The register is only implemented if `C_USE_STACK_PROTECTION` is set to 1.

```
0 31
    ▲
    SLR
```

Figure 2-10:  SLR

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>SLR</td>
<td>Stack Low Register</td>
<td>0x00000000</td>
</tr>
</tbody>
</table>

Stack High Register (SHR)

The Stack High Register stores the stack high limit to detect stack underflow. When the address of a load or store instruction using the stack pointer (register R1) as rA is greater than the Stack High Register, a stack underflow occurs, causing a Stack Protection Violation exception if exceptions are enabled in MSR.

When read with the MFS instruction, the SHR is specified by setting Sa = 0x0802. Figure 2-11 illustrates the SHR register and Table 2-17 provides bit descriptions and reset values.

**Note:** The register is only implemented if `C_USE_STACK_PROTECTION` is set to 1.

```
0 31
    ▲
    SHR
```

Figure 2-11:  SHR

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>SHR</td>
<td>Stack High Register</td>
<td>0xFFFFFFFF</td>
</tr>
</tbody>
</table>
Process Identifier Register (PID)

The Process Identifier Register is used to uniquely identify a software process during MMU address translation. It is controlled by the C_USE_MMU configuration option on MicroBlaze. The register is only implemented if C_USE_MMU is greater than 1 (User Mode) and C_AREA_OPTIMIZED is set to 0. When accessed with the MFS and MTS instructions, the PID is specified by setting Sa = 0x1000. The register is accessible according to the memory management special registers parameter C_MMU_TLB_ACCESS.

PID is also used when accessing a TLB entry:

- When writing Translation Look-Aside Buffer High (TLBHI) the value of PID is stored in the TID field of the TLB entry
- When reading TLBHI and MSR[UM] is not set, the value in the TID field is stored in PID

Figure 2-12 illustrates the PID register and Table 2-18 provides bit descriptions and reset values.

![Figure 2-12: PID]

Table 2-18: Process Identifier Register (PID)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:23</td>
<td>Reserved</td>
<td></td>
<td></td>
</tr>
<tr>
<td>24:31</td>
<td>PID</td>
<td>Used to uniquely identify a software process during MMU address translation. Read/Write</td>
<td>0x00</td>
</tr>
</tbody>
</table>
Zone Protection Register (ZPR)

The Zone Protection Register is used to override MMU memory protection defined in TLB entries. It is controlled by the `C_USE_MMU` configuration option on MicroBlaze. The register is only implemented if `C_USE_MMU` is greater than 1 (User Mode), `C_AREA_OPTIMIZED` is set to 0, and if the number of specified memory protection zones is greater than zero (`C_MMU_ZONES > 0`). The implemented register bits depend on the number of specified memory protection zones (`C_MMU_ZONES`). When accessed with the MFS and MTS instructions, the ZPR is specified by setting `Sa = 0x1001`. The register is accessible according to the memory management special registers parameter `C_MMU_TLB_ACCESS`. Figure 2-13 illustrates the ZPR register and Table 2-19 provides bit descriptions and reset values.

![Figure 2-13: ZPR](image)

**Table 2-19: Zone Protection Register (ZPR)**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:1</td>
<td>ZP0</td>
<td>Zone Protect</td>
<td>0x00000000</td>
</tr>
<tr>
<td>2:3</td>
<td>ZP1</td>
<td>User mode (MSR[UM] = 1): 00 = Override V in TLB entry. No access to the page is allowed</td>
<td></td>
</tr>
<tr>
<td>...</td>
<td>...</td>
<td>01 = No override. Use V, WR and EX from TLB entry</td>
<td></td>
</tr>
<tr>
<td>30:31</td>
<td>ZP15</td>
<td>10 = No override. Use V, WR and EX from TLB entry</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>11 = Override WR and EX in TLB entry. Access the page as writable and executable</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Privileged mode (MSR[UM] = 0): 00 = No override. Use V, WR and EX from TLB entry</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>01 = No override. Use V, WR and EX from TLB entry</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>10 = Override WR and EX in TLB entry. Access the page as writable and executable</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>11 = Override WR and EX in TLB entry. Access the page as writable and executable</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Read/Write</td>
<td></td>
</tr>
</tbody>
</table>
Translation Look-Aside Buffer Low Register (TLBLO)

The Translation Look-Aside Buffer Low Register is used to access MMU Unified Translation Look-Aside Buffer (UTLB) entries. It is controlled by the C_USE_MMU configuration option on MicroBlaze. The register is only implemented if C_USE_MMU is greater than 1 (User Mode), and C_AREA_OPTIMIZED is set to 0. When accessed with the MFS and MTS instructions, the TLBLO is specified by setting Sa = 0x1003. When reading or writing TLBLO, the UTLB entry indexed by the TLBX register is accessed. The register is readable according to the memory management special registers parameter C_MMU_TLB_ACCESS.

The UTLB is reset on bit stream download (reset value is 0x00000000 for all TLBLO entries).

**Note:** The UTLB is not reset by the external reset inputs: Reset, MB_Reset and Debug_Rst.

Figure 2-14 illustrates the TLBLO register and Table 2-20 provides bit descriptions and reset values.

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:21</td>
<td>RPN</td>
<td>Real Page Number or Physical Page Number</td>
<td>0x0000000</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When a TLB hit occurs, this field is read from the TLB entry and is used to form the physical address. Depending on the value of the SIZE field, some of the RPN bits are not used in the physical address. Software must clear unused bits in this field to zero. Only defined when C_USE_MMU=3 (Virtual). Read/Write</td>
<td></td>
</tr>
<tr>
<td>22</td>
<td>EX</td>
<td>Executable</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When bit is set to 1, the page contains executable code, and instructions can be fetched from the page. When bit is cleared to 0, instructions cannot be fetched from the page. Attempts to fetch instructions from a page with a clear EX bit cause an instruction-storage exception. Read/Write</td>
<td></td>
</tr>
</tbody>
</table>
### Registers

When bit is set to 1, the page is writable and store instructions can be used to store data at addresses within the page. When bit is cleared to 0, the page is read-only (not writable). Attempts to store data into a page with a clear WR bit cause a data storage exception.

- **Read/Write**

#### ZSEL Zone Select

This field selects one of 16 zone fields (Z0-Z15) from the zone-protection register (ZPR). For example, if ZSEL 0x5, zone field Z5 is selected. The selected ZPR field is used to modify the access protection specified by the TLB entry EX and WR fields. It is also used to prevent access to a page by overriding the TLB V (valid) field.

- **Read/Write**

#### W Write Through

When the parameter `C_DCACHE_USE_WRITEBACK` is set to 1, this bit controls caching policy. A write-through policy is selected when set to 1, and a write-back policy is selected otherwise. This bit is fixed to 1, and write-through is always used, when `C_DCACHE_USE_WRITEBACK` is cleared to 0.

- **Read/Write**

#### I Inhibit Caching

When bit is set to 1, accesses to the page are not cached (caching is inhibited). When cleared to 0, accesses to the page are cacheable.

- **Read/Write**

#### M Memory Coherent

This bit is fixed to 0, because memory coherence is not implemented on MicroBlaze.

- **Read Only**

#### G Guarded

When bit is set to 1, speculative page accesses are not allowed (memory is guarded). When cleared to 0, speculative page accesses are allowed. The G attribute can be used to protect memory-mapped I/O devices from inappropriate instruction accesses.

- **Read/Write**

---

**Table 2-20:** Translation Look-Aside Buffer Low Register (TLBLO) *(Continued)*

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>23</td>
<td>WR</td>
<td>Writable</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When bit is set to 1, the page is writable and store instructions can be used to store data at addresses within the page. When bit is cleared to 0, the page is read-only (not writable). Attempts to store data into a page with a clear WR bit cause a data storage exception.</td>
<td></td>
</tr>
<tr>
<td>24:27</td>
<td>ZSEL</td>
<td>Zone Select</td>
<td>0x0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>This field selects one of 16 zone fields (Z0-Z15) from the zone-protection register (ZPR). For example, if ZSEL 0x5, zone field Z5 is selected. The selected ZPR field is used to modify the access protection specified by the TLB entry EX and WR fields. It is also used to prevent access to a page by overriding the TLB V (valid) field.</td>
<td></td>
</tr>
<tr>
<td>28</td>
<td>W</td>
<td>Write Through</td>
<td>0/1</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When the parameter <code>C_DCACHE_USE_WRITEBACK</code> is set to 1, this bit controls caching policy. A write-through policy is selected when set to 1, and a write-back policy is selected otherwise. This bit is fixed to 1, and write-through is always used, when <code>C_DCACHE_USE_WRITEBACK</code> is cleared to 0.</td>
<td></td>
</tr>
<tr>
<td>29</td>
<td>I</td>
<td>Inhibit Caching</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When bit is set to 1, accesses to the page are not cached (caching is inhibited). When cleared to 0, accesses to the page are cacheable.</td>
<td></td>
</tr>
<tr>
<td>30</td>
<td>M</td>
<td>Memory Coherent</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>This bit is fixed to 0, because memory coherence is not implemented on MicroBlaze.</td>
<td></td>
</tr>
<tr>
<td>31</td>
<td>G</td>
<td>Guarded</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When bit is set to 1, speculative page accesses are not allowed (memory is guarded). When cleared to 0, speculative page accesses are allowed. The G attribute can be used to protect memory-mapped I/O devices from inappropriate instruction accesses.</td>
<td></td>
</tr>
</tbody>
</table>
Translation Look-Aside Buffer High Register (TLBHI)

The Translation Look-Aside Buffer High Register is used to access MMU Unified Translation Look-Aside Buffer (UTLB) entries. It is controlled by the _C_USE_MMU_ configuration option on MicroBlaze. The register is only implemented if _C_USE_MMU_ is greater than 1 (User Mode), and _C_AREA_OPTIMIZED_ is set to 0. When accessed with the MFS and MTS instructions, the TLBHI is specified by setting Sa = 0x1004. When reading or writing TLBHI, the UTLB entry indexed by the TLBX register is accessed. The register is readable according to the memory management special registers parameter _C_MMU_TLB_ACCESS_.

PID is also used when accessing a TLB entry:

- When writing TLBHI the value of PID is stored in the TID field of the TLB entry
- When reading TLBHI and MSR[UM] is not set, the value in the TID field is stored in PID

The UTLB is reset on bit stream download (reset value is 0x00000000 for all TLBHI entries).

**Note:** The UTLB is not reset by the external reset inputs: Reset, MB_Reset and Debug_Rst. Figure 2-15 illustrates the TLBHI register and Table 2-21 provides bit descriptions and reset values.

![Figure 2-15: TLBHI](image)

**Table 2-21: Translation Look-Aside Buffer High Register (TLBHI)**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
</table>
| 0:21   | TAG  | TLB-entry tag  
Is compared with the page number portion of the virtual memory address under the control of the SIZE field.  
Read/Write | 0x000000 |
| 22:24  | SIZE | Size  
Specifies the page size. The SIZE field controls the bit range used in comparing the TAG field with the page number portion of the virtual memory address. The page sizes defined by this field are listed in Table 2-36.  
Read/Write | 000 |
| 25     | V    | Valid  
When this bit is set to 1, the TLB entry is valid and contains a page-translation entry.  
When cleared to 0, the TLB entry is invalid.  
Read/Write | 0 |
When this bit is set to 1, a the page is accessed as a little endian page if `C_ENDIANNESS` is 0 (Big Endian), or as a big endian page otherwise.

When cleared to 0, the page is accessed as a big endian page if `C_ENDIANNESS` is 0 (Big Endian), or as a little endian page otherwise.

The E bit only affects data read or data write accesses. Instruction accesses are not affected.

The E bit is only implemented when the parameter `C_USE_REORDER_INSTR` is set to 1, otherwise it is fixed to 0.

### Table 2-21: Translation Look-Aside Buffer High Register (TLBHI) (Continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>26</td>
<td>E</td>
<td>Endian</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>When this bit is set to 1, a page is accessed as a little endian page if <code>C_ENDIANNESS</code> is 0 (Big Endian), or as a big endian page otherwise. When cleared to 0, the page is accessed as a big endian page if <code>C_ENDIANNESS</code> is 0 (Big Endian), or as a little endian page otherwise. The <code>E</code> bit only affects data read or data write accesses. Instruction accesses are not affected. The <code>E</code> bit is only implemented when the parameter <code>C_USE_REORDER_INSTR</code> is set to 1, otherwise it is fixed to 0. Read/Write</td>
<td></td>
</tr>
<tr>
<td>27</td>
<td>U0</td>
<td>User Defined</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>This bit is fixed to 0, since there are no user defined storage attributes on MicroBlaze. Read Only</td>
<td></td>
</tr>
<tr>
<td>28:31</td>
<td>Reserved</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>
Chapter 2: MicroBlaze Architecture

Translation Look-Aside Buffer Index Register (TLBX)

The Translation Look-Aside Buffer Index Register is used as an index to the Unified Translation Look-Aside Buffer (UTLB) when accessing the TLBLO and TLBHI registers. It is controlled by the C_USE_MMU configuration option on MicroBlaze. The register is only implemented if C_USE_MMU is greater than 1 (User Mode), and C_AREA_OPTIMIZED is set to 0. When accessed with the MFS and MTS instructions, the TLBX is specified by setting Sa = 0x1002. Figure 2-16 illustrates the TLBX register and Table 2-22 provides bit descriptions and reset values.

![Figure 2-16: TLBX](image)

Table 2-22: Translation Look-Aside Buffer Index Register (TLBX)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>MISS</td>
<td>TLB Miss</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>This bit is cleared to 0 when the TLBSX register is written with a virtual address, and the virtual address is found in a TLB entry. The bit is set to 1 if the virtual address is not found. It is also cleared when the TLBX register itself is written. Read Only Can be read if the memory management special registers parameter C_MMU_TLB_ACCESS &gt; 0 (MINIMAL).</td>
<td></td>
</tr>
<tr>
<td>1:25</td>
<td>Reserved</td>
<td></td>
<td></td>
</tr>
<tr>
<td>26:31</td>
<td>INDEX</td>
<td>TLB Index</td>
<td>000000</td>
</tr>
<tr>
<td></td>
<td></td>
<td>This field is used to index the Translation Look-Aside Buffer entry accessed by the TLBLO and TLBHI registers. The field is updated with a TLB index when the TLBSX register is written with a virtual address, and the virtual address is found in the corresponding TLB entry. Read/Write Can be read and written if the memory management special registers parameter C_MMU_TLB_ACCESS &gt; 0 (MINIMAL).</td>
<td></td>
</tr>
</tbody>
</table>
Translation Look-Aside Buffer Search Index Register (TLBSX)

The Translation Look-Aside Buffer Search Index Register is used to search for a virtual page number in the Unified Translation Look-Aside Buffer (UTLB). It is controlled by the C_USE_MMU configuration option on MicroBlaze. The register is only implemented if C_USE_MMU is greater than 1 (User Mode), and C_AREA_OPTIMIZED is set to 0. When written with the MTS instruction, the TLBSX is specified by setting Sa = 0x1005. Figure 2-17 illustrates the TLBSX register and Table 2-23 provides bit descriptions and reset values.

![Figure 2-17: TLBSX](image)

**Table 2-23: Translation Look-Aside Buffer Index Search Register (TLBSX)**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Reset Value</th>
</tr>
</thead>
</table>
| 0:21  | VPN    | Virtual Page Number  
This field represents the page number portion of the virtual memory address. It is compared with the page number portion of the virtual memory address under the control of the SIZE field, in each of the Translation Look-Aside Buffer entries that have the V bit set to 1.  
If the virtual page number is found, the TLBX register is written with the index of the TLB entry and the MISS bit in TLBX is cleared to 0. If the virtual page number is not found in any of the TLB entries, the MISS bit in the TLBX register is set to 1.  
Write Only |             |
Processor Version Register (PVR)

The Processor Version Register is controlled by the C_PVR configuration option on MicroBlaze.

- When C_PVR is set to 0 (None) the processor does not implement any PVR and MSR[PVR]=0.
- When C_PVR is set to 1 (Basic), MicroBlaze implements only the first register: PVR0, and if set to 2 (Full), all 12 PVR registers (PVR0 to PVR11) are implemented.

When read with the MFS instruction the PVR is specified by setting Sa = 0x200x, with x being the register number between 0x0 and 0xB.

Table 2-24 through Table 2-35 provide bit descriptions and values.

Table 2-24: Processor Version Register 0 (PVR0)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>CFG</td>
<td>PVR implementation: 0 = Basic, 1 = Full</td>
<td>Based on C_PVR</td>
</tr>
<tr>
<td>1</td>
<td>BS</td>
<td>Use barrel shifter</td>
<td>C_USE_BARRIER</td>
</tr>
<tr>
<td>2</td>
<td>DIV</td>
<td>Use divider</td>
<td>C_USE_DIV</td>
</tr>
<tr>
<td>3</td>
<td>MUL</td>
<td>Use hardware multiplier</td>
<td>C_USE_HW_MUL &gt; 0 (None)</td>
</tr>
<tr>
<td>4</td>
<td>FPU</td>
<td>Use FPU</td>
<td>C_USE_FPU &gt; 0 (None)</td>
</tr>
<tr>
<td>5</td>
<td>EXC</td>
<td>Use any type of exceptions</td>
<td>Based on C_*_EXCEPTION Also set if C_USE_MMU &gt; 0 (None)</td>
</tr>
<tr>
<td>6</td>
<td>ICU</td>
<td>Use instruction cache</td>
<td>C_USE_ICACHE</td>
</tr>
<tr>
<td>7</td>
<td>DCU</td>
<td>Use data cache</td>
<td>C_USE_DCACHE</td>
</tr>
<tr>
<td>8</td>
<td>MMU</td>
<td>Use MMU</td>
<td>C_USE_MMU &gt; 0 (None)</td>
</tr>
<tr>
<td>9</td>
<td>BTC</td>
<td>Use branch target cache</td>
<td>C_USE_BRANCH_TARGET_CACHE</td>
</tr>
<tr>
<td>10</td>
<td>ENDI</td>
<td>Selected endianness: 0 = Big endian, 1 = Little endian</td>
<td>C_ENDIANNESS</td>
</tr>
<tr>
<td>11</td>
<td>FT</td>
<td>Implement fault tolerant features</td>
<td>C_FAULT_TOLERANT</td>
</tr>
<tr>
<td>12</td>
<td>SPROT</td>
<td>Use stack protection</td>
<td>C_USE_STACK_PROTECTION</td>
</tr>
<tr>
<td>13</td>
<td>REORD</td>
<td>Implement reorder instructions</td>
<td>C_USE_REORDER_INSTR</td>
</tr>
<tr>
<td>14:15</td>
<td>Reserved</td>
<td></td>
<td></td>
</tr>
<tr>
<td>16:23</td>
<td>MBV</td>
<td>MicroBlaze release version code</td>
<td>Release Specific</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x1 = v5.00.a 0xE = v7.20.c</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x2 = v5.00.b 0xF = v7.20.d</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x3 = v5.00.c 0x10 = v7.30.a</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x4 = v6.00.a 0x11 = v7.30.b</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x6 = v6.00.b 0x12 = v8.00.a</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x5 = v7.00.a 0x13 = v8.00.b</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x7 = v7.00.b 0x14 = v8.10.a</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x8 = v7.10.a 0x15 = v8.20.a</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x9 = v7.10.b 0x16 = v8.20.b</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0xA = v7.10.c 0x17 = v8.30.a</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0xB = v7.10.d 0x18 = v8.40.a</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0xC = v7.20.a 0x19 = v8.40.b</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0xD = v7.20.b</td>
<td></td>
</tr>
<tr>
<td>24:31</td>
<td>USR1</td>
<td>User configured value 1</td>
<td>C_PVR_USER1</td>
</tr>
</tbody>
</table>
### Table 2-25: Processor Version Register 1 (PVR1)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>USR2</td>
<td>User configured value 2</td>
<td>C_PVR_USER2</td>
</tr>
</tbody>
</table>

### Table 2-26: Processor Version Register 2 (PVR2)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>DAXI</td>
<td>Data side AXI4 in use</td>
<td>C_D_AXI</td>
</tr>
<tr>
<td>1</td>
<td>DLMB</td>
<td>Data side LMB in use</td>
<td>C_D_LMB</td>
</tr>
<tr>
<td>2</td>
<td>IAXI</td>
<td>Instruction side AXI4 in use</td>
<td>C_I_AXI</td>
</tr>
<tr>
<td>3</td>
<td>ILMB</td>
<td>Instruction side LMB in use</td>
<td>C_I_LMB</td>
</tr>
<tr>
<td>4</td>
<td>IRQEDGE</td>
<td>Interrupt is edge triggered</td>
<td>C_INTERRUPT_IS_EDGE</td>
</tr>
<tr>
<td>5</td>
<td>IRQPOS</td>
<td>Interrupt edge is positive</td>
<td>C_EDGE_IS_POSITIVE</td>
</tr>
<tr>
<td>6</td>
<td>DPLB</td>
<td>Data side PLB in use</td>
<td>C_D_PLB</td>
</tr>
<tr>
<td>7</td>
<td>IPLB</td>
<td>Instruction side PLB in use</td>
<td>C_I_PLB</td>
</tr>
<tr>
<td>8</td>
<td>INTERCON</td>
<td>Use PLB interconnect</td>
<td>C_INTERCONNECT = 1 (PLBv46)</td>
</tr>
<tr>
<td>9</td>
<td>STREAM</td>
<td>Use AXI4-Stream interconnect</td>
<td>C_STREAM_INTERCONNECT = 1 (AXI4-Stream)</td>
</tr>
<tr>
<td>10:11</td>
<td>Reserved</td>
<td></td>
<td></td>
</tr>
<tr>
<td>12</td>
<td>FSL</td>
<td>Use extended stream (FSL or AXI) instructions</td>
<td>C_USE_EXTENDED_FSL_INSTR</td>
</tr>
<tr>
<td>13</td>
<td>FSLEXC</td>
<td>Generate exception for stream control bit (FSL or AXI) mismatch</td>
<td>C_FSL_EXCEPTION</td>
</tr>
<tr>
<td>14</td>
<td>MSR</td>
<td>Use msrset and msrclr instructions</td>
<td>C_USE_MSR_INSTR</td>
</tr>
<tr>
<td>15</td>
<td>PCMP</td>
<td>Use pattern compare and CLZ instructions</td>
<td>C_USE_PCMP_INSTR</td>
</tr>
<tr>
<td>16</td>
<td>AREA</td>
<td>Select implementation to optimize area with lower instruction throughput</td>
<td>C_AREA_OPTIMIZED</td>
</tr>
<tr>
<td>17</td>
<td>BS</td>
<td>Use barrel shifter</td>
<td>C_USE_BARREL</td>
</tr>
<tr>
<td>18</td>
<td>DIV</td>
<td>Use divider</td>
<td>C_USE_DIV</td>
</tr>
<tr>
<td>19</td>
<td>MUL</td>
<td>Use hardware multiplier</td>
<td>C_USE_HW_MUL &gt; 0 (None)</td>
</tr>
<tr>
<td>20</td>
<td>FPU</td>
<td>Use FPU</td>
<td>C_USE_FPU &gt; 0 (None)</td>
</tr>
<tr>
<td>21</td>
<td>MUL64</td>
<td>Use 64-bit hardware multiplier</td>
<td>C_USE_HW_MUL = 2 (Mul64)</td>
</tr>
</tbody>
</table>
Table 2-26: Processor Version Register 2 (PVR2) (Continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>22</td>
<td>FPU2</td>
<td>Use floating point conversion and square root instructions</td>
<td>C_USE_FPU = 2 (Extended)</td>
</tr>
<tr>
<td>23</td>
<td>IPLBEXC</td>
<td>Generate exception for IPLB error</td>
<td>C_IPLB_BUS_EXCEPTION</td>
</tr>
<tr>
<td>24</td>
<td>DPLBEXC</td>
<td>Generate exception for DPLB error</td>
<td>C_DPLB_BUS_EXCEPTION</td>
</tr>
<tr>
<td>25</td>
<td>OP0EXC</td>
<td>Generate exception for 0x0 illegal opcode</td>
<td>C_OPCODE_0x0_ILLEGAL</td>
</tr>
<tr>
<td>26</td>
<td>UNEXC</td>
<td>Generate exception for unaligned data access</td>
<td>C_UNALIGNED_EXCEPTION</td>
</tr>
<tr>
<td>27</td>
<td>OPEXC</td>
<td>Generate exception for any illegal opcode</td>
<td>C_ILL_OPCODE_EXCEPTION</td>
</tr>
<tr>
<td>28</td>
<td>AXIIEXC</td>
<td>Generate exception for M_AXI_I error</td>
<td>C_M_AXI_I_BUS_EXCEPTION</td>
</tr>
<tr>
<td>29</td>
<td>AXIDEXC</td>
<td>Generate exception for M_AXI_D error</td>
<td>C_M_AXI_D_BUS_EXCEPTION</td>
</tr>
<tr>
<td>30</td>
<td>DIVEXC</td>
<td>Generate exception for division by zero or division overflow</td>
<td>C_DIV_ZERO_EXCEPTION</td>
</tr>
<tr>
<td>31</td>
<td>FPUEXC</td>
<td>Generate exceptions from FPU</td>
<td>C_FPU_EXCEPTION</td>
</tr>
</tbody>
</table>

Table 2-27: Processor Version Register 3 (PVR3)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>DEBUG</td>
<td>Use debug logic</td>
<td>C_DEBUG_ENABLED</td>
</tr>
<tr>
<td>1:2</td>
<td></td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>3:6</td>
<td>CBRK</td>
<td>Number of PC breakpoints</td>
<td>C_NUMBER_OF_PC_BRK</td>
</tr>
<tr>
<td>7:9</td>
<td></td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>10:12</td>
<td>RDADDR</td>
<td>Number of read address breakpoints</td>
<td>C_NUMBER_OF_RD_ADDR_BRK</td>
</tr>
<tr>
<td>13:15</td>
<td></td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>16:18</td>
<td>WRADDR</td>
<td>Number of write address breakpoints</td>
<td>C_NUMBER_OF_WR_ADDR_BRK</td>
</tr>
<tr>
<td>19</td>
<td></td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>20:24</td>
<td>FSL</td>
<td>Number of stream links</td>
<td>C_FSL_LINKS</td>
</tr>
<tr>
<td>25:28</td>
<td></td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>29:31</td>
<td>BTC_SIZE</td>
<td>Branch Target Cache size</td>
<td>C_BRANCH_TARGET_CACHE_SIZE</td>
</tr>
</tbody>
</table>
### Table 2-28: Processor Version Register 4 (PVR4)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>ICU</td>
<td>Use instruction cache</td>
<td>C_USE_ICACHE</td>
</tr>
<tr>
<td>1:5</td>
<td>ICTS</td>
<td>Instruction cache tag size</td>
<td>C_ADDR_TAG_BITS</td>
</tr>
<tr>
<td>6</td>
<td>Reserved</td>
<td></td>
<td></td>
</tr>
<tr>
<td>7</td>
<td>ICW</td>
<td>Allow instruction cache write</td>
<td>C_ALLOW_ICACHE_WR</td>
</tr>
<tr>
<td>8:10</td>
<td>ICLL</td>
<td>The base two logarithm of the instruction cache line length</td>
<td>$\log_2(C_{ICACHE_LINE_LEN})$</td>
</tr>
<tr>
<td>11:15</td>
<td>ICBS</td>
<td>The base two logarithm of the instruction cache byte size</td>
<td>$\log_2(C_{CACHE_BYTE_SIZE})$</td>
</tr>
<tr>
<td>16</td>
<td>IAU</td>
<td>The instruction cache is used for all memory accesses within the cacheable range</td>
<td>C_ICACHE_ALWAYS_USED</td>
</tr>
<tr>
<td>17</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>18</td>
<td>ICI</td>
<td>Instruction cache XCL protocol</td>
<td>C_ICACHE_INTERFACE</td>
</tr>
<tr>
<td>19:21</td>
<td>ICV</td>
<td>Instruction cache victims</td>
<td>0-3: C_ICACHE_VICTIMS = 0,2,4,8</td>
</tr>
<tr>
<td>22:23</td>
<td>ICS</td>
<td>Instruction cache streams</td>
<td>C_ICACHE_STREAMS</td>
</tr>
<tr>
<td>24</td>
<td>IFTL</td>
<td>Instruction cache tag uses distributed RAM</td>
<td>C_ICACHE_FORCE_TAG_LUTRAM</td>
</tr>
<tr>
<td>25</td>
<td>ICDW</td>
<td>Instruction cache data width</td>
<td>C_ICACHE_DATA_WIDTH &gt; 0</td>
</tr>
<tr>
<td>26:31</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
</tbody>
</table>

### Table 2-29: Processor Version Register 5 (PVR5)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>DCU</td>
<td>Use data cache</td>
<td>C_USE_DCACHE</td>
</tr>
<tr>
<td>1:5</td>
<td>DCTS</td>
<td>Data cache tag size</td>
<td>C_DCACHE_ADDR_TAG</td>
</tr>
<tr>
<td>6</td>
<td>Reserved</td>
<td></td>
<td></td>
</tr>
<tr>
<td>7</td>
<td>DCW</td>
<td>Allow data cache write</td>
<td>C_ALLOW_DCACHE_WR</td>
</tr>
<tr>
<td>8:10</td>
<td>DCLL</td>
<td>The base two logarithm of the data cache line length</td>
<td>$\log_2(C_{DCACHE_LINE_LEN})$</td>
</tr>
<tr>
<td>11:15</td>
<td>DCBS</td>
<td>The base two logarithm of the data cache byte size</td>
<td>$\log_2(C_{DCACHE_BYTE_SIZE})$</td>
</tr>
<tr>
<td>16</td>
<td>DAU</td>
<td>The data cache is used for all memory accesses within the cacheable range</td>
<td>C_DCACHE_ALWAYS_USED</td>
</tr>
<tr>
<td>17</td>
<td>DWB</td>
<td>Data cache policy is write-back</td>
<td>C_DCACHE_USE_WRITEBACK</td>
</tr>
</tbody>
</table>
Table 2-29: Processor Version Register 5 (PVR5) (Continued)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>18</td>
<td>DCI</td>
<td>Data cache XCL protocol</td>
<td>C_DCACHE_INTERFACE</td>
</tr>
<tr>
<td>19:21</td>
<td>DCV</td>
<td>Data cache victims</td>
<td>0-3: C_DCACHE_VICTIMS = 0,2,4,8</td>
</tr>
<tr>
<td>22:23</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>24</td>
<td>DFTL</td>
<td>Data cache tag uses distributed RAM</td>
<td>C_DCACHE_FORCE_TAG_LUTRAM</td>
</tr>
<tr>
<td>25</td>
<td>DCDW</td>
<td>Data cache data width</td>
<td>C_DCACHE_DATA_WIDTH &gt; 0</td>
</tr>
<tr>
<td>26:31</td>
<td>Reserved</td>
<td></td>
<td>0</td>
</tr>
</tbody>
</table>

Table 2-30: Processor Version Register 6 (PVR6)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>ICBA</td>
<td>Instruction Cache Base Address</td>
<td>C_ICACHE_BASEADDR</td>
</tr>
</tbody>
</table>

Table 2-31: Processor Version Register 7 (PVR7)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>ICHA</td>
<td>Instruction Cache High Address</td>
<td>C_ICACHE_HIGHADDR</td>
</tr>
</tbody>
</table>

Table 2-32: Processor Version Register 8 (PVR8)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>DCBA</td>
<td>Data Cache Base Address</td>
<td>C_DCACHE_BASEADDR</td>
</tr>
</tbody>
</table>

Table 2-33: Processor Version Register 9 (PVR9)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:31</td>
<td>DCHA</td>
<td>Data Cache High Address</td>
<td>C_DCACHE_HIGHADDR</td>
</tr>
</tbody>
</table>
Table 2-34: Processor Version Register 10 (PVR10)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:7</td>
<td>ARCH</td>
<td>Target architecture:</td>
<td>Defined by parameter C_FAMILY</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x6 = Spartan®-3, Automotive Spartan-3</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x7 = Virtex-4, Defence Grade Virtex-4 Q</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Space-Grade Virtex-4 QV</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x8 = Virtex-5, Defence Grade Virtex-5 Q</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Space-Grade Virtex-5 QV</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x9 = Spartan-3E, Automotive Spartan-3E</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0xA = Spartan-3A, Automotive Spartan-3A</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0xB = Spartan-3AN</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0xC = Spartan-3A DSP, Automotive Spartan-3A</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0xD = Spartan-6, Automotive Spartan-6,</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Defence Grade Spartan-6 Q</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0xE = Virtex-6, Defence Grade Virtex-6 Q</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0xF = Virtex-7, Defence Grade Virtex-7 Q</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x10 = Kintex™-7, Defence Grade Kintex-7 Q</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x11 = Artix™-7, Automotive Artix-7,</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Defence Grade Artix-7 Q</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>0x12 = Zynq™-7000, Automotive Zynq-7000,</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>Defence Grade Zynq-7000 Q</td>
<td></td>
</tr>
<tr>
<td></td>
<td>8:31</td>
<td>Reserved</td>
<td>0</td>
</tr>
</tbody>
</table>

Table 2-35: Processor Version Register 11 (PVR11)

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0:1</td>
<td>MMU</td>
<td>Use MMU:</td>
<td>C_USE_MMU</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = None</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = User Mode</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>2 = Protection</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>3 = Virtual</td>
<td></td>
</tr>
<tr>
<td>2:4</td>
<td>ITLB</td>
<td>Instruction Shadow TLB size</td>
<td>(\log_2(C_{MMU_ITLB_SIZE}))</td>
</tr>
<tr>
<td>5:7</td>
<td>DTLB</td>
<td>Data Shadow TLB size</td>
<td>(\log_2(C_{MMU_DTLB_SIZE}))</td>
</tr>
<tr>
<td>8:9</td>
<td>TLBACC</td>
<td>TLB register access:</td>
<td>C_MMU_TLB_ACCESS</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = Minimal</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Read</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>2 = Write</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>3 = Full</td>
<td></td>
</tr>
<tr>
<td>10:14</td>
<td>ZONES</td>
<td>Number of memory protection zones</td>
<td>C_MMU_ZONES</td>
</tr>
<tr>
<td>15</td>
<td>PRIVINS</td>
<td>Privileged instructions:</td>
<td>C_MMU_PRIVILEGED_INSTR</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0 = Full protection</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Allow stream instructions</td>
<td></td>
</tr>
<tr>
<td>16:16</td>
<td>Reserved</td>
<td>Reserved for future use</td>
<td>0</td>
</tr>
<tr>
<td>17:31</td>
<td>RSTMSR</td>
<td>Reset value for MSR</td>
<td>C_RESET_MSR</td>
</tr>
</tbody>
</table>
Chapter 2: MicroBlaze Architecture

Pipeline Architecture

MicroBlaze instruction execution is pipelined. For most instructions, each stage takes one clock cycle to complete. Consequently, the number of clock cycles necessary for a specific instruction to complete is equal to the number of pipeline stages, and one instruction is completed on every cycle. A few instructions require multiple clock cycles in the execute stage to complete. This is achieved by stalling the pipeline.

When executing from slower memory, instruction fetches may take multiple cycles. This additional latency directly affects the efficiency of the pipeline. MicroBlaze implements an instruction prefetch buffer that reduces the impact of such multi-cycle instruction memory latency. While the pipeline is stalled by a multi-cycle instruction in the execution stage, the prefetch buffer continues to load sequential instructions. When the pipeline resumes execution, the fetch stage can load new instructions directly from the prefetch buffer instead of waiting for the instruction memory access to complete. If instructions are modified during execution (e.g. with self-modifying code), the prefetch buffer should be emptied before executing the modified instructions, to ensure that it does not contain the old unmodified instructions. The recommended way to do this is using an MBAR instruction, although it is also possible to use a synchronizing branch instruction, for example BRI 4.

Three Stage Pipeline

With \texttt{C\_AREA\_OPTIMIZED} set to 1, the pipeline is divided into three stages to minimize hardware cost: Fetch, Decode, and Execute.

<table>
<thead>
<tr>
<th>cycle1</th>
<th>cycle2</th>
<th>cycle3</th>
<th>cycle4</th>
<th>cycle5</th>
<th>cycle6</th>
<th>cycle7</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>Fetch</td>
<td>Decode</td>
<td>Execute</td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>cycle1</th>
<th>cycle2</th>
<th>cycle3</th>
<th>cycle4</th>
<th>cycle5</th>
<th>cycle6</th>
<th>cycle7</th>
<th>cycle8</th>
<th>cycle9</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>IF</td>
<td>OF</td>
<td>EX</td>
<td>MEM</td>
<td>WB</td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>cycle1</th>
<th>cycle2</th>
<th>cycle3</th>
<th>cycle4</th>
<th>cycle5</th>
<th>cycle6</th>
<th>cycle7</th>
<th>cycle8</th>
<th>cycle9</th>
</tr>
</thead>
<tbody>
<tr>
<td>IF</td>
<td>OF</td>
<td>EX</td>
<td>MEM</td>
<td>MEM</td>
<td>MEM</td>
<td>WB</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>instruction 1</th>
<th>instruction 2</th>
<th>instruction 3</th>
</tr>
</thead>
<tbody>
<tr>
<td>Fetch</td>
<td>Fetch</td>
<td>Fetch</td>
</tr>
<tr>
<td>Decode</td>
<td>Decode</td>
<td>Decode</td>
</tr>
<tr>
<td>Execute</td>
<td>Execute</td>
<td>Stall</td>
</tr>
<tr>
<td></td>
<td>Execute</td>
<td>Stall</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Execute</td>
</tr>
</tbody>
</table>

Five Stage Pipeline

With \texttt{C\_AREA\_OPTIMIZED} set to 0, the pipeline is divided into five stages to maximize performance: Fetch (IF), Decode (OF), Execute (EX), Access Memory (MEM), and Writeback (WB).

<table>
<thead>
<tr>
<th>cycle1</th>
<th>cycle2</th>
<th>cycle3</th>
<th>cycle4</th>
<th>cycle5</th>
<th>cycle6</th>
<th>cycle7</th>
<th>cycle8</th>
<th>cycle9</th>
</tr>
</thead>
<tbody>
<tr>
<td>IF</td>
<td>OF</td>
<td>EX</td>
<td>MEM</td>
<td>MEM</td>
<td>MEM</td>
<td>WB</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>cycle1</th>
<th>cycle2</th>
<th>cycle3</th>
<th>cycle4</th>
<th>cycle5</th>
<th>cycle6</th>
<th>cycle7</th>
<th>cycle8</th>
<th>cycle9</th>
</tr>
</thead>
<tbody>
<tr>
<td>IF</td>
<td>OF</td>
<td>EX</td>
<td>Stall</td>
<td>Stall</td>
<td>MEM</td>
<td>WB</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Branches

Normally the instructions in the fetch and decode stages (as well as prefetch buffer) are flushed when executing a taken branch. The fetch pipeline stage is then reloaded with a new instruction from the calculated branch address. A taken branch in MicroBlaze takes three clock cycles to execute, two of which are required for refilling the pipeline. To reduce this latency overhead, MicroBlaze supports branches with delay slots.
**Delay Slots**

When executing a taken branch with delay slot, only the fetch pipeline stage in MicroBlaze is flushed. The instruction in the decode stage (branch delay slot) is allowed to complete. This technique effectively reduces the branch penalty from two clock cycles to one. Branch instructions with delay slots have a D appended to the instruction mnemonic. For example, the BNE instruction does not execute the subsequent instruction (does not have a delay slot), whereas BNED executes the next instruction before control is transferred to the branch location.

A delay slot must not contain the following instructions: IMM, branch, or break. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.

Instructions that could cause recoverable exceptions (e.g. unaligned word or halfword load and store) are allowed in the delay slot. If an exception is caused in a delay slot the ESR[DS] bit is set, and the exception handler is responsible for returning the execution to the branch target (stored in the special purpose register BTR). If the ESR[DS] bit is set, register R17 is not valid (otherwise it contains the address following the instruction causing the exception).

**Branch Target Cache**

To improve branch performance, MicroBlaze provides a Branch Target Cache (BTC) coupled with a branch prediction scheme. With the BTC enabled, a correctly predicted immediate branch or return instruction incurs no overhead.

The BTC operates by saving the target address of each immediate branch and return instruction the first time the instruction is encountered. The next time it is encountered, it is usually found in the Branch Target Cache, and the Instruction Fetch Program Counter is then simply changed to the saved target address, in case the branch should be taken. Unconditional branches and return instructions are always taken, whereas conditional branches use branch prediction, to avoid taking a branch that should not have been taken and vice versa.

The BTC is cleared when a memory barrier (MBAR 0) or synchronizing branch (BRI 4) is executed. There are three cases where the branch prediction can cause a mispredict, namely:

- A conditional branch that should not have been taken, is actually taken,
- A conditional branch that should actually have been taken, is not taken,
- The target address of a return instruction is incorrect, which may occur when returning from a function called from different places in the code.

All of these cases are detected and corrected when the branch or return instruction reaches the execute stage, and the branch prediction bits or target address are updated in the BTC, to reflect the actual instruction behavior. This correction incurs a penalty of two clock cycles.

The size of the BTC can be selected with \texttt{C\_BRANCH\_TARGET\_CACHE\_SIZE}. The default recommended setting uses one block RAM, and provides either 512 entries (for Virtex-5, Virtex-6, and 7 Series) or 256 entries (for all other families). When selecting 64 entries or below, distributed RAM is used to implement the BTC, otherwise block RAM is used.

When the BTC uses block RAM, and \texttt{C\_FAULT\_TOLERANT} is set to 1, block RAMs are protected by parity. In case of a parity error, the branch is not predicted. To avoid accumulating errors in this case, the BTC should be cleared periodically by a synchronizing branch.

The Branch Target Cache is available when \texttt{C\_USE\_BRANCH\_TARGET\_CACHE} is set to 1 and \texttt{C\_AREA\_OPTIMIZED} is set to 0.
Chapter 2: MicroBlaze Architecture

Memory Architecture

MicroBlaze is implemented with a Harvard memory architecture; instruction and data accesses are done in separate address spaces. Each address space has a 32-bit range (that is, handles up to 4-GB of instructions and data memory respectively). The instruction and data memory ranges can be made to overlap by mapping them both to the same physical memory. The latter is useful for software debugging.

Both instruction and data interfaces of MicroBlaze are default 32 bits wide and use big endian or little endian, bit-reversed format, depending on the parameter C_ENDIANNESS. MicroBlaze supports word, halfword, and byte accesses to data memory.

Data accesses must be aligned (word accesses must be on word boundaries, halfword on halfword boundaries), unless the processor is configured to support unaligned exceptions. All instruction accesses must be word aligned.

MicroBlaze prefetches instructions to improve performance, using the instruction prefetch buffer and (if enabled) instruction cache streams. To avoid attempts to prefetch instructions beyond the end of physical memory, which may cause an instruction bus error or a processor stall, instructions must not be located too close to the end of physical memory. The instruction prefetch buffer requires 16 bytes margin, and using instruction cache streams adds two additional cache lines (32 or 64 bytes).

MicroBlaze does not separate data accesses to I/O and memory (it uses memory mapped I/O). The processor has up to three interfaces for memory accesses:

- Local Memory Bus (LMB)
- Advanced eXtensible Interface (AXI4) or Processor Local Bus (PLB)
- Advanced eXtensible Interface (AXI4) or Xilinx CacheLink (XCL)

The LMB memory address range must not overlap with AXI4, PLB or XCL ranges.

The C_ENDIANNESS parameter is automatically set to little endian when using AXI4, and to big endian when using PLB, but can be overridden by the user.

MicroBlaze has a single cycle latency for accesses to local memory (LMB) and for cache read hits, except with CAREA_OPTIMIZED set to 1, when data side accesses and data cache read hits require two clock cycles, and with C_FAULT_TOLERANT set to 1, when byte writes and halfword writes to LMB normally require two clock cycles.

The data cache writes latency depends on C_DCACHE_USE_WRITEBACK. When C_DCACHE_USE_WRITEBACK is set to 1, the write latency normally is one cycle (more if the cache needs to do memory accesses). When C_DCACHE_USE_WRITEBACK is cleared to 0, the write latency normally is two cycles (more if the posted-write buffer in the memory controller is full).

The MicroBlaze instruction and data caches can be configured to use 4 or 8 word cache lines. When using a longer cache line, more bytes are prefetched, which generally improves performance for software with sequential access patterns. However, for software with a more random access pattern the performance can instead decrease for a given cache size. This is caused by a reduced cache hit rate due to fewer available cache lines.

For details on the different memory interfaces refer to Chapter 3, MicroBlaze Signal Interface Description.
Privileged Instructions

The following MicroBlaze instructions are privileged:

- **GET, GETD, PUT, PUTD** (except when explicitly allowed)
- **WIC, WDC**
- **MTS**
- **MSRCLR, MSRSET** (except when only the C bit is affected)
- **BRK**
- **RTID, RTBD, RTED**
- **BRKI** (except when jumping to physical address \( C_{\text{BASE_VECTORS}} + 0x8 \) or \( C_{\text{BASE_VECTORS}} + 0x18 \))
- **SLEEP**

Attempted use of these instructions when running in user mode causes a privileged instruction exception.

When setting the parameter \( C_{\text{MMU_PRIVILEGED_INSTR}} \) to 1, the instructions GET, GETD, PUT, and PUTD are not considered privileged, and can be executed when running in user mode. It is strongly discouraged to do this, unless absolutely necessary for performance reasons, since it allows application programs to interfere with each other.

There are six ways to leave user mode and virtual mode:

1. Hardware generated reset (including debug reset)
2. Hardware exception
3. Non-maskable break or hardware break
4. Interrupt
5. Executing "BRALID Re, C_{\text{BASE_VECTORS}} + 0x8" to perform a user vector exception
6. Executing the software break instructions "BRKI" jumping to physical address \( C_{\text{BASE_VECTORS}} + 0x8 \) or \( C_{\text{BASE_VECTORS}} + 0x18 \)

In all of these cases, except hardware generated reset, the user mode and virtual mode status is saved in the MSR UMS and VMS bits.

Application (user-mode) programs transfer control to system-service routines (privileged mode programs) using the BRALID or BRKI instruction, jumping to physical address \( C_{\text{BASE_VECTORS}} + 0x8 \). Executing this instruction causes a system-call exception to occur. The exception handler determines which system-service routine to call and whether the calling application has permission to call that service. If permission is granted, the exception handler performs the actual procedure call to the system-service routine on behalf of the application program.

The execution environment expected by the system-service routine requires the execution of prologue instructions to set up that environment. Those instructions usually create the block of storage that holds procedural information (the activation record), update and initialize pointers, and save volatile registers (registers the system-service routine uses). Prologue code can be inserted by the linker when creating an executable module, or it can be included as stub code in either the system-call interrupt handler or the system-library routines.

Returns from the system-service routine reverse the process described above. Epilog code is executed to unwind and deallocate the activation record, restore pointers, and restore volatile registers. The interrupt handler executes a return from exception instruction (RTED) to return to the application.
Virtual-Memory Management

Programs running on MicroBlaze use effective addresses to access a flat 4 GB address space. The processor can interpret this address space in one of two ways, depending on the translation mode:

- In real mode, effective addresses are used to directly access physical memory
- In virtual mode, effective addresses are translated into physical addresses by the virtual-memory management hardware in the processor

Virtual mode provides system software with the ability to relocate programs and data anywhere in the physical address space. System software can move inactive programs and data out of physical memory when space is required by active programs and data.

Relocation can make it appear to a program that more memory exists than is actually implemented by the system. This frees the programmer from working within the limits imposed by the amount of physical memory present in a system. Programmers do not need to know which physical-memory addresses are assigned to other software processes and hardware devices. The addresses visible to programs are translated into the appropriate physical addresses by the processor.

Virtual mode provides greater control over memory protection. Blocks of memory as small as 1 KB can be individually protected from unauthorized access. Protection and relocation enable system software to support multitasking. This capability gives the appearance of simultaneous or near-simultaneous execution of multiple programs.

In MicroBlaze, virtual mode is implemented by the memory-management unit (MMU), available when `C_USE_MMU` is set to 3 (Virtual) and `C_AREA_OPTIMIZED` is set to 0. The MMU controls effective-address to physical-address mapping and supports memory protection. Using these capabilities, system software can implement demand-paged virtual memory and other memory management schemes.

The MicroBlaze MMU implementation is based upon PowerPC™ 405. For details, see the PowerPC Processor Reference Guide (UG011) document.

The MMU features are summarized as follows:

- Translates effective addresses into physical addresses
- Controls page-level access during address translation
- Provides additional virtual-mode protection control through the use of zones
- Provides independent control over instruction-address and data-address translation and protection
- Supports eight page sizes: 1 kB, 4 kB, 16 kB, 64 kB, 256 kB, 1 MB, 4 MB, and 16 MB. Any combination of page sizes can be used by system software
- Software controls the page-replacement strategy

Real Mode

The processor references memory when it fetches an instruction and when it accesses data with a load or store instruction. Programs reference memory locations using a 32-bit effective address calculated by the processor. When real mode is enabled, the physical address is identical to the effective address and the processor uses it to access physical memory. After a processor reset, the processor operates in real mode. Real mode can also be enabled by clearing the VM bit in the MSR.

Physical-memory data accesses (loads and stores) are performed in real mode using the effective address. Real mode does not provide system software with virtual address translation, but the full memory access-protection is available, implemented when `C_USE_MMU > 1` (User Mode) and `C_AREA_OPTIMIZED = 0`. Implementation of a real-mode memory manager is more
Virtual-Memory Management

straightforward than a virtual-mode memory manager. Real mode is often an appropriate solution for memory management in simple embedded environments, when access-protection is necessary, but virtual address translation is not required.

Virtual Mode

In virtual mode, the processor translates an effective address into a physical address using the process shown in Figure 2-18. Virtual mode can be enabled by setting the VM bit in the MSR.

![Virtual-Mode Address Translation](image_url)

Each address shown in Figure 2-18 contains a page-number field and an offset field. The page number represents the portion of the address translated by the MMU. The offset represents the byte offset into a page and is not translated by the MMU. The virtual address consists of an additional field, called the process ID (PID), which is taken from the PID register (see Process-ID Register, page 36). The combination of PID and effective page number (EPN) is referred to as the virtual page number (VPN). The value n is determined by the page size, as shown in Table 2-36.

System software maintains a page-translation table that contains entries used to translate each virtual page into a physical page. The page size defined by a page translation entry determines the size of the page number and offset fields. For example, when a 4 kB page size is used, the page-number field is 20 bits and the offset field is 12 bits. The VPN in this case is 28 bits.

Then the most frequently used page translations are stored in the translation look-aside buffer (TLB). When translating a virtual address, the MMU examines the page-translation entries for a matching VPN (PID and EPN). Rather than examining all entries in the table, only entries contained in the processor TLB are examined. When a page-translation entry is found with a matching VPN, the corresponding physical-page number is read from the entry and combined with the offset to form the 32-bit physical address. This physical address is used by the processor to reference memory.
System software can use the PID to uniquely identify software processes (tasks, subroutines, threads) running on the processor. Independently compiled processes can operate in effective-address regions that overlap each other. This overlap must be resolved by system software if multitasking is supported. Assigning a PID to each process enables system software to resolve the overlap by relocating each process into a unique region of virtual-address space. The virtual-address space mappings enable independent translation of each process into the physical-address space.

Page-Translation Table

The page-translation table is a software-defined and software-managed data structure containing page translations. The requirement for software-managed page translation represents an architectural trade-off targeted at embedded-system applications. Embedded systems tend to have a tightly controlled operating environment and a well-defined set of application software. That environment enables virtual-memory management to be optimized for each embedded system in the following ways:

- The page-translation table can be organized to maximize page-table search performance (also called table walking) so that a given page-translation entry is located quickly. Most general-purpose processors implement either an indexed page table (simple search method, large page-table size) or a hashed page table (complex search method, small page-table size). With software table walking, any hybrid organization can be employed that suits the particular embedded system. Both the page-table size and access time can be optimized.

- Independent page sizes can be used for application modules, device drivers, system service routines, and data. Independent page-size selection enables system software to more efficiently use memory by reducing fragmentation (unused memory). For example, a large data structure can be allocated to a 16 MB page and a small I/O device-driver can be allocated to a 1 KB page.

- Page replacement can be tuned to minimize the occurrence of missing page translations. As described in the following section, the most-frequently used page translations are stored in the translation look-aside buffer (TLB). Software is responsible for deciding which translations are stored in the TLB and which translations are replaced when a new translation is required. The replacement strategy can be tuned to avoid thrashing, whereby page-translation entries are constantly being moved in and out of the TLB. The replacement strategy can also be tuned to prevent replacement of critical-page translations, a process sometimes referred to as page locking.

The unified 64-entry TLB, managed by software, caches a subset of instruction and data page-translation entries accessible by the MMU. Software is responsible for reading entries from the page-translation table in system memory and storing them in the TLB. The following section describes the unified TLB in more detail. Internally, the MMU also contains shadow TLBs for instructions and data, with sizes configurable by `C_MMU_ITLB_SIZE` and `C_MMU_DTLB_SIZE` respectively.

These shadow TLBs are managed entirely by the processor (transparent to software) and are used to minimize access conflicts with the unified TLB.

Translation Look-Aside Buffer

The translation look-aside buffer (TLB) is used by the MicroBlaze MMU for address translation when the processor is running in virtual mode, memory protection, and storage control. Each entry within the TLB contains the information necessary to identify a virtual page (PID and effective page number), specify its translation into a physical page, determine the protection characteristics of the page, and specify the storage attributes associated with the page.
The MicroBlaze TLB is physically implemented as three separate TLBs:

- Unified TLB—The UTLB contains 64 entries and is pseudo-associative. Instruction-page and data-page translation can be stored in any UTLB entry. The initialization and management of the UTLB is controlled completely by software.

- Instruction Shadow TLB—The ITLB contains instruction page-translation entries and is fully associative. The page-translation entries stored in the ITLB represent the most-recently accessed instruction-page translations from the UTLB. The ITLB is used to minimize contention between instruction translation and UTLB-update operations. The initialization and management of the ITLB is controlled completely by hardware and is transparent to software.

- Data Shadow TLB—The DTLB contains data page-translation entries and is fully associative. The page-translation entries stored in the DTLB represent the most-recently accessed data-page translations from the UTLB. The DTLB is used to minimize contention between data translation and UTLB-update operations. The initialization and management of the DTLB is controlled completely by hardware and is transparent to software.

Figure 2-19 provides the translation flow for TLB.

![Figure 2-19: TLB Address Translation Flow](image_url)
Figure 2-20 shows the format of a TLB entry. Each TLB entry is 68 bits and is composed of two portions: TLBLO (also referred to as the data entry), and TLBHI (also referred to as the tag entry).

**TLBLO:**

```
0 22 23 24 28 29 30 31
```

- **RPN**
- **EX**
- **WR**
- **ZSEL**
- **W**
- **I**
- **M**
- **G**

**TLBHI:**

```
0 22 25 26 27 28 35
```

- **TAG**
- **SIZE**
- **V**
- **E**
- **U0**
- **TID**

Figure 2-20: TLB Entry Format

The TLB entry contents are described in Table 2-20, page 38 and Table 2-21, page 40.

The fields within a TLB entry are categorized as follows:

- Virtual-page identification (TAG, SIZE, V, TID)—These fields identify the page-translation entry. They are compared with the virtual-page number during the translation process.
- Physical-page identification (RPN, SIZE)—These fields identify the translated page in physical memory.
- Access control (EX, WR, ZSEL)—These fields specify the type of access allowed in the page and are used to protect pages from improper accesses.
- Storage attributes (W, I, M, G, E, U0)—These fields specify the storage-control attributes, such as caching policy for the data cache (write-back or write-through), whether a page is cacheable, and how bytes are ordered (endianness).

Table 2-36 shows the relationship between the TLB-entry SIZE field and the translated page size. This table also shows how the page size determines which address bits are involved in a tag comparison, which address bits are used as a page offset, and which bits in the physical page number are used in the physical address.

**Table 2-36: Page-Translation Bit Ranges by Page Size**

<table>
<thead>
<tr>
<th>Page Size</th>
<th>SIZE (TLBHI Field)</th>
<th>Tag Comparison Bit Range</th>
<th>Page Offset</th>
<th>Physical Page Number</th>
<th>RPN Bits Clear to 0</th>
</tr>
</thead>
<tbody>
<tr>
<td>1 KB</td>
<td>000</td>
<td>TAG[0:21] - Address[0:21]</td>
<td>Address[22:31]</td>
<td>RPN[0:21]</td>
<td>-</td>
</tr>
</tbody>
</table>
TLB Access

When the MMU translates a virtual address (the combination of PID and effective address) into a physical address, it first examines the appropriate shadow TLB for the page translation entry. If an entry is found, it is used to access physical memory. If an entry is not found, the MMU examines the UTLB for the entry. A delay occurs each time the UTLB must be accessed due to a shadow TLB miss. The miss latency ranges from 2-32 cycles. The DTLB has priority over the ITLB if both simultaneously access the UTLB.

Figure 2-21, page 60 shows the logical process the MMU follows when examining a page-translation entry in one of the shadow TLBs or the UTLB. All valid entries in the TLB are checked.

A TLB hit occurs when all of the following conditions are met by a TLB entry:

- The entry is valid
- The TAG field in the entry matches the effective address EPN under the control of the SIZE field in the entry
- The TID field in the entry matches the PID

If any of the above conditions are not met, a TLB miss occurs. A TLB miss causes an exception, described as follows:

A TID value of 0x00 causes the MMU to ignore the comparison between the TID and PID. Only the TAG and EA[EPN] are compared. A TLB entry with TID=0x00 represents a process-independent translation. Pages that are accessed globally by all processes should be assigned a TID value of 0x00. A PID value of 0x00 does not identify a process that can access any page. When PID=0x00, a page-translation hit only occurs when TID=0x00. It is possible for software to load the TLB with multiple entries that match an EA[EPN] and PID combination. However, this is considered a programming error and results in undefined behavior.

When a hit occurs, the MMU reads the RPN field from the corresponding TLB entry. Some or all of the bits in this field are used, depending on the value of the SIZE field (see Table 2-36). For example, if the SIZE field specifies a 256 kB page size, RPN[0:13] represents the physical page number and is used to form the physical address. RPN[14:21] is not used, and software must clear those bits to 0 when initializing the TLB entry. The remainder of the physical address is taken from the page-offset portion of the EA. If the page size is 256 kB, the 32-bit physical address is formed by concatenating RPN[0:13] with bits14:31 of the effective address.

Prior to accessing physical memory, the MMU examines the TLB-entry access-control fields. These fields indicate whether the currently executing program is allowed to perform the requested memory access.

If access is allowed, the MMU checks the storage-attribute fields to determine how to access the page. The storage-attribute fields specify the caching policy for memory accesses.

TLB Access Failures

A TLB-access failure causes an exception to occur. This interrupts execution of the instruction that caused the failure and transfers control to an interrupt handler to resolve the failure. A TLB access can fail for two reasons:

- A matching TLB entry was not found, resulting in a TLB miss
- A matching TLB entry was found, but access to the page was prevented by either the storage attributes or zone protection

When an interrupt occurs, the processor enters real mode by clearing MSR[VM] to 0. In real mode, all address translation and memory-protection checks performed by the MMU are disabled. After
system software initializes the UTLB with page-translation entries, management of the MicroBlaze UTLB is usually performed using interrupt handlers running in real mode.

**Figure 2-21** diagrams the general process for examining a TLB entry.

![General Process for Examining a TLB Entry](UG011_41_033101)

The following sections describe the conditions under which exceptions occur due to TLB access failures.

**Data-Storage Exception**

When virtual mode is enabled, (MSR[VM]=1), a data-storage exception occurs when access to a page is not permitted for any of the following reasons:

- From user mode:
  - The TLB entry specifies a zone field that prevents access to the page (ZPR[Zn]=00). This applies to load and store instructions.
  - The TLB entry specifies a read-only page (TLBLO[WR]=0) that is not otherwise overridden by the zone field (ZPR[Zn], 11). This applies to store instructions.
• From privileged mode:
  ♦ The TLB entry specifies a read-only page (TLBLO[WR]=0) that is not otherwise overridden by the zone field (ZPR[Zn], 10 and ZPR[Zn], 11). This applies to store instructions.

Instruction-Storage Exception

When virtual mode is enabled, (MSR[VM]=1), an instruction-storage exception occurs when access to a page is not permitted for any of the following reasons:

• From user mode:
  ♦ The TLB entry specifies a zone field that prevents access to the page (ZPR[Zn]=00).
  ♦ The TLB entry specifies a non-executable page (TLBLO[EX]=0) that is not otherwise overridden by the zone field (ZPR[Zn], 11).
  ♦ The TLB entry specifies a guarded-storage page (TLBLO[G]=1).

• From privileged mode:
  ♦ The TLB entry specifies a non-executable page (TLBLO[EX]=0) that is not otherwise overridden by the zone field (ZPR[Zn], 10 and ZPR[Zn], 11).
  ♦ The TLB entry specifies a guarded-storage page (TLBLO[G]=1).

Data TLB-Miss Exception

When virtual mode is enabled (MSR[VM]=1) a data TLB-miss exception occurs if a valid, matching TLB entry was not found in the TLB (shadow and UTLB). Any load or store instruction can cause a data TLB-miss exception.

Instruction TLB-Miss Exception

When virtual mode is enabled (MSR[VM]=1) an instruction TLB-miss exception occurs if a valid, matching TLB entry was not found in the TLB (shadow and UTLB). Any instruction fetch can cause an instruction TLB-miss exception.

Access Protection

System software uses access protection to protect sensitive memory locations from improper access. System software can restrict memory accesses for both user-mode and privileged-mode software. Restrictions can be placed on reads, writes, and instruction fetches. Access protection is available when virtual protected mode is enabled.

Access control applies to instruction fetches, data loads, and data stores. The TLB entry for a virtual page specifies the type of access allowed to the page. The TLB entry also specifies a zone-protection field in the zone-protection register that is used to override the access controls specified by the TLB entry.

TLB Access-Protection Controls

Each TLB entry controls three types of access:

• Process—Processes are protected from unauthorized access by assigning a unique process ID (PID) to each process. When system software starts a user-mode application, it loads the PID for that application into the PID register. As the application executes, memory addresses are translated using only TLB entries with a TID field in Translation Look-Aside Buffer High (TLBHI) that matches the PID. This enables system software to restrict accesses for an application to a specific area in virtual memory.
A TLB entry with TID=0x00 represents a process-independent translation. Pages that are accessed globally by all processes should be assigned a TID value of 0x00.

- Execution—The processor executes instructions only if they are fetched from a virtual page marked as executable (TLBLO[EX]=1). Clearing TLBLO[EX] to 0 prevents execution of instructions fetched from a page, instead causing an instruction-storage interrupt (ISI) to occur. The ISI does not occur when the instruction is fetched, but instead occurs when the instruction is executed. This prevents speculatively fetched instructions that are later discarded (rather than executed) from causing an ISI.

  The zone-protection register can override execution protection.

- Read/Write—Data is written only to virtual pages marked as writable (TLBLO[WR]=1). Clearing TLBLO[WR] to 0 marks a page as read-only. An attempt to write to a read-only page causes a data-storage interrupt (DSI) to occur.

  The zone-protection register can override write protection.

TLB entries cannot be used to prevent programs from reading pages. In virtual mode, zone protection is used to read-protect pages. This is done by defining a no-access-allowed zone (ZPR[Zn] = 00) and using it to override the TLB-entry access protection. Only programs running in user mode can be prevented from reading a page. Privileged programs always have read access to a page.

**Zone Protection**

Zone protection is used to override the access protection specified in a TLB entry. Zones are an arbitrary grouping of virtual pages with common access protection. Zones can contain any number of pages specifying any combination of page sizes. There is no requirement for a zone to contain adjacent pages.

The zone-protection register (ZPR) is a 32-bit register used to specify the type of protection override applied to each of 16 possible zones. The protection override for a zone is encoded in the ZPR as a 2-bit field. The 4-bit zone-select field in a TLB entry (TLBLO[ZSEL]) selects one of the 16 zone fields from the ZPR (Z0–Z15). For example, zone Z5 is selected when ZSEL = 0101.

Changing a zone field in the ZPR applies a protection override across all pages in that zone. Without the ZPR, protection changes require individual alterations to each page translation entry within the zone.

Unimplemented zones (when `C_MMU_ZONES < 16`) are treated as if they contained 11.

**UTLB Management**

The UTLB serves as the interface between the processor MMU and memory-management software. System software manages the UTLB to tell the MMU how to translate virtual addresses into physical addresses. When a problem occurs due to a missing translation or an access violation, the MMU communicates the problem to system software using the exception mechanism. System software is responsible for providing interrupt handlers to correct these problems so that the MMU can proceed with memory translation.

Software reads and writes UTLB entries using the MFS and MTS instructions, respectively. These instructions use the TLBX register index (numbered 0 to 63) corresponding to one of the 64 entries in the UTLB. The tag and data portions are read and written separately, so software must execute two MFS or MTS instructions to completely access an entry. The UTLB is searched for a specific translation using the TLBSX register. TLBSX locates a translation using an effective address and loads the corresponding UTLB index into the TLBX register.
Individual UTLB entries are invalidated using the MTS instruction to clear the valid bit in the tag portion of a TLB entry (TLBHI[V]).

When C_FAULT_TOLERANT is set to 1, the UTLB block RAM is protected by parity. In case of a parity error, a TLB miss exception occurs. To avoid accumulating errors in this case, each entry in the UTLB should be periodically invalidated.

Recording Page Access and Page Modification

Software management of virtual-memory poses several challenges:

- In a virtual-memory environment, software and data often consume more memory than is physically available. Some of the software and data pages must be stored outside physical memory, such as on a hard drive, when they are not used. Ideally, the most-frequently used pages stay in physical memory and infrequently used pages are stored elsewhere.

- When pages in physical-memory are replaced to make room for new pages, it is important to know whether the replaced (old) pages were modified. If they were modified, they must be saved prior to loading the replacement (new) pages. If the old pages were not modified, the new pages can be loaded without saving the old pages.

- A limited number of page translations are kept in the UTLB. The remaining translations must be stored in the page-translation table. When a translation is not found in the UTLB (due to a miss), system software must decide which UTLB entry to discard so that the missing translation can be loaded. It is desirable for system software to replace infrequently used translations rather than frequently used translations.

Solving the above problems in an efficient manner requires keeping track of page accesses and page modifications. MicroBlaze does not track page access and page modification in hardware. Instead, system software can use the TLB-miss exceptions and the data-storage exception to collect this information. As the information is collected, it can be stored in a data structure associated with the page-translation table.

Page-access information is used to determine which pages should be kept in physical memory and which are replaced when physical-memory space is required. System software can use the valid bit in the TLB entry (TLBHI[V]) to monitor page accesses. This requires page translations be initialized as not valid (TLBHI[V]=0) to indicate they have not been accessed. The first attempt to access a page causes a TLB-miss exception, either because the UTLB entry is marked not valid or because the page translation is not present in the UTLB. The TLB-miss handler updates the UTLB with a valid translation (TLBHI[V]=1). The set valid bit serves as a record that the page and its translation have been accessed. The TLB-miss handler can also record the information in a separate data structure associated with the page-translation entry.

Page-modification information is used to indicate whether an old page can be overwritten with a new page or the old page must first be stored to a hard disk. System software can use the write-protection bit in the TLB entry (TLBLO[WR]) to monitor page modification. This requires page translations be initialized as read-only (TLBLO[WR]=0) to indicate they have not been modified. The first attempt to write data into a page causes a data-storage exception, assuming the page has already been accessed and marked valid as described above. If software has permission to write into the page, the data-storage handler marks the page as writable (TLBLO[WR]=1) and returns. The set write-protection bit serves as a record that a page has been modified. The data-storage handler can also record this information in a separate data structure associated with the page-translation entry.

Tracking page modification is useful when virtual mode is first entered and when a new process is started.
Reset, Interrupts, Exceptions, and Break

MicroBlaze supports reset, interrupt, user exception, break, and hardware exceptions. The following section describes the execution flow associated with each of these events.

The relative priority starting with the highest is:

1. Reset
2. Hardware Exception
3. Non-maskable Break
4. Break
5. Interrupt
6. User Vector (Exception)

Table 2-37 defines the memory address locations of the associated vectors and the hardware enforced register file locations for return addresses. Each vector allocates two addresses to allow full address range branching (requires an \texttt{IMM} followed by a \texttt{BRAI} instruction). Normally the vectors start at address 0x00000000, but the parameter \texttt{C_BASE_VECTORS} can be used to locate them anywhere in memory.

The address range 0x28 to 0x4F is reserved for future software support by Xilinx. Allocating these addresses for user applications is likely to conflict with future releases of EDK support software.

<table>
<thead>
<tr>
<th>Event</th>
<th>Vector Address</th>
<th>Register File Return Address</th>
</tr>
</thead>
<tbody>
<tr>
<td>Reset</td>
<td>( \text{C_BASE_VECTORS} + 0x00000000 ) ( - ) ( \text{C_BASE_VECTORS} + 0x00000004 )</td>
<td>-</td>
</tr>
<tr>
<td>User Vector (Exception)</td>
<td>( \text{C_BASE_VECTORS} + 0x00000008 ) ( - ) ( \text{C_BASE_VECTORS} + 0x0000000C )</td>
<td>Rx</td>
</tr>
<tr>
<td>Interrupt(^1)</td>
<td>( \text{C_BASE_VECTORS} + 0x00000010 ) ( - ) ( \text{C_BASE_VECTORS} + 0x00000014 )</td>
<td>R14</td>
</tr>
<tr>
<td>Break: Non-maskable hardware</td>
<td>( \text{C_BASE_VECTORS} + 0x00000018 ) ( - ) ( \text{C_BASE_VECTORS} + 0x0000001C )</td>
<td>R16</td>
</tr>
<tr>
<td>Break: Hardware</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Break: Software</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Hardware Exception</td>
<td>( \text{C_BASE_VECTORS} + 0x00000020 ) ( - ) ( \text{C_BASE_VECTORS} + 0x00000024 )</td>
<td>R17 or BTR</td>
</tr>
<tr>
<td>Reserved by Xilinx for future use</td>
<td>( \text{C_BASE_VECTORS} + 0x00000028 ) ( - ) ( \text{C_BASE_VECTORS} + 0x0000004F )</td>
<td>-</td>
</tr>
</tbody>
</table>

1. With low-latency interrupt mode, the vector address is supplied by the Interrupt Controller.

All of these events will clear the reservation bit, used together with the LWX and SWX instructions to implement mutual exclusion, such as semaphores and spinlocks.

**Reset**

When a \texttt{Reset}, \texttt{MB\_Reset}, or \texttt{Debug\_Rst} (\(^1\)) occurs, MicroBlaze flushes the pipeline and starts fetching instructions from the reset vector (address 0x0). Both external reset signals are active high and should be asserted for a minimum of 16 cycles.

1. Reset input controlled by the XMD debugger via MDM.
Equivalent Pseudocode

```plaintext
PC ← C_BASE_VECTORS + 0x00000000
MSR ← C_RESET_MSR (see "MicroBlaze Core Configurability" in Chapter 3)
EAR ← 0; ESR ← 0; FSR ← 0
PID ← 0; ZPR ← 0; TLBX ← 0
Reservation ← 0
```

Hardware Exceptions

MicroBlaze can be configured to trap the following internal error conditions: illegal instruction, instruction and data bus error, and unaligned access. The divide exception can only be enabled if the processor is configured with a hardware divider (C_USE_DIV=1). When configured with a hardware floating point unit (C_USE_FPU>0), it can also trap the following floating point specific exceptions: underflow, overflow, float division-by-zero, invalid operation, and denormalized operand error.

When configured with a hardware Memory Management Unit, it can also trap the following memory management specific exceptions: Illegal Instruction Exception, Data Storage Exception, Instruction Storage Exception, Data TLB Miss Exception, and Instruction TLB Miss Exception.

A hardware exception causes MicroBlaze to flush the pipeline and branch to the hardware exception vector (address C_BASE_VECTORS + 0x20). The execution stage instruction in the exception cycle is not executed.

The exception also updates the general purpose register R17 in the following manner:

- For the MMU exceptions (Data Storage Exception, Instruction Storage Exception, Data TLB Miss Exception, Instruction TLB Miss Exception) the register R17 is loaded with the appropriate program counter value to re-execute the instruction causing the exception upon return. The value is adjusted to return to a preceding IMM instruction, if any. If the exception is caused by an instruction in a branch delay slot, the value is adjusted to return to the branch instruction, including adjustment for a preceding IMM instruction, if any.

- For all other exceptions the register R17 is loaded with the program counter value of the subsequent instruction, unless the exception is caused by an instruction in a branch delay slot. If the exception is caused by an instruction in a branch delay slot, the ESR[DS] bit is set. In this case the exception handler should resume execution from the branch target address stored in BTR.

The EE and EIP bits in MSR are automatically reverted when executing the RTED instruction.

The VM and UM bits in MSR are automatically reverted from VMS and UMS when executing the RTED, RTBD, and RTID instructions.

Exception Priority

When two or more exceptions occur simultaneously, they are handled in the following order, from the highest priority to the lowest:

- Instruction Bus Exception
- Instruction TLB Miss Exception
- Instruction Storage Exception
- Illegal Opcode Exception
- Privileged Instruction Exception or Stack Protection Violation Exception
- Data TLB Miss Exception
- Data Storage Exception
Chapter 2: MicroBlaze Architecture

- Unaligned Exception
- Data Bus Exception
- Divide Exception
- FPU Exception
- Stream Exception

Exception Causes

- Stream Exception
  The stream exception (FSL or AXI) is caused by executing a `get` or `getd` instruction with the ‘e’ bit set to ‘1’ when there is a control bit mismatch.

- Instruction Bus Exception
  The instruction bus exception is caused by errors when reading data from memory.
  - The instruction peripheral AXI4 interface (M_AXI_IP) exception is caused by an error response on M_AXI_IP_RRESP.
  - The instruction cache AXI4 interface (M_AXI_IC) is caused by an error response on M_AXI_IC_RRESP. The exception can only occur when C_ICACHE_ALWAYS_USED is set to 1 and the cache is turned off. In all other cases the response is ignored.
  - The instruction Processor Local Bus (PLB) exception is caused by an active error signal from the slave (IPLB_MRdErr) or timeout signal from the arbiter (IPLB_MTimeout).
  - The instructions side local memory (ILMB) can only cause instruction bus exception when C_FAULT_TOLERANT is set to 1, and either an uncorrectable error occurs in the LMB memory, as indicated by the IUE signal, or C_ECC_USE_CE_EXCEPTION is set to 1 and a correctable error occurs in the LMB memory, as indicated by the ICE signal.
  - The CacheLink (IXCL) interfaces cannot cause instruction bus exceptions.

- Illegal Opcode Exception
  The illegal opcode exception is caused by an instruction with an invalid major opcode (bits 0 through 5 of instruction). Bits 6 through 31 of the instruction are not checked. Optional processor instructions are detected as illegal if not enabled. If the optional feature C_OPCODE_0x0_ILLEGAL is enabled, an illegal opcode exception is also caused if the instruction is equal to 0x00000000.

- Data Bus Exception
  The data bus exception is caused by errors when reading data from memory or writing data to memory.
  - The data peripheral AXI4 interface (M_AXI_DP) exception is caused by an error response on M_AXI_DP_RRESP or M_AXI_DP_BRESP.
  - The data cache AXI4 interface (M_AXI_DC) exception is caused by:
    - An error response on M_AXI_DC_RRESP or M_AXI_DC_BRESP,
    - OKAY response on M_AXI_DC_RRESP in case of an exclusive access using LWX.
    The exception can only occur when C_DCACHE_ALWAYS_USED is set to 1 and the cache is turned off, or when an exclusive access using LWX or SWX is performed. In all other cases the response is ignored.
  - The data Processor Local Bus exception is caused by an active error signal from the slave (DPLB_MRdErr or DPLB_MWrErr) or timeout signal from the arbiter (DPLB_MTimeout).
The data side local memory (DLMB) can only cause instruction bus exception when \texttt{C_FAULT_TOLERANT} is set to 1, and either an uncorrectable error occurs in the LMB memory, as indicated by the \texttt{DUE} signal, or \texttt{C_ECC_USE_CE_EXCEPTION} is set to 1 and a correctable error occurs in the LMB memory, as indicated by the \texttt{DCE} signal. An error can occur for all read accesses, and for byte and halfword write accesses.

The CacheLink (DXCL) interfaces cannot cause data bus exceptions.

- **Unaligned Exception**
  The unaligned exception is caused by a word access where the address to the data bus has bits 30 or 31 set, or a half-word access with bit 31 set.

- **Divide Exception**
  The divide exception is caused by an integer division (\texttt{idiv} or \texttt{idivu}) where the divisor is zero, or by a signed integer division (\texttt{idiv}) where overflow occurs (-2147483648 / -1).

- **FPU Exception**
  An FPU exception is caused by an underflow, overflow, divide-by-zero, illegal operation, or denormalized operand occurring with a floating point instruction.
  - **Underflow** occurs when the result is denormalized.
  - **Overflow** occurs when the result is not-a-number (NaN).
  - **The divide-by-zero FPU exception** is caused by the rA operand to \texttt{fdiv} being zero when rB is not infinite.
  - **Illegal operation** is caused by a signaling NaN operand or by illegal infinite or zero operand combinations.

- **Privileged Instruction Exception**
  The Privileged Instruction exception is caused by an attempt to execute a privileged instruction in User Mode.

- **Stack Protection Violation Exception**
  A Stack Protection Violation exception is caused by executing a load or store instruction using the stack pointer (register R1) as rA with an address outside the stack boundaries defined by the special Stack Low and Stack High registers, causing a stack overflow or a stack underflow.

- **Data Storage Exception**
  The Data Storage exception is caused by an attempt to access data in memory that results in a memory-protection violation.

- **Instruction Storage Exception**
  The Instruction Storage exception is caused by an attempt to access instructions in memory that results in a memory-protection violation.

- **Data TLB Miss Exception**
  The Data TLB Miss exception is caused by an attempt to access data in memory, when a valid Translation Look-Aside Buffer entry is not present, and virtual protected mode is enabled.

- **Instruction TLB Miss Exception**
  The Instruction TLB Miss exception is caused by an attempt to access instructions in memory, when a valid Translation Look-Aside Buffer entry is not present, and virtual protected mode is enabled.

Should an Instruction Bus Exception, Illegal Opcode Exception or Data Bus Exception occur when \texttt{C_FAULT_TOLERANT} is set to 1, and an exception is in progress (i.e. MSR[EIP] set and MSR[EE] cleared), the pipeline is halted, and the external signal \texttt{MB_Error} is set.
Equivalent Pseudocode

ESR[DS] ← exception in delay slot
if ESR[DS] then
    BTR ← branch target PC
    if MMU exception then
        if branch preceded by IMM then
            r17 ← PC - 8
        else
            r17 ← PC - 4
    else
        r17 ← invalid value
else if MMU exception then
    if instruction preceded by IMM then
        r17 ← PC - 4
    else
        r17 ← PC
else
    r17 ← PC + 4
PC ← C_BASE_VECTORS + 0x00000020
MSR[EE] ← 0, MSR[EIP]← 1
MSR[UMS] ← MSR[UM], MSR[UM] ← 0, MSR[VMS] ← MSR[VM], MSR[VM] ← 0
ESR[EC] ← exception specific value
ESR[ESS]← exception specific value
EAR ← exception specific value
FSR ← exception specific value
Reservation ← 0

Breaks

There are two kinds of breaks:

- Hardware (external) breaks
- Software (internal) breaks

Hardware Breaks

Hardware breaks are performed by asserting the external break signal (that is, the Ext_BRK and Ext_NM_BRK input ports). On a break, the instruction in the execution stage completes while the instruction in the decode stage is replaced by a branch to the break vector (address C_BASE_VECTORS + 0x18). The break return address (the PC associated with the instruction in the decode stage at the time of the break) is automatically loaded into general purpose register R16. MicroBlaze also sets the Break In Progress (BIP) flag in the Machine Status Register (MSR).

A normal hardware break (that is, the Ext_BRK input port) is only handled when MSR[BIP] and MSR[EIP] are set to 0 (that is, there is no break or exception in progress). The Break In Progress flag disables interrupts. A non-maskable break (that is, the Ext_NM_BRK input port) is always handled immediately.

The BIP bit in the MSR is automatically cleared when executing the RTBD instruction.

The Ext_BRK signal must be kept asserted until the break has occurred, and deasserted before the RTBD instruction is executed. The Ext_NM_BRK signal must only be asserted one clock cycle.
Software Breaks

To perform a software break, use the `brk` and `brki` instructions. Refer to Chapter 5, MicroBlaze Instruction Set Architecture for detailed information on software breaks.

As a special case, when `C_USE_DEBUG` is set, and “`brki rD, 0x18`” is executed, a software breakpoint is signaled to the Xilinx Microprocessor Debugger (XMD) tool, irrespective of the value of `C_BASE_VECTORS`.

Latency

The time it takes MicroBlaze to enter a break service routine from the time the break occurs depends on the instruction currently in the execution stage and the latency to the memory storing the break vector.

Equivalent Pseudocode

```
   r16 ← PC
   PC ← C_BASE_VECTORS + 0x00000018
   MSR[BIP] ← 1
   MSR[UMS] ← MSR[UM], MSR[UM] ← 0, MSR[VMS] ← MSR[VM], MSR[VM] ← 0
   Reservation ← 0
```

Interrupt

MicroBlaze supports one external interrupt source (connected to the Interrupt input port). The processor only reacts to interrupts if the Interrupt Enable (IE) bit in the Machine Status Register (MSR) is set to 1. On an interrupt, the instruction in the execution stage completes while the instruction in the decode stage is replaced by a branch to the interrupt vector. This is either address `C_BASE_VECTORS + 0x10`, or with low-latency interrupt mode, the address supplied by the Interrupt Controller.

The interrupt return address (the PC associated with the instruction in the decode stage at the time of the interrupt) is automatically loaded into general purpose register R14. In addition, the processor also disables future interrupts by clearing the IE bit in the MSR. The IE bit is automatically set again when executing the RTID instruction.

Interrupts are ignored by the processor if either of the break in progress (BIP) or exception in progress (EIP) bits in the MSR are set to 1.

By using the parameter `C_INTERRUPT_IS_EDGE`, the external interrupt can either be set to level-sensitive or edge-sensitive:

- When using level-sensitive interrupts, the Interrupt input must remain set until MicroBlaze has taken the interrupt, and jumped to the interrupt vector. Software must clear the interrupt before returning from the interrupt handler. If not, the interrupt is taken again, as soon as interrupts are enabled when returning from the interrupt handler.

- When using edge-sensitive interrupts, MicroBlaze detects and latches the Interrupt input edge, which means that the input only needs to be asserted one clock cycle. The interrupt input can remain asserted, but must be deasserted at least one clock cycle before a new interrupt can be detected. The latching of an edge sensitive interrupt is independent of the IE bit in MSR. Should an interrupt occur while the IE bit is 0, it will immediately be serviced when the IE bit is set to 1.

Low-latency Interrupt Mode

A low-latency interrupt mode is available, which allows the Interrupt Controller to directly supply the interrupt vector for each individual interrupt (via the Interrupt_Address input port).
Chapter 2: MicroBlaze Architecture

The address of each fast interrupt handler must be passed to the Interrupt Controller when initializing the interrupt system. When a particular interrupt occurs, this address is supplied by the Interrupt Controller, which allows MicroBlaze to directly jump to the handler code.

With this mode, MicroBlaze also directly sends the appropriate interrupt acknowledge to the Interrupt Controller (via the Interrupt_Ack output port), although it is still the responsibility of the Interrupt Service Routine to acknowledge level sensitive interrupts at the source.

To inform the Interrupt Controller of the interrupt handling events, Interrupt_Ack is set to:

- 01 - when MicroBlaze jumps to the interrupt handler code,
- 10 - when the RTID instruction is executed to return from interrupt,
- 11 - when MSR[IE] is changed from 0 to 1, which enables interrupts again.

The Interrupt_Ack output port is active during one clock cycle, and is then reset to 00.

This information allows the Interrupt Controller to acknowledge interrupts appropriately, both for level-sensitive and edge-triggered interrupt.

Latency

The time it takes MicroBlaze to enter an Interrupt Service Routine (ISR) from the time an interrupt occurs, depends on the configuration of the processor and the latency of the memory controller storing the interrupt vectors. If MicroBlaze is configured to have a hardware divider, the largest latency happens when an interrupt occurs during the execution of a division instruction.

With low-latency interrupt mode, the time to enter the ISR is significantly reduced, since the interrupt vector for each individual interrupt is directly supplied by the Interrupt Controller. With compiler support for fast interrupts, there is no need for a common ISR at all. Instead, the ISR for each individual interrupt will be directly called, and the compiler takes care of saving and restoring registers used by the ISR.

Equivalent Pseudocode

```
r14 ← PC
if C_USE_INTERRUPT = 2
   PC ← Interrupt_Address
   Interrupt_Ack ← 01
else
   PC ← C_BASE_VECTORS + 0x00000010
   MSR[IE] ← 0
   MSR[UMS] ← MSR[UM], MSR[UM] ← 0, MSR[VMS] ← MSR[VM], MSR[VM] ← 0
   Reservation ← 0
```

User Vector (Exception)

The user exception vector is located at address 0x8. A user exception is caused by inserting a ‘BRALID Rx, 0x8’ instruction in the software flow. Although Rx could be any general purpose register, Xilinx recommends using R15 for storing the user exception return address, and to use the RTSD instruction to return from the user exception handler.

Pseudocode

```
rx ← PC
PC ← C_BASE_VECTORS + 0x00000008
MSR[UMS] ← MSR[UM], MSR[UM] ← 0, MSR[VMS] ← MSR[VM], MSR[VM] ← 0
Reservation ← 0
```
Instruction Cache

Overview

MicroBlaze can be used with an optional instruction cache for improved performance when executing code that resides outside the LMB address range.

The instruction cache has the following features:

- Direct mapped (1-way associative)
- User selectable cacheable memory address range
- Configurable cache and tag size
- Caching over AXI4 interface (M_AXI_IC) or CacheLink (XCL) interface
- Option to use 4 or 8 word cache-line
- Cache on and off controlled using a bit in the MSR
- Optional WIC instruction to invalidate instruction cache lines
- Optional stream buffers to improve performance by speculatively prefetching instructions
- Optional victim cache to improve performance by saving evicted cache lines
- Optional parity protection that invalidates cache lines if a Block RAM bit error is detected
- Optional data width selection to either use 32 bits, an entire cache line, or 512 bits

General Instruction Cache Functionality

When the instruction cache is used, the memory address space is split into two segments: a cacheable segment and a non-cacheable segment. The cacheable segment is determined by two parameters: C_ICACHE_BASEADDR and C_ICACHE_HIGHADDR. All addresses within this range correspond to the cacheable address segment. All other addresses are non-cacheable.

The cacheable segment size must be $2^N$, where N is a positive integer. The range specified by C_ICACHE_BASEADDR and C_ICACHE_HIGHADDR must comprise a complete power-of-two range, such that range $= 2^N$ and the N least significant bits of C_ICACHE_BASEADDR must be zero.

The cacheable instruction address consists of two parts: the cache address, and the tag address. The MicroBlaze instruction cache can be configured from 64 bytes to 64 kB. This corresponds to a cache address of between 6 and 16 bits. The tag address together with the cache address should match the full address of cacheable memory. When selecting cache sizes below 2 kB, distributed RAM is used to implement the Tag RAM and Instruction RAM. Distributed RAM is always used to implement the Tag RAM, when setting the parameter C_ICACHE_FORCE_TAG_LUTRAM to 1. This parameter is only available with cache sizes 8 kB or 16 kB and less, for 4 or 8 word cache-lines, respectively.

For example: in a MicroBlaze configured with C_ICACHE_BASEADDR= 0x00300000, C_ICACHE_HIGHADDR=0x0030ffff, C_CACHE_BYTE_SIZE=4096, C_ICACHE_LINE_LEN=8, and C_ICACHE_FORCE_TAG_LUTRAM=0; the cacheable memory of 64 kB uses 16 bits of byte address, and the 4 kB cache uses 12 bits of byte address, thus the required address tag width is: 16-12=4 bits. The total number of block RAM primitives required in this configuration is: 2 RAMB16 for storing the 1024 instruction words, and 1 RAMB16 for 128 cache line entries, each consisting of: 4 bits of tag, 8 word-valid bits, 1 line-valid bit. In total 3 RAMB16 primitives.

Figure 2-22, page 72 shows the organization of Instruction Cache.
Instruction Cache Operation

For every instruction fetched, the instruction cache detects if the instruction address belongs to the cacheable segment. If the address is non-cacheable, the cache controller ignores the instruction and lets the M_AXI_IP, PLB or LMB complete the request. If the address is cacheable, a lookup is performed on the tag memory to check if the requested address is currently cached. The lookup is successful if: the word and line valid bits are set, and the tag address matches the instruction address tag segment. On a cache miss, the cache controller requests the new instruction over the instruction AXI4 interface (M_AXI_IC) or instruction CacheLink (IXCL) interface, and waits for the memory controller to return the associated cache line.

With the AXI4 interface, \( C_{ICACHE\_DATA\_WIDTH} \) determines the bus data width, either 32 bits, an entire cache line (128 bits or 256 bits), or 512 bits.

When \( C_{FAULT\_TOLERANT} \) is set to 1, a cache miss also occurs if a parity error is detected in a tag or instruction Block RAM.

Stream Buffers

When stream buffers are enabled, by setting the parameter \( C_{ICACHE\_STREAMS} \) to 1, the cache will speculatively fetch cache lines in advance in sequence following the last requested address, until the stream buffer is full. The stream buffer can hold up to two cache lines. Should the processor subsequently request instructions from a cache line prefetched by the stream buffer, which occurs in linear code, they are immediately available.

The stream buffer often improves performance, since the processor generally has to spend less time waiting for instructions to be fetched from memory.

With the AXI4 interface, \( C_{ICACHE\_DATA\_WIDTH} \) determines the amount of data transferred from the stream buffer each clock cycle, either 32 bits or an entire cache line.

To be able to use instruction cache stream buffers, area optimization must not be enabled.
Victim Cache

The victim cache is enabled by setting the parameter C_ICACHE_VICTIMS to 2, 4 or 8. This defines the number of cache lines that can be stored in the victim cache. Whenever a cache line is evicted from the cache, it is saved in the victim cache. By saving the most recent lines they can be fetched much faster, should the processor request them, thereby improving performance. If the victim cache is not used, all evicted cache lines must be read from memory again when they are needed.

With the AXI4 interface, C_ICACHE_DATA_WIDTH determines the amount of data transferred from/to the victim cache each clock cycle, either 32 bits or an entire cache line.

Note that to be able to use the victim cache, area optimization must not be enabled.

Instruction Cache Software Support

MSR Bit

The ICE bit in the MSR provides software control to enable and disable caches. The contents of the cache are preserved by default when the cache is disabled. You can invalidate cache lines using the WIC instruction or using the hardware debug logic of MicroBlaze.

WIC Instruction

The optional WIC instruction (C_ALLOW_ICACHE_WR=1) is used to invalidate cache lines in the instruction cache from an application. For a detailed description, refer to Chapter 5, MicroBlaze Instruction Set Architecture.

The WIC instruction can also be used together with parity protection to periodically invalidate entries the cache, to avoid accumulating errors.

Data Cache

Overview

MicroBlaze can be used with an optional data cache for improved performance. The cached memory range must not include addresses in the LMB address range. The data cache has the following features:

- Direct mapped (1-way associative)
- Write-through or Write-back
- User selectable cacheable memory address range
- Configurable cache size and tag size
- Caching over AXI4 interface (M_AXI_DC) or CacheLink (XCL) interface
- Option to use 4 or 8 word cache-lines
- Cache on and off controlled using a bit in the MSR
- Optional WDC instruction to invalidate or flush data cache lines
- Optional victim cache with write-back to improve performance by saving evicted cache lines
- Optional parity protection for write-through cache that invalidates cache lines if a Block RAM bit error is detected
- Optional data width selection to either use 32 bits, an entire cache line, or 512 bits
General Data Cache Functionality

When the data cache is used, the memory address space is split into two segments: a cacheable segment and a non-cacheable segment. The cacheable area is determined by two parameters: `C_DCACHE_BASEADDR` and `C_DCACHE_HIGHADDR`. All addresses within this range correspond to the cacheable address space. All other addresses are non-cacheable.

The cacheable segment size must be $2^N$, where N is a positive integer. The range specified by `C_DCACHE_BASEADDR` and `C_DCACHE_HIGHADDR` must comprise a complete power-of-two range, such that range $= 2^N$ and the N least significant bits of `C_DCACHE_BASEADDR` must be zero.

Figure 2-23 shows the Data Cache Organization.

![Data Cache Organization Diagram](image-url)

**Figure 2-23: Data Cache Organization**

The cacheable data address consists of two parts: the cache address, and the tag address. The MicroBlaze data cache can be configured from 64 bytes to 64 kB. This corresponds to a cache address of between 6 and 16 bits. The tag address together with the cache address should match the full address of cacheable memory. When selecting cache sizes below 2 kB, distributed RAM is used to implement the Tag RAM and Data RAM, except that block RAM is always used for the Data RAM when `C_AREA_OPTIMIZED` is set and `C_DCACHE_USE_WRITEBACK` is not set. Distributed RAM is always used to implement the Tag RAM, when setting the parameter `C_DCACHE_FORCE_TAG_LUTRAM` to 1. This parameter is only available with cache sizes 8 kB or 16 kB and less, for 4 or 8 word cache-lines, respectively.

For example, in a MicroBlaze configured with `C_DCACHE_BASEADDR=0x00400000`, `C_DCACHE_HIGHADDR=0x00403fff`, `C_DCACHE_BYTE_SIZE=2048`, `C_DCACHE_LINE_LEN=4`, and `C_DCACHE_FORCE_TAG_LUTRAM=0`; the cacheable memory of 16 kB uses 14 bits of byte address, and the 2 kB cache uses 11 bits of byte address, thus the required address tag width is 14-11=3 bits. The total number of block RAM primitives required in this configuration is 1 RAMB16 for storing the 512 data words, and 1 RAMB16 for 128 cache line entries, each consisting of 3 bits of tag, 4 word-valid bits, 1 line-valid bit. In total, 2 RAMB16 primitives.
Data Cache Operation

The caching policy used by the MicroBlaze data cache, write-back or write-through, is determined by the parameter \texttt{C\_DCACHE\_USE\_WRITEBACK}. When this parameter is set, a write-back protocol is implemented, otherwise write-through is implemented. However, when configured with an MMU (\texttt{C\_USE\_MMU} > 1, \texttt{C\_AREA\_OPTIMIZED} = 0, \texttt{C\_DCACHE\_USE\_WRITEBACK} = 1), the caching policy in virtual mode is determined by the W storage attribute in the TLB entry, whereas write-back is used in real mode.

With the write-back protocol, a store to an address within the cacheable range always updates the cached data. If the target address word is not in the cache (that is, the access is a cache-miss), and the location in the cache contains data that has not yet been written to memory (the cache location is dirty), the old data is written over the data AXI4 interface (M\_AXI\_DC) or the data CacheLink (DXCL) to external memory before updating the cache with the new data. If an entire cache line needs to be written, a burst cache line write is used, otherwise single word writes are used. For byte or halfword stores, in case of a cache miss, the address is first requested over the data AXI4 interface or the data CacheLink, while a word store only updates the cache.

With the write-through protocol, a store to an address within the cacheable range generates an equivalent byte, halfword, or word write over the data AXI4 interface or the data CacheLink to external memory. The write also updates the cached data if the target address word is in the cache (that is, the write is a cache hit). A write cache-miss does not load the associated cache line into the cache.

Provided that the cache is enabled a load from an address within the cacheable range triggers a check to determine if the requested data is currently cached. If it is (that is, on a cache hit) the requested data is retrieved from the cache. If not (that is, on a cache miss) the address is requested over the data AXI4 interface or data CacheLink, and the processor pipeline stalls until the cache line associated to the requested address is returned from the external memory controller.

With the AXI4 interface, \texttt{C\_DCACHE\_DATA\_WIDTH} determines the bus data width, either 32 bits, an entire cache line (128 bits or 256 bits), or 512 bits.

When \texttt{C\_FAULT\_TOLERANT} is set to 1 and write-through protocol is used, a cache miss also occurs if a parity error is detected in the tag or data Block RAM.

Victim Cache

The victim cache is enabled by setting the parameter \texttt{C\_DCACHE\_VICTIMS} to 2, 4 or 8. This defines the number of cache lines that can be stored in the victim cache. Whenever a complete cache line is evicted from the cache, it is saved in the victim cache. By saving the most recent lines they can be fetched much faster, should the processor request them, thereby improving performance. If the victim cache is not used, all evicted cache lines must be read from memory again when they are needed.

With the AXI4 interface, \texttt{C\_DCACHE\_DATA\_WIDTH} determines the amount of data transferred from/to the victim cache each clock cycle, either 32 bits or an entire cache line.

Note that to be able to use the victim cache, write-back must be enabled and area optimization must not be enabled.
Data Cache Software Support

MSR Bit

The DCE bit in the MSR controls whether or not the cache is enabled. When disabling caches the user must ensure that all the prior writes within the cacheable range have been completed in external memory before reading back over M_AXI_DP or PLB. This can be done by writing to a semaphore immediately before turning off caches, and then in a loop poll until it has been written.

The contents of the cache are preserved when the cache is disabled.

WDC Instruction

The optional WDC instruction (C_ALLOW_DCACHE_WR=1) is used to invalidate or flush cache lines in the data cache from an application. For a detailed description, please refer to Chapter 5, MicroBlaze Instruction Set Architecture.

The WDC instruction can also be used together with parity protection to periodically invalidate entries the cache, to avoid accumulating errors.
Floating Point Unit (FPU)

Overview

The MicroBlaze floating point unit is based on the IEEE 754-1985 standard:

- Uses IEEE 754 single precision floating point format, including definitions for infinity, not-a-number (NaN), and zero
- Supports addition, subtraction, multiplication, division, comparison, conversion and square root instructions
- Implements round-to-nearest mode
- Generates sticky status bits for: underflow, overflow, divide-by-zero and invalid operation

For improved performance, the following non-standard simplifications are made:

- Denormalized operands are not supported. A hardware floating point operation on a denormalized number returns a quiet NaN and sets the sticky denormalized operand error bit in FSR; see "Floating Point Status Register (FSR)" on page 34
- A denormalized result is stored as a signed 0 with the underflow bit set in FSR. This method is commonly referred to as Flush-to-Zero (FTZ)
- An operation on a quiet NaN returns the fixed NaN: 0xFFC00000, rather than one of the NaN operands
- Overflow as a result of a floating point operation always returns signed \( \infty \)

Format

An IEEE 754 single precision floating point number is composed of the following three fields:

1. 1-bit sign
2. 8-bit biased exponent
3. 23-bit fraction (a.k.a. mantissa or significand)

The fields are stored in a 32 bit word as defined in Figure 2-24:

```
0 1 9 31
sign exponent fraction
```

Figure 2-24: IEEE 754 Single Precision Format

The value of a floating point number \( v \) in MicroBlaze has the following interpretation:

1. If \( exponent = 255 \) and \( fraction <> 0 \), then \( v = NaN \), regardless of the sign bit
2. If \( exponent = 255 \) and \( fraction = 0 \), then \( v = (-1)^{sign} \times \infty \)
3. If \( 0 < exponent < 255 \), then \( v = (-1)^{sign} \times 2^{(exponent-127)} \times (1.fraction) \)
4. If \( exponent = 0 \) and \( fraction <> 0 \), then \( v = (-1)^{sign} \times 2^{-126} \times (0.fraction) \)
5. If \( exponent = 0 \) and \( fraction = 0 \), then \( v = (-1)^{sign} \times 0 \)

---

1. Numbers that are so close to 0, that they cannot be represented with full precision, that is, any number \( n \) that falls in the following ranges: \((1.17549 \times 10^{-38} > n > 0)\), or \((0 > n > -1.17549 \times 10^{-38})\)
For practical purposes only 3 and 5 are useful, while the others all represent either an error or numbers that can no longer be represented with full precision in a 32 bit format.

**Rounding**

The MicroBlaze FPU only implements the default rounding mode, “Round-to-nearest”, specified in IEEE 754. By definition, the result of any floating point operation should return the nearest single precision value to the infinitely precise result. If the two nearest representable values are equally near, then the one with its least significant bit zero is returned.

**Operations**

All MicroBlaze FPU operations use the processors general purpose registers rather than a dedicated floating point register file, see “General Purpose Registers”.

**Arithmetic**

The FPU implements the following floating point operations:

- addition, fadd
- subtraction, fsub
- multiplication, fmul
- division, fdiv
- square root, fsqrt (available if \( \text{C\_USE\_FPU} = 2, \text{EXTENDED} \))

**Comparison**

The FPU implements the following floating point comparisons:

- compare less-than, fcmp.lt
- compare equal, fcmp.eq
- compare less-or-equal, fcmp.le
- compare greater-than, fcmp.gt
- compare not-equal, fcmp.ne
- compare greater-or-equal, fcmp.ge
- compare unordered, fcmp.un (used for NaN)

**Conversion**

The FPU implements the following conversions (available if \( \text{C\_USE\_FPU} = 2, \text{EXTENDED} \)):

- convert from signed integer to floating point, flt
- convert from floating point to signed integer, fint

**Exceptions**

The floating point unit uses the regular hardware exception mechanism in MicroBlaze. When enabled, exceptions are thrown for all the IEEE standard conditions: underflow, overflow, divide-by-zero, and illegal operation, as well as for the MicroBlaze specific exception: denormalized operand error.

A floating point exception inhibits the write to the destination register (Rd). This allows a floating point exception handler to operate on the uncorrupted register file.
Software Support

The EDK compiler system, based on GCC, provides support for the Floating Point Unit compliant with the MicroBlaze API. Compiler flags are automatically added to the GCC command line based on the type of FPU present in the system, when using XPS or SDK.

All double-precision operations are emulated in software. Be aware that the xil_printf() function does not support floating-point output. The standard C library printf() and related functions do support floating-point output, but will increase the program code size.

Libraries and Binary Compatibility

The EDK compiler system only includes software floating point C runtime libraries. To take advantage of the hardware FPU, the libraries must be recompiled with the appropriate compiler switches.

For all cases where separate compilation is used, it is very important that you ensure the consistency of FPU compiler flags throughout the build.

Operator Latencies

The latencies of the various operations supported by the FPU are listed in Chapter 5, “MicroBlaze Instruction Set Architecture.” The FPU instructions are not pipelined, so only one operation can be ongoing at any time.

C Language Programming

To gain maximum benefit from the FPU without low-level assembly-language programming, it is important to consider how the C compiler will interpret your source code. Very often the same algorithm can be expressed in many different ways, and some are more efficient than others.

Immediate Constants

Floating-point constants in C are double-precision by default. When using a single-precision FPU, careless coding may result in double-precision software emulation routines being used instead of the native single-precision instructions. To avoid this, explicitly specify (by cast or suffix) that immediate constants in your arithmetic expressions are single-precision values.

For example:

```c
float x = 0.0;
...
x += (float)1.0; /* float addition */
x += 1.0F; /* alternative to above */
x += 1.0; /* warning - uses double addition! */
```

Note that the GNU C compiler can be instructed to treat all floating-point constants as single-precision (contrary to the ANSI C standard) by supplying the compiler flag -fsingle-precision-constants.

Avoid unnecessary casting

While conversions between floating-point and integer formats are supported in hardware by the FPU, when C_USE_FPU is set to 2 (Extended), it is still best to avoid them when possible.

The following “bad” example calculates the sum of squares of the integers from 1 to 10 using floating-point representation:
float sum, t;
int i;
sum = 0.0f;
for (i = 1; i <= 10; i++) {
    t = (float)i;
    sum += t * t;
}

The above code requires a cast from an integer to a float on each loop iteration. This can be rewritten as:

float sum, t;
int i;
t = sum = 0.0f;
for (i = 1; i <= 10; i++) {
    t += 1.0f;
    sum += t * t;
}

Note that the compiler is not at liberty to perform this optimization in general, as the two code fragments above may give different results in some cases (for example, very large t).

Square root runtime library function

The standard C runtime math library functions operate using double-precision arithmetic. When using a single-precision FPU, calls to the square root functions (sqrt()) result in inefficient emulation routines being used instead of FPU instructions:

```
#include <math.h>
...
float x=-1.0F;
...
x = sqrt(x); /* uses double precision */
```

Here the math.h header is included to avoid a warning message from the compiler.

When used with single-precision data types, the result is a cast to double, a runtime library call is made (which does not use the FPU) and then a truncation back to float is performed.

The solution is to use the non-ANSI function sqrtf() instead, which operates using single precision and can be carried out using the FPU. For example:

```
#include <math.h>
...
float x=-1.0F;
...
x = sqrtf(x); /* uses single precision */
```

Note that when compiling this code, the compiler flag `-fno-math-errno` (in addition to `-mhard-float` and `-mxl-float-sqrt`) must be used, to ensure that the compiler does not generate unnecessary code to handle error conditions by updating the errno variable.
Stream Link Interfaces

MicroBlaze can be configured with up to 16 Fast Simplex Link (FSL) or AXI4-Stream interfaces, each consisting of one input and one output port. The channels are dedicated uni-directional point-to-point data streaming interfaces. The parameter C_STREAM_INTERCONNECT is used to select FSL or AXI4.

For detailed information on the FSL interface, please refer to the Fast Simplex Link (FSL) Bus data sheet, DS449, in the Xilinx EDK IP Documentation. For detailed information on the AXI4-Stream interface, please refer to the AMBA® 4 AXI4-Stream Protocol Specification, Version 1.0 document.

The interfaces on MicroBlaze are 32 bits wide. A separate bit indicates whether the sent/received word is of control or data type. The get instruction in the MicroBlaze ISA is used to transfer information from a port to a general purpose register. The put instruction is used to transfer data in the opposite direction. Both instructions come in 4 flavors: blocking data, non-blocking data, blocking control, and non-blocking control. For a detailed description of the get and put instructions, please refer to Chapter 5, MicroBlaze Instruction Set Architecture.

Hardware Acceleration

Each link provides a low latency dedicated interface to the processor pipeline. Thus they are ideal for extending the processors execution unit with custom hardware accelerators. A simple example is illustrated in Figure 2-25. The code uses RFSLx to indicate the used link, independent of whether FSL or AXI4-Stream is used.

Example code:

```
// Configure fx
cpuR, RFSLx
// Store operands
put Ra, RFSLx // op 1
put Rb, RFSLx // op 2
// Load result
get Rt, RFSLx
```

![Figure 2-25: Stream Link Used with HW Accelerated Function $f_x$](image)

This method is similar to extending the ISA with custom instructions, but has the benefit of not making the overall speed of the processor pipeline dependent on the custom function. Also, there are no additional requirements on the software tool chain associated with this type of functional extension.
Debug and Trace

Debug Overview

MicroBlaze features a debug interface to support JTAG based software debugging tools (commonly known as BDM or Background Debug Mode debuggers) like the Xilinx Microprocessor Debug (XMD) tool. The debug interface is designed to be connected to the Xilinx Microprocessor Debug Module (MDM) core, which interfaces with the JTAG port of Xilinx FPGAs. Multiple MicroBlaze instances can be interfaced with a single MDM to enable multiprocessor debugging. The debugging features include:

- Configurable number of hardware breakpoints and watchpoints and unlimited software breakpoints
- External processor control enables debug tools to stop, reset, and single step MicroBlaze
- Read from and write to: memory, general purpose registers, and special purpose register, except EAR, EDR, ESR, BTR and PVR0 - PVR11, which can only be read
- Support for multiple processors

Whenever MicroBlaze is halted the `MB_Halted` output signal is set to 1, for example after a breakpoint or watchpoint is hit, after a stop XMD command, or when the `DBG_STOP` input is set. The output is cleared when MicroBlaze execution is resumed by an XMD command.

When the `DBG_STOP` input is set to 1, MicroBlaze will halt after a few instructions. XMD will detect that MicroBlaze has halted, and indicate where the halt occurred. The signal can be used to halt MicroBlaze at any external event, for example when a ChipScope™ logic analyzer is triggered.

The `MB_Halted` signal may be used to trigger a ChipScope logic analyzer, or halt other MicroBlaze cores in a multiprocessor system by connecting the signal to their `DBG_STOP` inputs.

Trace Overview

The MicroBlaze trace interface exports a number of internal state signals for performance monitoring and analysis. Xilinx recommends that users only use the trace interface through Xilinx developed analysis cores. This interface is not guaranteed to be backward compatible in future releases of MicroBlaze.
Fault Tolerance

The fault tolerance features included in MicroBlaze, enabled with C_FAULT_TOLERANT, provide Error Detection for internal block RAMs, and support for Error Detection and Correction (ECC) in LMB block RAMs. When fault tolerance is enabled, all soft errors in block RAMs are detected and corrected, which significantly reduces overall failure intensity.

In addition to protecting block RAM, the FPGA configuration memory also generally needs to be protected. A detailed explanation of this topic, and further references, can be found in the document SEU Strategies for Virtex-5 Devices (XAPP864).

Configuration

Using MicroBlaze Configuration

Fault tolerance can be enabled in the MicroBlaze configuration dialog, on the General page.

After enabling fault tolerance in MicroBlaze, ECC is automatically enabled in the connected LMB BRAM Interface Controllers by the tools, when the system is generated. This means that nothing else needs to be configured to enable fault tolerance and minimal ECC support.

It is possible (albeit not recommended) to manually override ECC support, leaving the LMB BRAM unprotected, by disabling C_ECC in the configuration dialogs of all connected LMB BRAM Interface Controllers. In this case, the internal MicroBlaze block RAM protection is still enabled, since fault tolerance is enabled.

Using LMB BRAM Interface Controller Configuration

As an alternative to the method described above, it is also possible to enable ECC in the configuration dialogs of all connected LMB BRAM Interface Controllers. In this case, fault tolerance is automatically enabled in MicroBlaze by the tools, when the system is generated. This means that nothing else needs to be configured to enable ECC support and MicroBlaze fault tolerance.

ECC must either be enabled or disabled in all Controllers, which is enforced by a DRC.

It is possible to manually override fault tolerance support in MicroBlaze, by explicitly disabling C_FAULT_TOLERANT in the MicroBlaze configuration dialog. This is not recommended, unless no block RAM is used in MicroBlaze, and there is no need to handle bus exceptions from uncorrectable ECC errors.

Features

An overview of all MicroBlaze fault tolerance features is given here. Further details on each feature can be found in the following sections:

- “Instruction Cache Operation”
- “Data Cache Operation”
- “UTLB Management”
- “Branch Target Cache”
- “Instruction Bus Exception”
- “Data Bus Exception”
- “Exception Causes”
Chapter 2: MicroBlaze Architecture

The LMB BRAM Interface Controller v3.00.a or later provides the LMB ECC implementation. For details, including performance and resource utilization, see the IP Processor LMB BRAM Interface Controller (DS452) data-sheet, in the Xilinx EDK IP Documentation.

Instruction and Data Cache Protection

To protect the block RAM in the Instruction and Data Cache, parity is used. When a parity error is detected, the corresponding cache line is invalidated. This forces the cache to reload the correct value from external memory. Parity is checked whenever a cache hit occurs.

Note that this scheme only works for write-through, and thus write-back data cache is not available when fault tolerance is enabled. This is enforced by a DRC.

When new values are written to a block RAM in the cache, parity is also calculated and written. One parity bit is used for the tag, one parity bit for the instruction cache data, and one parity bit for each word in a data cache line.

In many cases, enabling fault tolerance does not increase the required number of cache block RAMs, since spare bits can be used for the parity. Any increase in resource utilization, in particular number of block RAMs, can easily be seen in the MicroBlaze configuration dialog, when enabling fault tolerance.

Memory Management Unit Protection

To protect the block RAM in the MMU Unified Translation Look-Aside Buffer (UTLB), parity is used. When a parity error is detected during an address translation, a TLB miss exception occurs, forcing software to reload the entry.

When a new TLB entry is written using the TLBHI and TLBLO registers, parity is calculated. One parity bit is used for each entry.

Parity is also checked when a UTLB entry is read using the TLBHI and TLBLO registers. When a parity error is detected in this case, the entry is marked invalid by clearing the valid bit.

Enabling fault tolerance does not increase the MMU block RAM size, since a spare bit is available for the parity.

Branch Target Cache Protection

To protect block RAM in the Branch Target Cache, parity is used. When a parity error is detected when looking up a branch target address, the address is ignored, forcing a normal branch.

When a new branch address is written to the Branch Target Cache, parity is calculated. One parity bit is used for each address.

Enabling fault tolerance does not increase the Branch Target Cache block RAM size, since a spare bit is available for the parity.

Exception Handling

With fault tolerance enabled, if an error occurs in LMB block RAM, the LMB BRAM Interface Controller generates error signals on the LMB interface.

If exceptions are enabled in MicroBlaze, by setting the EE bit in the Machine Status Register, the uncorrectable error signal either generates an instruction bus exception or a data bus exception, depending on the affected interface.

Should a bus exception occur when an exception is in progress, MicroBlaze is halted, and the external error signal MB_Error is set. This behavior ensures that it is impossible to execute an instruction corrupted by an uncorrectable error.
Software Support

Scrubbing

To ensure that bit errors are not accumulated in block RAMs, they must be periodically scrubbed. The standalone BSP provides the function `microblaze_scrub()` to perform scrubbing of the entire LMB block RAM and all MicroBlaze internal block RAMs used in a particular configuration. This function is intended to be called periodically from a timer interrupt routine.

The following example code illustrates how this can be done.

```c
#include "xparameters.h"
#include "xtmrctr.h"
#include "xintc.h"
#include "mb_interface.h"

#define SCRUB_PERIOD ...

XIntc InterruptController; /* The Interrupt Controller instance */
XTmrCtr TimerCounterInst; /* The Timer Counter instance */

void MicroBlazeScrubHandler(void *CallBackRef, u8 TmrCtrNumber)
{
    /* Perform other timer interrupt processing here */
    microblaze_scrub();
}

int main (void)
{
    int Status;

    /*
     * Initialize the timer counter so that it's ready to use,
     * specify the device ID that is generated in xparameters.h
     */
    Status = XTmrCtr_Initialize(&TimerCounterInst, TMRCTR_DEVICE_ID);
    if (Status != XST_SUCCESS) {
        return XST_FAILURE;
    }

    /*
     * Connect the timer counter to the interrupt subsystem such that
     * interrupts can occur.
     */
    Status = XIntc_Initialize(&InterruptController, INTC_DEVICE_ID);
    if (Status != XST_SUCCESS) {
        return XST_FAILURE;
    }

    /*
     * Connect a device driver handler that will be called when an
     * interrupt for the device occurs, the device driver handler performs
     * the specific interrupt processing for the device
     */
    Status = XIntc_Connect(&InterruptController, TMRCTR_DEVICE_ID,
                            (XInterruptHandler)XTmrCtr_InterruptHandler,
                            (void *) &TimerCounterInst);
    if (Status != XST_SUCCESS) {
        return XST_FAILURE;
    }
}
```
return XST_FAILURE;
}

/*
* Start the interrupt controller such that interrupts are enabled for
* all devices that cause interrupts, specifying real mode so that the
* timer counter can cause interrupts thru the interrupt controller.
*/
Status = XIntc_Start(&InterruptController, XIN_REAL_MODE);
if (Status != XST_SUCCESS) {
    return XST_FAILURE;
}

/*/n
* Setup the handler for the timer counter that will be called from the
* interrupt context when the timer expires, specify a pointer to the
* timer counter driver instance as the callback reference so the
* handler is able to access the instance data
*/
XTmrCtr_SetHandler(&TimerCounterInst, MicroBlazeScrubHandler,
                   &TimerCounterInst);

/*
* Enable the interrupt of the timer counter so interrupts will occur
* and use auto reload mode such that the timer counter will reload
* itself automatically and continue repeatedly, without this option
* it would expire once only
*/
XTmrCtr_SetOptions(&TimerCounterInst, TIMER_CNTR_0,
                   XTC_INT_MODE_OPTION | XTC_AUTO_RELOAD_OPTION);

/*/n
* Set a reset value for the timer counter such that it will expire
* earlier than letting it roll over from 0, the reset value is loaded
* into the timer counter when it is started
*/
XTmrCtr_SetResetValue(TmrCtrInstancePtr, TmrCtrNumber, SCRUB_PERIOD);

/*/n
* Start the timer counter such that it's incrementing by default,
* then wait for it to timeout a number of times
*/
XTmrCtr_Start(&TimerCounterInst, TIMER_CNTR_0);
...

See the section “Scrubbing” below for further details on how scrubbing is implemented, including
how to calculate the scrubbing rate.

BRAM Driver

The standalone BSP BRAM driver is used to access the ECC registers in the LMB BRAM Interface
Controller, and also provides a comprehensive self test.

By implementing the SDK Xilinx C Project "Peripheral Tests", a self-test example including the
BRAM self test for each LMB BRAM Interface Controller in the system is generated. Depending on
the ECC features enabled in the LMB BRAM Interface Controller, this code will perform all
possible tests of the ECC function.
The self-test example can be found in the standalone BSP BRAM driver source code, typically in the subdirectory `microblaze_0/libsrc/bram_v3_00_a/src/xbram_selftest.c`.

## Scrubbing

### Scrubbing Methods

Scrubbing is performed using specific methods for the different block RAMs:

- Instruction and data caches: All lines in the caches are cyclically invalidated using the WIC and WDC instructions respectively. This forces the cache to reload the cache line from external memory.
- Memory Management Unit UTLB: All entries in the UTLB are cyclically invalidated by writing the TLBHI register with the valid bit cleared.
- Branch Target Cache: The entire BTC is invalided by doing a synchronizing branch, BRI 4.
- LMB block RAM: All addresses in the memory are cyclically read and written, thus correcting any single bit errors on each address.

It is also possible to add interrupts for correctable errors from the LMB BRAM Interface Controllers, and immediately scrub this address in the interrupt handler, although in most cases it only improves reliability slightly.

The failing address can be determined by reading the Correctable Error First Failing Address Register in each of the LMB BRAM Interface Controllers. To be able to generate an interrupt, `C_ECC_STATUS_REGISTERS` must be set to 1 in the connected LMB BRAM Interface Controllers, and to read the failing address `C_CE_FAILING_REGISTERS` must be set to 1.

### Calculating Scrubbing Rate

The scrubbing rate depends on failure intensity and desired reliability.

The approximate equation to determine the LMB memory scrubbing rate is in our case given by

\[
P_W \approx 760 \left( \frac{BER}{SR} \right)^2
\]

where \(P_W\) is the probability of an uncorrectable error in a memory word, \(BER\) is the soft error rate for a single memory bit, and \(SR\) is the Scrubbing Rate.

The soft error rates affecting block RAM for each product family can be found in the [Device Reliability Report (UG116)](https://www.xilinx.com).

### Use Cases

Several of common use cases are described here. These use cases are derived from the [IP Processor LMB BRAM Interface Controller (DS452)](https://www.xilinx.com) data-sheet.

**Minimal**

This system is obtained when enabling fault tolerance in MicroBlaze, without doing any other configuration.

The system is suitable when area constraints are high, and there is no need for testing of the ECC function, or analysis of error frequency and location. No ECC registers are implemented. Single bit errors are corrected by the ECC logic before being passed to MicroBlaze. Uncorrectable errors set an error signal, which generates an exception in MicroBlaze.
Small
This system should be used when it is necessary to monitor error frequency, but there is no need for testing of the ECC function. It is a minimal system with Correctable Error Counter Register added to monitor single bit error rates. If the error rate is too high, the scrubbing rate should be increased to minimize the risk of a single bit error becoming an uncorrectable double bit error. Parameters set are \( C_{ECC} = 1 \) and \( C_{CE\_COUNTER\_WIDTH} = 10 \).

Typical
This system represents a typical use case, where it is required to monitor error frequency, as well as generating an interrupt to immediately correct a single bit error through software. It does not provide support for testing of the ECC function. It is a small system with Correctable Error First Failing registers and Status register added. A single bit error will latch the address for the access into the Correctable Error First Failing Address Register and set the CE_STATUS bit in the ECC Status Register. An interrupt will be generated triggering MicroBlaze to read the failing address and then perform a read followed by a write on the failing address. This will remove the single bit error from the BRAM, thus reducing the risk of the single bit error becoming an uncorrectable double bit error. Parameters set are \( C_{ECC} = 1, \ C_{CE\_COUNTER\_WIDTH} = 10, \ C_{ECC\_STATUS\_REGISTER} = 1 \) and \( C_{CE\_FAILING\_REGISTERS} = 1 \).

Full
This system uses all of the features provided by the LMB BRAM Interface Controller, to enable full error injection capability, as well as error monitoring and interrupt generation. It is a typical system with Uncorrectable Error First Failing registers and Fault Injection registers added. All features are switched on for full control of ECC functionality for system debug or systems with high fault tolerance requirements. Parameters set are \( C_{ECC} = 1, C_{CE\_COUNTER\_WIDTH} = 10, \ C_{ECC\_STATUS\_REGISTER} = 1 \) and \( C_{CE\_FAILING\_REGISTERS} = 1, \ C_{UE\_FAILING\_REGISTERS} = 1 \) and \( C_{FAULT\_INJECT} = 1 \).
Lockstep Operation

Lockstep Operation is a configuration where two or more identical MicroBlaze cores execute the same program. By comparing the outputs of the cores, any tampering attempts, transient faults or permanent hardware faults can be detected.

System Configuration

The parameter `C_LOCKSTEP_SLAVE` is set to one on all slave MicroBlaze cores in the system, except the master (or primary) core. The master core drives all the output signals, and handles the debug functionality. The port `Lockstep_Master_Out` on the master is connected to the port `Lockstep_Slave_In` on the slaves, in order to handle debugging.

The slave cores should not drive any output signals, only receive input signals. This must be ensured by only connecting signals to the input ports of the slaves. For buses this means that each individual input port must be explicitly connected.

The port `Lockstep_Out` on the master and slave cores provide all output signals for comparison. Unless an error occurs, individual signals from each of the cores are identical every clock cycle.

To ensure that lockstep operation works properly, all input signals to the cores must be synchronous. Input signals that may require external synchronization are `Interrupt`, `Reset`, `Mb_Reset`, `Ext_Brk`, and `Ext_Nm_Brk`.

Use Cases

Two common use cases are described here. In addition, lockstep operation provides the basis for implementing triple modular redundancy on MicroBlaze core level.

Tamper Protection

This application represents a high assurance use case, where it is required that the system is tamper-proof. A typically example is a cryptographic application.

The approach involves having two redundant MicroBlaze processors with dedicated local memory and redundant comparators, each in a protected area. The outputs from each processor feed two comparators and each processor receive copies of every input signal.

The redundant MicroBlaze processors are functionally identical and completely independent of each other, without any connecting signals. The only exception is debug logic and associated signals, since it is assumed that debugging is disabled before any productization and certification of the system.

The outputs from the master MicroBlaze core drive the peripherals in the system. All data leaving the protected area pass through inhibitors. Each inhibitor is controlled from its associated comparator.

Each protected area of the design must be implemented in its own partition, using a hierarchical Single Chip Cryptography (SCC) flow. A detailed explanation of this flow, and further references, can be found in the document `Hierarchical Design Methodology Guide (UG748)`.

For Spartan-6 target architectures, the parameter `C_AVOID_PRIMITIVES` must be set to 3 (`BOTH`) in order to follow the SCC flow.

A block diagram of the system is shown in Figure 2-26.
Chapter 2: MicroBlaze Architecture

Error Detection

The error detection use case requires that all transient and permanent faults are detected. This is essential in fail safe and fault tolerant applications, where redundancy is utilized to improve system availability.

In this system two redundant MicroBlaze processors run in lockstep. A comparator is used to signal an error when a mis-match is detected on the outputs of the two processors. Any error immediately causes both processors to halt, preventing further error propagation.

The redundant MicroBlaze processors are functionally identical, except for debug logic and associated signals. The outputs from the master MicroBlaze core drive the peripherals in the system. The slave MicroBlaze core only has inputs connected; all outputs are left open.

The system contains the basic building block for designing a complete fault tolerant application, where one or more additional blocks must be added to provide redundancy.

This use case is illustrated in Figure 2-27.
Figure 2-27: Lockstep Error Detection Application
Chapter 3

MicroBlaze Signal Interface Description

This chapter describes the types of signal interfaces that can be used to connect MicroBlaze™.

Overview

The MicroBlaze core is organized as a Harvard architecture with separate bus interface units for data and instruction accesses. The following four memory interfaces are supported: Local Memory Bus (LMB), the AMBA® AXI4 interface (AXI4), the IBM Processor Local Bus (PLB), and Xilinx CacheLink (XCL). The LMB provides single-cycle access to on-chip dual-port block RAM. The AXI4 and PLB interfaces provide a connection to both on-chip and off-chip peripherals and memory. The CacheLink interface is intended for use with specialized external memory controllers. MicroBlaze also supports up to 16 Fast Simplex Link (FSL) or AXI4-Stream interface ports, each with one master and one slave interface.

Features

MicroBlaze can be configured with the following bus interfaces:

- The AMBA AXI4 Interface (see ARM® AMBA® AXI Protocol Specification, Version 2.0, ARM IHI 0022C), both for peripheral interfaces and cache interfaces.
- A 32-bit version of the PLB V4.6 interface (see IBM's 128-Bit Processor Local Bus Architectural Specifications, Version 4.6).
- LMB provides simple synchronous protocol for efficient block RAM transfers
- FSL or AXI4-Stream provides a fast non-arbitrated streaming communication mechanism
- XCL provides a fast slave-side arbitrated streaming interface between caches and external memory controllers
- Debug interface for use with the Microprocessor Debug Module (MDM) core
- Trace interface for performance analysis
MicroBlaze I/O Overview

The core interfaces shown in Figure 3-1 and the following Table 3-1 are defined as follows:

- **M_AXI_DP**: Peripheral Data Interface, AXI4-Lite or AXI4 interface
- **DPLB**: Data interface, Processor Local Bus
- **DLMB**: Data interface, Local Memory Bus (BRAM only)
- **M_AXI_IP**: Peripheral Instruction interface, AXI4-Lite interface
- **IPLB**: Instruction interface, Processor Local Bus
- **ILMB**: Instruction interface, Local Memory Bus (BRAM only)
- **M0_AXIS..M15_AXIS**: AXI4-Stream interface master direct connection interfaces
- **S0_AXIS..S15_AXIS**: AXI4-Stream interface slave direct connection interfaces
- **MFSL 0..15**: FSL master interfaces
- **DWFSL 0..15**: FSL master direct connection interfaces
- **SFSL0..15**: FSL slave interfaces
- **DRFSL0..15**: FSL slave direct connection interfaces
- **DXCL**: Data side Xilinx CacheLink interface (FSL master/slave pair)
- **M_AXI_DC**: Data side cache AXI4 interface
- **IXCL**: Instruction side Xilinx CacheLink interface (FSL master/slave pair)
- **M_AXI_IC**: Instruction side cache AXI4 interface
- **Core**: Miscellaneous signals for: clock, reset, debug, and trace

![MicroBlaze Core Block Diagram](image-url)
MicroBlaze Processor Reference Guide

Table 3-1: Summary of MicroBlaze Core I/O

<table>
<thead>
<tr>
<th>Signal</th>
<th>Interface</th>
<th>I/O</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>M_AXI_DP_AWID</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Write address ID</td>
</tr>
<tr>
<td>M_AXI_DP_AWADDR</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Write address</td>
</tr>
<tr>
<td>M_AXI_DP_AWLEN</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Burst length</td>
</tr>
<tr>
<td>M_AXI_DP_AWSIZE</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Burst size</td>
</tr>
<tr>
<td>M_AXI_DP_AWBURST</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Burst type</td>
</tr>
<tr>
<td>M_AXI_DP_AWLOCK</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Lock type</td>
</tr>
<tr>
<td>M_AXI_DP_AWCACHE</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Cache type</td>
</tr>
<tr>
<td>M_AXI_DP_AWPROT</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Protection type</td>
</tr>
<tr>
<td>M_AXI_DP_AWQOS</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Quality of Service</td>
</tr>
<tr>
<td>M_AXI_DP_AWVALID</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Write address valid</td>
</tr>
<tr>
<td>M_AXI_DP_AWREADY</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Write address ready</td>
</tr>
<tr>
<td>M_AXI_DP_WDATA</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Write data</td>
</tr>
<tr>
<td>M_AXI_DP_WSTRB</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Write strobes</td>
</tr>
<tr>
<td>M_AXI_DP_WLAST</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Write last</td>
</tr>
<tr>
<td>M_AXI_DP_WVALID</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Write valid</td>
</tr>
<tr>
<td>M_AXI_DP_WREADY</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Write ready</td>
</tr>
<tr>
<td>M_AXI_DP_BID</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Response ID</td>
</tr>
<tr>
<td>M_AXI_DP_BRESP</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Write response</td>
</tr>
<tr>
<td>M_AXI_DP_BVALID</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Write response valid</td>
</tr>
<tr>
<td>M_AXI_DP_BREADY</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Response ready</td>
</tr>
<tr>
<td>M_AXI_DP_ARID</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Read address ID</td>
</tr>
<tr>
<td>M_AXI_DP_ARADDR</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Read address</td>
</tr>
<tr>
<td>M_AXI_DP_ARLEN</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Burst length</td>
</tr>
<tr>
<td>M_AXI_DP_ARSIZE</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Burst size</td>
</tr>
<tr>
<td>M_AXI_DP_ARBURST</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Burst type</td>
</tr>
<tr>
<td>M_AXI_DP_ARLOCK</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Lock type</td>
</tr>
<tr>
<td>M_AXI_DP_ARCACHE</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Cache type</td>
</tr>
<tr>
<td>M_AXI_DP_ARPROT</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Protection type</td>
</tr>
<tr>
<td>M_AXI_DP_ARQOS</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Quality of Service</td>
</tr>
<tr>
<td>M_AXI_DP_ARVALID</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Read address valid</td>
</tr>
<tr>
<td>M_AXI_DP_ARREADY</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Read address ready</td>
</tr>
</tbody>
</table>
### Table 3-1: Summary of MicroBlaze Core I/O (Continued)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Interface</th>
<th>I/O</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>M_AXI_DP_RID</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Read ID tag</td>
</tr>
<tr>
<td>M_AXI_DP_RDATA</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Read data</td>
</tr>
<tr>
<td>M_AXI_DP_RRESP</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Read response</td>
</tr>
<tr>
<td>M_AXI_DP_RLAST</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Read last</td>
</tr>
<tr>
<td>M_AXI_DP_RVALID</td>
<td>M_AXI_DP</td>
<td>I</td>
<td>Slave Read valid</td>
</tr>
<tr>
<td>M_AXI_DP_RREADY</td>
<td>M_AXI_DP</td>
<td>O</td>
<td>Master Read ready</td>
</tr>
<tr>
<td>M_AXI_IP_AWID</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Write address ID</td>
</tr>
<tr>
<td>M_AXI_IP_AWADDR</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Write address</td>
</tr>
<tr>
<td>M_AXI_IP_AWLEN</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Burst length</td>
</tr>
<tr>
<td>M_AXI_IP_AWSIZE</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Burst size</td>
</tr>
<tr>
<td>M_AXI_IP_AWBURST</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Burst type</td>
</tr>
<tr>
<td>M_AXI_IP_AWLOCK</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Lock type</td>
</tr>
<tr>
<td>M_AXI_IP_AWCACHE</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Cache type</td>
</tr>
<tr>
<td>M_AXI_IP_AWPROT</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Protection type</td>
</tr>
<tr>
<td>M_AXI_IP_AWQOS</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Quality of Service</td>
</tr>
<tr>
<td>M_AXI_IP_AWVALID</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Write address valid</td>
</tr>
<tr>
<td>M_AXI_IP_AWREADY</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Write address ready</td>
</tr>
<tr>
<td>M_AXI_IP_WDATA</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Write data</td>
</tr>
<tr>
<td>M_AXI_IP_WSTRB</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Write strobes</td>
</tr>
<tr>
<td>M_AXI_IP_WLAST</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Write last</td>
</tr>
<tr>
<td>M_AXI_IP_WVALID</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Write valid</td>
</tr>
<tr>
<td>M_AXI_IP_WREADY</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Write ready</td>
</tr>
<tr>
<td>M_AXI_IP_BID</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Response ID</td>
</tr>
<tr>
<td>M_AXI_IP_BRESP</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Write response</td>
</tr>
<tr>
<td>M_AXI_IP_BVALID</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Write response valid</td>
</tr>
<tr>
<td>M_AXI_IP_BREADY</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Response ready</td>
</tr>
<tr>
<td>M_AXI_IP_ARID</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Read address ID</td>
</tr>
<tr>
<td>M_AXI_IP_ARADDR</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Read address</td>
</tr>
<tr>
<td>M_AXI_IP_ARLEN</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Burst length</td>
</tr>
<tr>
<td>M_AXI_IP_ARSIZE</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Burst size</td>
</tr>
<tr>
<td>M_AXI_IP_ARBURST</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Burst type</td>
</tr>
<tr>
<td>M_AXI_IP_ARLOCK</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Lock type</td>
</tr>
</tbody>
</table>
Table 3-1:  Summary of MicroBlaze Core I/O (Continued)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Interface</th>
<th>I/O</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>M_AXI_IP_ARCACHE</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Cache type</td>
</tr>
<tr>
<td>M_AXI_IP_ARPROT</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Protection type</td>
</tr>
<tr>
<td>M_AXI_IP_ARQOS</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Quality of Service</td>
</tr>
<tr>
<td>M_AXI_IP_ARVALID</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Read address valid</td>
</tr>
<tr>
<td>M_AXI_IP_AREADY</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Read address ready</td>
</tr>
<tr>
<td>M_AXI_IP_RID</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Read ID tag</td>
</tr>
<tr>
<td>M_AXI_IP_RDATA</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Read data</td>
</tr>
<tr>
<td>M_AXI_IP_RRESP</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Read response</td>
</tr>
<tr>
<td>M_AXI_IP_RLAST</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Read last</td>
</tr>
<tr>
<td>M_AXI_IP_RVALID</td>
<td>M_AXI_IP</td>
<td>I</td>
<td>Slave Read valid</td>
</tr>
<tr>
<td>M_AXI_IP_RREADY</td>
<td>M_AXI_IP</td>
<td>O</td>
<td>Master Read ready</td>
</tr>
<tr>
<td>M_AXI_DC_AWADDR</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Write address</td>
</tr>
<tr>
<td>M_AXI_DC_AWLEN</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Burst length</td>
</tr>
<tr>
<td>M_AXI_DC_AWSIZE</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Burst size</td>
</tr>
<tr>
<td>M_AXI_DC_AWBURST</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Burst type</td>
</tr>
<tr>
<td>M_AXI_DC_AWLOCK</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Lock type</td>
</tr>
<tr>
<td>M_AXI_DC_AWCACHE</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Cache type</td>
</tr>
<tr>
<td>M_AXI_DC_AWPROT</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Protection type</td>
</tr>
<tr>
<td>M_AXI_DC_AWQOS</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Quality of Service</td>
</tr>
<tr>
<td>M_AXI_DC_AWVALID</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Write address valid</td>
</tr>
<tr>
<td>M_AXI_DC_AWREADY</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Write address ready</td>
</tr>
<tr>
<td>M_AXI_DC_AWUSER</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Write address user signals</td>
</tr>
<tr>
<td>M_AXI_DC_WDATA</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Write data</td>
</tr>
<tr>
<td>M_AXI_DC_WSTRB</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Write strobes</td>
</tr>
<tr>
<td>M_AXI_DC_WLAST</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Write last</td>
</tr>
<tr>
<td>M_AXI_DC_WVALID</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Write valid</td>
</tr>
<tr>
<td>M_AXI_DC_WREADY</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Write ready</td>
</tr>
<tr>
<td>M_AXI_DC_WUSER</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Write user signals</td>
</tr>
<tr>
<td>M_AXI_DC_BRESP</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Write response</td>
</tr>
<tr>
<td>M_AXI_DC_BID</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Response ID</td>
</tr>
<tr>
<td>M_AXI_DC_BVALID</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Write response valid</td>
</tr>
<tr>
<td>M_AXI_DC_BREADY</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Response ready</td>
</tr>
</tbody>
</table>
Table 3-1: Summary of MicroBlaze Core I/O (Continued)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Interface</th>
<th>I/O</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>M_AXI_DC_BUSER</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Write response user signals</td>
</tr>
<tr>
<td>M_AXI_DC_ARID</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Read address ID</td>
</tr>
<tr>
<td>M_AXI_DC_AADDR</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Read address</td>
</tr>
<tr>
<td>M_AXI_DC_ARLEN</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Burst length</td>
</tr>
<tr>
<td>M_AXI_DC_ARSIZE</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Burst size</td>
</tr>
<tr>
<td>M_AXI_DC_ARBURST</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Burst type</td>
</tr>
<tr>
<td>M_AXI_DC_ARLOCK</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Lock type</td>
</tr>
<tr>
<td>M_AXI_DC_ARCACHE</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Cache type</td>
</tr>
<tr>
<td>M_AXI_DC_ARPROT</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Protection type</td>
</tr>
<tr>
<td>M_AXI_DC_ARQOS</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Quality of Service</td>
</tr>
<tr>
<td>M_AXI_DC_ARVALID</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Read address valid</td>
</tr>
<tr>
<td>M_AXI_DC_ARREADY</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Read address ready</td>
</tr>
<tr>
<td>M_AXI_DC_ARUSER</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Read address user signals</td>
</tr>
<tr>
<td>M_AXI_DC_RID</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Read ID tag</td>
</tr>
<tr>
<td>M_AXI_DC_RDATA</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Read data</td>
</tr>
<tr>
<td>M_AXI_DC_RRESP</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Read response</td>
</tr>
<tr>
<td>M_AXI_DC_RLAST</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Read last</td>
</tr>
<tr>
<td>M_AXI_DC_RVALID</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Read valid</td>
</tr>
<tr>
<td>M_AXI_DC_RREADY</td>
<td>M_AXI_DC</td>
<td>O</td>
<td>Master Read ready</td>
</tr>
<tr>
<td>M_AXI_DC_RUSER</td>
<td>M_AXI_DC</td>
<td>I</td>
<td>Slave Read user signals</td>
</tr>
<tr>
<td>M_AXI_IC_AWID</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Write address ID</td>
</tr>
<tr>
<td>M_AXI_IC_AWADDR</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Write address</td>
</tr>
<tr>
<td>M_AXI_IC_AWLEN</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Burst length</td>
</tr>
<tr>
<td>M_AXI_IC_AWSIZE</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Burst size</td>
</tr>
<tr>
<td>M_AXI_IC_AWBURST</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Burst type</td>
</tr>
<tr>
<td>M_AXI_IC_AWLOCK</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Lock type</td>
</tr>
<tr>
<td>M_AXI_IC_AWCACHE</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Cache type</td>
</tr>
<tr>
<td>M_AXI_IC_AWPROT</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Protection type</td>
</tr>
<tr>
<td>M_AXI_IC_AWQOS</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Quality of Service</td>
</tr>
<tr>
<td>M_AXI_IC_AWVALID</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Write address valid</td>
</tr>
<tr>
<td>M_AXI_IC_AWREADY</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Write address ready</td>
</tr>
<tr>
<td>M_AXI_IC_AWUSER</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Write address user signals</td>
</tr>
</tbody>
</table>
### Table 3-1: Summary of MicroBlaze Core I/O (Continued)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Interface</th>
<th>I/O</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>M_AXI_IC_WDATA</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Write data</td>
</tr>
<tr>
<td>M_AXI_IC_WSTRB</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Write strobes</td>
</tr>
<tr>
<td>M_AXI_IC_WLAST</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Write last</td>
</tr>
<tr>
<td>M_AXI_IC_WVALID</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Write valid</td>
</tr>
<tr>
<td>M_AXI_IC_WREADY</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Write ready</td>
</tr>
<tr>
<td>M_AXI_IC_WUSER</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Write user signals</td>
</tr>
<tr>
<td>M_AXI_IC_BID</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Response ID</td>
</tr>
<tr>
<td>M_AXI_IC_BRESP</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Write response</td>
</tr>
<tr>
<td>M_AXI_IC_BVALID</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Write response valid</td>
</tr>
<tr>
<td>M_AXI_IC_BREADY</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Response ready</td>
</tr>
<tr>
<td>M_AXI_IC_BUSER</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Write response user signals</td>
</tr>
<tr>
<td>M_AXI_IC_ARID</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Read address ID</td>
</tr>
<tr>
<td>M_AXI_IC_ARADDR</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Read address</td>
</tr>
<tr>
<td>M_AXI_IC_ARLEN</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Burst length</td>
</tr>
<tr>
<td>M_AXI_IC_ARSIZE</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Burst size</td>
</tr>
<tr>
<td>M_AXI_IC_ARBURST</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Burst type</td>
</tr>
<tr>
<td>M_AXI_IC_ARLOCK</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Lock type</td>
</tr>
<tr>
<td>M_AXI_IC_ARCACHE</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Cache type</td>
</tr>
<tr>
<td>M_AXI_IC_ARPROT</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Protection type</td>
</tr>
<tr>
<td>M_AXI_IC_ARQOS</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Quality of Service</td>
</tr>
<tr>
<td>M_AXI_IC_ARVALID</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Read address valid</td>
</tr>
<tr>
<td>M_AXI_IC_ARREADY</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Read address ready</td>
</tr>
<tr>
<td>M_AXI_IC_ARUSER</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Read address user signals</td>
</tr>
<tr>
<td>M_AXI_IC_RID</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Read ID tag</td>
</tr>
<tr>
<td>M_AXI_IC_RDATA</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Read data</td>
</tr>
<tr>
<td>M_AXI_IC_RRESP</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Read response</td>
</tr>
<tr>
<td>M_AXI_IC_RLAST</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Read last</td>
</tr>
<tr>
<td>M_AXI_IC_RVALID</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Read valid</td>
</tr>
<tr>
<td>M_AXI_IC_RREADY</td>
<td>M_AXI_IC</td>
<td>O</td>
<td>Master Read ready</td>
</tr>
<tr>
<td>M_AXI_IC_RUSER</td>
<td>M_AXI_IC</td>
<td>I</td>
<td>Slave Read user signals</td>
</tr>
<tr>
<td>DPLB_M_ABORT</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB abort bus request indicator</td>
</tr>
</tbody>
</table>

**Table 3-1:** Summary of MicroBlaze Core I/O (Continued)
Table 3-1: Summary of MicroBlaze Core I/O (Continued)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Interface</th>
<th>I/O</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DPLB_M_ABus</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB address bus</td>
</tr>
<tr>
<td>DPLB_M_UABus</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB upper address bus</td>
</tr>
<tr>
<td>DPLB_M_BE</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB byte enables</td>
</tr>
<tr>
<td>DPLB_M_busLock</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB bus lock</td>
</tr>
<tr>
<td>DPLB_M_lockErr</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB lock error indicator</td>
</tr>
<tr>
<td>DPLB_M_MSize</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB master data bus size</td>
</tr>
<tr>
<td>DPLB_M_priority</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB bus request priority</td>
</tr>
<tr>
<td>DPLB_M_rdBurst</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB burst read transfer indicator</td>
</tr>
<tr>
<td>DPLB_M_request</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB bus request</td>
</tr>
<tr>
<td>DPLB_M_RNW</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB read/not write</td>
</tr>
<tr>
<td>DPLB_M_size</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB transfer size</td>
</tr>
<tr>
<td>DPLB_M_TAttribute</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB Transfer Attribute bus</td>
</tr>
<tr>
<td>DPLB_M_type</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB transfer type</td>
</tr>
<tr>
<td>DPLB_M_wrBurst</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB burst write transfer indicator</td>
</tr>
<tr>
<td>DPLB_M_wrDBus</td>
<td>DPLB</td>
<td>O</td>
<td>Data Interface PLB write data bus</td>
</tr>
<tr>
<td>DPLB_MBusy</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB slave busy indicator</td>
</tr>
<tr>
<td>DPLB_MRdErr</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB slave read error indicator</td>
</tr>
<tr>
<td>DPLB_MWrErr</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB slave write error indicator</td>
</tr>
<tr>
<td>DPLB_MIRQ</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB slave interrupt indicator</td>
</tr>
<tr>
<td>DPLB_MWrBTerm</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB terminate write burst indicator</td>
</tr>
<tr>
<td>DPLB_MWrDAck</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB write data acknowledge</td>
</tr>
<tr>
<td>DPLB_MAaddrAck</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB address acknowledge</td>
</tr>
<tr>
<td>DPLB_MRdBTerm</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB terminate read burst indicator</td>
</tr>
<tr>
<td>DPLB_MRdDAck</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB read data acknowledge</td>
</tr>
<tr>
<td>DPLB_MRDDBus</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB read data bus</td>
</tr>
<tr>
<td>DPLB_MRdWdAddr</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB read word address</td>
</tr>
<tr>
<td>DPLB_MRearbitrate</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB bus rearbitrate indicator</td>
</tr>
<tr>
<td>DPLB_MSSize</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB slave data bus size</td>
</tr>
<tr>
<td>DPLB_MTimeout</td>
<td>DPLB</td>
<td>I</td>
<td>Data Interface PLB bus timeout</td>
</tr>
</tbody>
</table>
### Table 3-1: Summary of MicroBlaze Core I/O (Continued)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Interface</th>
<th>I/O</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>IPLB_M_ABort</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB abort bus request indicator</td>
</tr>
<tr>
<td>IPLB_M_ABus</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB address bus</td>
</tr>
<tr>
<td>IPLB_M_UABus</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB upper address bus</td>
</tr>
<tr>
<td>IPLB_M_BE</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB byte enables</td>
</tr>
<tr>
<td>IPLB_M_busLock</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB bus lock</td>
</tr>
<tr>
<td>IPLB_M_lockErr</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB lock error indicator</td>
</tr>
<tr>
<td>IPLB_M_MSize</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB master data bus size</td>
</tr>
<tr>
<td>IPLB_M_priority</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB bus request priority</td>
</tr>
<tr>
<td>IPLB_M_rdBurst</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB burst read transfer indicator</td>
</tr>
<tr>
<td>IPLB_M_request</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB bus request</td>
</tr>
<tr>
<td>IPLB_M_RNW</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB read/not write</td>
</tr>
<tr>
<td>IPLB_M_size</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB transfer size</td>
</tr>
<tr>
<td>IPLB_M_TAttribute</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB Transfer Attribute bus</td>
</tr>
<tr>
<td>IPLB_M_type</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB transfer type</td>
</tr>
<tr>
<td>IPLB_M_wrBurst</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB burst write transfer indicator</td>
</tr>
<tr>
<td>IPLB_M_wrDBus</td>
<td>IPLB</td>
<td>O</td>
<td>Instruction Interface PLB write data bus</td>
</tr>
<tr>
<td>IPLB_MBusy</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB slave busy indicator</td>
</tr>
<tr>
<td>IPLB_MRdErr</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB slave read error indicator</td>
</tr>
<tr>
<td>IPLB_MWrErr</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB slave write error indicator</td>
</tr>
<tr>
<td>IPLB_MIRQ</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB slave interrupt indicator</td>
</tr>
<tr>
<td>IPLB_MWrBTerm</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB terminate write burst indicator</td>
</tr>
<tr>
<td>IPLB_MWrDAck</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB write data acknowledge</td>
</tr>
<tr>
<td>IPLB_MAddrAck</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB address acknowledge</td>
</tr>
</tbody>
</table>
### Table 3-1: Summary of MicroBlaze Core I/O (Continued)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Interface</th>
<th>I/O</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>IPLB_MRdBTerm</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB terminate read burst indicator</td>
</tr>
<tr>
<td>IPLB_MRdDAck</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB read data acknowledge</td>
</tr>
<tr>
<td>IPLB_MRdDBus</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB read data bus</td>
</tr>
<tr>
<td>IPLB_MRdWdAddr</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB read word address</td>
</tr>
<tr>
<td>IPLB_MRearbitrate</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB bus rearbitrate indicator</td>
</tr>
<tr>
<td>IPLB_MSSize</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB slave data bus size</td>
</tr>
<tr>
<td>IPLB_MTimeout</td>
<td>IPLB</td>
<td>I</td>
<td>Instruction Interface PLB bus timeout</td>
</tr>
<tr>
<td>Data_Addr[0:31]</td>
<td>DLMB</td>
<td>O</td>
<td>Data interface LMB address bus</td>
</tr>
<tr>
<td>Byte_Enable[0:3]</td>
<td>DLMB</td>
<td>O</td>
<td>Data interface LMB byte enables</td>
</tr>
<tr>
<td>Data_Write[0:31]</td>
<td>DLMB</td>
<td>O</td>
<td>Data interface LMB write data bus</td>
</tr>
<tr>
<td>D_AS</td>
<td>DLMB</td>
<td>O</td>
<td>Data interface LMB address strobe</td>
</tr>
<tr>
<td>Read_Strobe</td>
<td>DLMB</td>
<td>O</td>
<td>Data interface LMB read strobe</td>
</tr>
<tr>
<td>Write_Strobe</td>
<td>DLMB</td>
<td>O</td>
<td>Data interface LMB write strobe</td>
</tr>
<tr>
<td>Data_Read[0:31]</td>
<td>DLMB</td>
<td>I</td>
<td>Data interface LMB read data bus</td>
</tr>
<tr>
<td>DReady</td>
<td>DLMB</td>
<td>I</td>
<td>Data interface LMB data ready</td>
</tr>
<tr>
<td>DWait</td>
<td>DLMB</td>
<td>I</td>
<td>Data interface LMB data wait</td>
</tr>
<tr>
<td>DCE</td>
<td>DLMB</td>
<td>I</td>
<td>Data interface LMB correctable error</td>
</tr>
<tr>
<td>DUE</td>
<td>DLMB</td>
<td>I</td>
<td>Data interface LMB uncorrectable error</td>
</tr>
<tr>
<td>Instr_Addr[0:31]</td>
<td>ILMB</td>
<td>O</td>
<td>Instruction interface LMB address bus</td>
</tr>
<tr>
<td>I_AS</td>
<td>ILMB</td>
<td>O</td>
<td>Instruction interface LMB address strobe</td>
</tr>
<tr>
<td>IFetch</td>
<td>ILMB</td>
<td>O</td>
<td>Instruction interface LMB instruction fetch</td>
</tr>
<tr>
<td>Instr[0:31]</td>
<td>ILMB</td>
<td>I</td>
<td>Instruction interface LMB read data bus</td>
</tr>
<tr>
<td>IReady</td>
<td>ILMB</td>
<td>I</td>
<td>Instruction interface LMB data ready</td>
</tr>
<tr>
<td>IWait</td>
<td>ILMB</td>
<td>I</td>
<td>Instruction interface LMB data wait</td>
</tr>
<tr>
<td>ICE</td>
<td>ILMB</td>
<td>I</td>
<td>Instruction interface LMB correctable error</td>
</tr>
<tr>
<td>IUE</td>
<td>ILMB</td>
<td>I</td>
<td>Instruction interface LMB uncorrectable error</td>
</tr>
<tr>
<td>Mn_AXIS_TLAST</td>
<td>M0_AXIS..</td>
<td>O</td>
<td>Master interface output AXI4 channels write last</td>
</tr>
<tr>
<td>Mn_AXIS_TDATA</td>
<td>M0_AXIS..</td>
<td>O</td>
<td>Master interface output AXI4 channels write data</td>
</tr>
</tbody>
</table>
## MicroBlaze I/O Overview

### Table 3-1: Summary of MicroBlaze Core I/O (Continued)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Interface</th>
<th>I/O</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Mn_AXIS_TVALID</td>
<td>M0_AXIS.. M15_AXIS</td>
<td>O</td>
<td>Master interface output AXI4 channels write valid</td>
</tr>
<tr>
<td>Mn_AXIS_TREADY</td>
<td>M0_AXIS.. M15_AXIS</td>
<td>I</td>
<td>Master interface input AXI4 channels write ready</td>
</tr>
<tr>
<td>Sn_AXIS_TLAST</td>
<td>S0_AXIS.. S15_AXIS</td>
<td>I</td>
<td>Slave interface input AXI4 channels write last</td>
</tr>
<tr>
<td>Sn_AXIS_TDATA</td>
<td>S0_AXIS.. S15_AXIS</td>
<td>I</td>
<td>Slave interface input AXI4 channels write data</td>
</tr>
<tr>
<td>Sn_AXIS_TVALID</td>
<td>S0_AXIS.. S15_AXIS</td>
<td>I</td>
<td>Slave interface input AXI4 channels write valid</td>
</tr>
<tr>
<td>Sn_AXIS_TREADY</td>
<td>S0_AXIS.. S15_AXIS</td>
<td>O</td>
<td>Slave interface output AXI4 channels write ready</td>
</tr>
<tr>
<td>FSL0_M .. FSL15_M</td>
<td>MFSL, or DWFSL</td>
<td>O</td>
<td>Master interface to output FSL channels MFSL is used for FSL bus connections, whereas DWFSL is used for direct connections with FSL slaves</td>
</tr>
<tr>
<td>FSL0_S .. FSL15_S</td>
<td>SFSL, or DRFSL</td>
<td>I</td>
<td>Slave interface to input FSL channels SFSL is used for FSL bus connections, whereas DRFSL is used for direct connections with FSL masters</td>
</tr>
<tr>
<td>ICache_FSL_in...</td>
<td>IXCL_S</td>
<td>IO</td>
<td>Instruction side CacheLink FSL slave interface</td>
</tr>
<tr>
<td>ICache_FSL_out...</td>
<td>IXCL_M</td>
<td>IO</td>
<td>Instruction side CacheLink FSL master interface</td>
</tr>
<tr>
<td>DCache_FSL_in...</td>
<td>DXCL_S</td>
<td>IO</td>
<td>Data side CacheLink FSL slave interface</td>
</tr>
<tr>
<td>DCache_FSL_out...</td>
<td>DXCL_M</td>
<td>IO</td>
<td>Data side CacheLink FSL master interface</td>
</tr>
<tr>
<td>Interrupt</td>
<td>Core</td>
<td>I</td>
<td>Interrupt</td>
</tr>
<tr>
<td>Interrupt_Address¹</td>
<td>Core</td>
<td>I</td>
<td>Interrupt vector address</td>
</tr>
<tr>
<td>Interrupt_Ack¹</td>
<td>Core</td>
<td>O</td>
<td>Interrupt acknowledge</td>
</tr>
<tr>
<td>Reset²</td>
<td>Core</td>
<td>I</td>
<td>Core reset, active high. Should be held for at least 1 Clk clock cycle.</td>
</tr>
<tr>
<td>MB_Reset²</td>
<td>Core</td>
<td>I</td>
<td>Core reset, active high. Should be held for at least 1 Clk clock cycle.</td>
</tr>
<tr>
<td>Clk</td>
<td>Core</td>
<td>I</td>
<td>Clock³</td>
</tr>
<tr>
<td>Ext_BRK</td>
<td>Core</td>
<td>I</td>
<td>Break signal from MDM</td>
</tr>
<tr>
<td>Ext_NM_BRK</td>
<td>Core</td>
<td>I</td>
<td>Non-maskable break signal from MDM</td>
</tr>
<tr>
<td>MB_Halted</td>
<td>Core</td>
<td>O</td>
<td>Pipeline is halted, either via the Debug Interface or by setting Dbg_Stop</td>
</tr>
</tbody>
</table>
**Table 3-1: Summary of MicroBlaze Core I/O (Continued)**

<table>
<thead>
<tr>
<th>Signal</th>
<th>Interface</th>
<th>I/O</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Dbg_Stop</td>
<td>Core</td>
<td>I</td>
<td>Unconditionally force pipeline to halt as soon as possible. Rising-edge detected pulse that should be held for at least 1 Clk clock cycle. The signal only has any effect when C_DEBUG_ENABLED is set to 1.</td>
</tr>
<tr>
<td>MB_Error</td>
<td>Core</td>
<td>O</td>
<td>Pipeline is halted due to a missed exception, when C_FAULT_TOLERANT is set to 1.</td>
</tr>
<tr>
<td>Sleep</td>
<td>Core</td>
<td>O</td>
<td>MicroBlaze is in sleep mode after executing a SLEEP instruction, all external accesses are completed, and the pipeline is halted.</td>
</tr>
<tr>
<td>Wakeup[0:1]</td>
<td>Core</td>
<td>I</td>
<td>Wake MicroBlaze from sleep mode when either or both bits are set to 1. Ignored if MicroBlaze is not in sleep mode.</td>
</tr>
<tr>
<td>Dbg_Wakeup</td>
<td>Core</td>
<td>O</td>
<td>Debug request that external logic should wake MicroBlaze from sleep mode with the Wakeup signal.</td>
</tr>
<tr>
<td>Lockstep_...</td>
<td>Core</td>
<td>IO</td>
<td>Lockstep signals for high integrity applications. See Table 3-12 for details.</td>
</tr>
<tr>
<td>Dbg_...</td>
<td>Core</td>
<td>IO</td>
<td>Debug signals from MDM. See Table 3-14 for details.</td>
</tr>
<tr>
<td>Trace_...</td>
<td>Core</td>
<td>O</td>
<td>Trace signals for real time HW analysis. See Table 3-15 for details.</td>
</tr>
</tbody>
</table>

1. Only used with C_USE_INTERRUPT = 2, for low-latency interrupt support.
2. The Reset and MB_Reset signals are functionally equivalent. MB_Reset is intended for the AXI4 and PLB interfaces.
3. MicroBlaze is a synchronous design clocked with the Clk signal, except for hardware debug logic, which is clocked with the Dbg_Clk signal. If hardware debug logic is not used, there is no minimum frequency limit for Clk. However, if hardware debug logic is used, there are signals transferred between the two clock regions. In this case Clk must have a higher frequency than Dbg_Clk.

**AXI4 Interface Description**

**Memory Mapped Interfaces**

**Peripheral Interfaces**

The MicroBlaze AXI4 memory mapped peripheral interfaces are implemented as 32-bit masters. Each of these interfaces only have a single outstanding transaction at any time, and all transactions are completed in order.

- The instruction peripheral interface (M_AXI_IP) only performs single word read accesses, and is always set to use the AXI4-Lite subset.
- The data peripheral interface (M_AXI_DP) performs single word accesses, and is set to use the AXI4-Lite subset as default, but is set to use AXI4 when enabling exclusive access for LWX and SWX instructions. Halfword and byte writes are performed by setting the appropriate byte strobes.
Cache Interfaces

The AXI4 memory mapped cache interfaces are implemented either as AXI4 32-bit, 128-bit, 256-bit, or 512-bit masters, depending on cache line length and data width parameters.

- With a 32-bit master, the instruction cache interface (M_AXI_IC) performs 4 word or 8 word burst read accesses, depending on cache line length. With 128-bit, 256-bit, or 512-bit masters, only single read accesses are performed.

  This interface can have multiple outstanding transactions, issuing up to 2 transactions or up to 5 transactions when stream cache is enabled. The stream cache can request two cache lines in advance, which means that in some cases 5 outstanding transactions can occur. When stream cache is enabled, C_INTERCONNECT_M_AXI_IC_READ_ISSUING is set to 8, since it must be a power of two.

  How memory locations are accessed depend on the parameter C_ICACHE_ALWAYS_USED. If the parameter is 1, the cached memory range is always accessed via the AXI4 cache interface. If the parameter is 0, the cached memory range is accessed over the AXI4 peripheral interface when the caches are software disabled (that is, MSR[ICE]=0).

- With a 32-bit master, the data cache interface (M_AXI_DC) performs single word accesses, as well as 4 word or 8 word burst accesses, depending on cache line length. Burst write accesses are only performed when using write-back cache. With 128-bit, 256-bit, or 512-bit masters, only single accesses are performed.

  This interface can have multiple outstanding transactions, either issuing up to 2 transactions when reading, or up to 32 transactions when writing. MicroBlaze ensures that all outstanding writes are completed before a read is issued, since the processor must maintain an ordered memory model but AXI has separate read/write channels without any ordering. Using up to 32 outstanding write transactions improves performance, since it allows multiple writes to proceed without stalling the pipeline.

  Word, halfword and byte writes are performed by setting the appropriate byte strobes.

  Exclusive accesses can be enabled for LWX and SWX instructions.

  How memory locations are accessed depend on the parameter C_DCACHE_ALWAYS_USED. If the parameter is 1, the cached memory range is always accessed via the AXI4 cache interface. If the parameter is 0, the cached memory range is accessed over the AXI4 peripheral interface when the caches are software disabled (that is, MSR[DCE]=0).
Interface Parameters

The relationship between MicroBlaze parameter settings and AXI4 interface behavior for tool-assigned parameters is summarized in Table 3-2.

Table 3-2: AXI Memory Mapped Interface Parameters

<table>
<thead>
<tr>
<th>Interface</th>
<th>Parameter</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>M_AXI_DP</td>
<td>C_M_AXI_DP_PROTOCOL</td>
<td>AXI4-Lite: Default. AXI4: Used to allow exclusive access when C_M_AXI_DP_EXCLUSIVE_ACCESS is 1.</td>
</tr>
<tr>
<td>M_AXI_IC</td>
<td>C_M_AXI_IC_DATA_WIDTH</td>
<td>32: Default, single word accesses and burst accesses with C_ICACHE_LINE_LEN word bursts used. 128: Used when C_ICACHE_DATA_WIDTH is set to 1 and C_ICACHE_LINE_LEN is set to 4. Only single accesses can occur. 256: Used when C_ICACHE_DATA_WIDTH is set to 1 and C_ICACHE_LINE_LEN is set to 8. Only single accesses can occur. 512: Used when C_ICACHE_DATA_WIDTH is set to 2. Only single accesses can occur.</td>
</tr>
<tr>
<td>M_AXI_DC</td>
<td>C_M_AXI_DC_DATA_WIDTH</td>
<td>32: Default, single word accesses and burst accesses with C_DCACHE_LINE_LEN word bursts used. Write bursts are only used when C_DCACHE_USE_WRITEBACK is set to 1. 128: Used when C_DCACHE_DATA_WIDTH is set to 1 and C_DCACHE_LINE_LEN is set to 4. Only single accesses can occur. 256: Used when C_DCACHE_DATA_WIDTH is set to 1 and C_DCACHE_LINE_LEN is set to 8. Only single accesses can occur. 512: Used when C_DCACHE_DATA_WIDTH is set to 2. Only single accesses can occur.</td>
</tr>
<tr>
<td>M_AXI_IC</td>
<td>C_INTERCONNECT_M_AXI_IC_READ_ISSUING</td>
<td>2: Default, 2 simultaneous outstanding reads. 8: Used when C_ICACHE_STREAMS is set to 1, allowing 8 simultaneous outstanding reads. Can be set to 1, 2, 4, 8.</td>
</tr>
<tr>
<td>M_AXI_DC</td>
<td>C_INTERCONNECT_M_AXI_DC_READ_ISSUING</td>
<td>2: Default, 2 simultaneous outstanding reads. Can be set to 1 or 2.</td>
</tr>
<tr>
<td>M_AXI_DC</td>
<td>C_INTERCONNECT_M_AXI_DC_WRITE_ISSUING</td>
<td>32: Default, 32 simultaneous outstanding writes. Can be set to 1, 2, 4, 8, 16, or 32.</td>
</tr>
</tbody>
</table>

1. This value can be explicitly set by the user to limit the number of simultaneous accesses accepted by the AXI interconnect, which may lower performance but can reduce the interconnect size.
Stream Interfaces

The MicroBlaze AXI4-Stream interfaces (M0_AXIS..M15_AXIS, S0_AXIS..S15_AXIS) are implemented as 32-bit masters and slaves. Please refer to the *AMBA® AXI4-Stream Protocol Specification, Version 1.0, ARM IHI 0051A* document for further details.

The Mn_AXIS_TLAST and Sn_AXIS_TLAST signals directly correspond to the equivalent FSLn_M_Control and FSLn_S_Control signals, respectively.

Write Operation

A write to the stream interface is performed by MicroBlaze using one of the put or putd instructions. A write operation transfers the register contents to an output AXI4 interface. The transfer is completed in a single clock cycle for blocking mode writes (put and cput instructions) as long as the interface is not busy. If the interface is busy, the processor stalls until it becomes available. The non-blocking instructions (with prefix n), always complete in a single clock cycle even if the interface is busy. If the interface was busy, the write is inhibited and the carry bit is set in the MSR.

Read Operation

A read from the stream interface is performed by MicroBlaze using one of the get or getd instructions. A read operations transfers the contents of an input AXI4 interface to a general purpose register. The transfer is typically completed in 2 clock cycles for blocking mode reads as long as data is available. If data is not available, the processor stalls at this instruction until it becomes available. In the non-blocking mode (instructions with prefix n), the transfer is completed in one or two clock cycles irrespective of whether or not data was available. In case data was not available, the transfer of data does not take place and the carry bit is set in the MSR.

Processor Local Bus (PLB) Interface Description

The MicroBlaze PLB interfaces are implemented as byte-enable capable 32-bit masters. Please refer to the *IBM 128-Bit Processor Local Bus Architectural Specification (v4.6)* document for details.

Local Memory Bus (LMB) Interface Description

The LMB is a synchronous bus used primarily to access on-chip block RAM. It uses a minimum number of control signals and a simple protocol to ensure that local block RAM are accessed in a single clock cycle. LMB signals and definitions are shown in the following table. All LMB signals are active high.

LMB Signal Interface

<table>
<thead>
<tr>
<th>Signal</th>
<th>Data Interface</th>
<th>Instruction Interface</th>
<th>Type</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Addr[0:31]</td>
<td>Data_Addr[0:31]</td>
<td>Instr_Addr[0:31]</td>
<td>O</td>
<td>Address bus</td>
</tr>
<tr>
<td>Byte_Enable[0:3]</td>
<td>Byte_Enable[0:3]</td>
<td>not used</td>
<td>O</td>
<td>Byte enables</td>
</tr>
<tr>
<td>Data_Write[0:31]</td>
<td>Data_Write[0:31]</td>
<td>not used</td>
<td>O</td>
<td>Write data bus</td>
</tr>
</tbody>
</table>
Chapter 3: MicroBlaze Signal Interface Description

### Table 3-3: LMB Bus Signals (Continued)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Data Interface</th>
<th>Instruction Interface</th>
<th>Type</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>AS</td>
<td>D_AS</td>
<td>I_AS</td>
<td>O</td>
<td>Address strobe</td>
</tr>
<tr>
<td>Read_Strobe</td>
<td>Read_Strobe</td>
<td>IFetch</td>
<td>O</td>
<td>Read in progress</td>
</tr>
<tr>
<td>Write_Strobe</td>
<td>Write_Strobe</td>
<td><em>not used</em></td>
<td>O</td>
<td>Write in progress</td>
</tr>
<tr>
<td>Data_Read[0:31]</td>
<td>Data_Read[0:31]</td>
<td>Instr[0:31]</td>
<td>I</td>
<td>Read data bus</td>
</tr>
<tr>
<td>Ready</td>
<td>DReady</td>
<td>IReady</td>
<td>I</td>
<td>Ready for next transfer</td>
</tr>
<tr>
<td>Wait(^1)</td>
<td>DWait</td>
<td>IWaIt</td>
<td>I</td>
<td>Wait until accepted transfer is ready</td>
</tr>
<tr>
<td>CE(^1)</td>
<td>DCE</td>
<td>ICE</td>
<td>I</td>
<td>Correctable error</td>
</tr>
<tr>
<td>UE(^1)</td>
<td>DUE</td>
<td>IUE</td>
<td>I</td>
<td>Uncorrectable error</td>
</tr>
<tr>
<td>Clk</td>
<td>Clk</td>
<td>Clk</td>
<td>I</td>
<td>Bus clock</td>
</tr>
</tbody>
</table>

1. Added in LMB for MicroBlaze v8.00

### Addr[0:31]

The address bus is an output from the core and indicates the memory address that is being accessed by the current transfer. It is valid only when AS is high. In multicycle accesses (accesses requiring more than one clock cycle to complete), Addr[0:31] is valid only in the first clock cycle of the transfer.

### Byte_Enable[0:3]

The byte enable signals are outputs from the core and indicate which byte lanes of the data bus contain valid data. Byte_Enable[0:3] is valid only when AS is high. In multicycle accesses (accesses requiring more than one clock cycle to complete), Byte_Enable[0:3] is valid only in the first clock cycle of the transfer. Valid values for Byte_Enable[0:3] are shown in the following table:

<table>
<thead>
<tr>
<th>Table 3-4: Valid Values for Byte_Enable[0:3]</th>
</tr>
</thead>
<tbody>
<tr>
<td>Byte_Lanes Used</td>
</tr>
<tr>
<td>0001</td>
</tr>
<tr>
<td>0010</td>
</tr>
<tr>
<td>0100</td>
</tr>
<tr>
<td>1000</td>
</tr>
<tr>
<td>0011</td>
</tr>
<tr>
<td>1100</td>
</tr>
<tr>
<td>1111</td>
</tr>
</tbody>
</table>

\(^1\) Added in LMB for MicroBlaze v8.00
Local Memory Bus (LMB) Interface Description

Data_Write[0:31]
The write data bus is an output from the core and contains the data that is written to memory. It is valid only when AS is high. Only the byte lanes specified by Byte_Enable[0:3] contain valid data.

AS
The address strobe is an output from the core and indicates the start of a transfer and qualifies the address bus and the byte enables. It is high only in the first clock cycle of the transfer, after which it goes low and remains low until the start of the next transfer.

Read_Strobe
The read strobe is an output from the core and indicates that a read transfer is in progress. This signal goes high in the first clock cycle of the transfer, and may remain high until the clock cycle after Ready is sampled high. If a new read transfer is directly started in the next clock cycle, then Read_Strobe remains high.

Write_Strobe
The write strobe is an output from the core and indicates that a write transfer is in progress. This signal goes high in the first clock cycle of the transfer, and may remain high until the clock cycle after Ready is sampled high. If a new write transfer is directly started in the next clock cycle, then Write_Strobe remains high.

Data_Read[0:31]
The read data bus is an input to the core and contains data read from memory. Data_Read[0:31] is valid on the rising edge of the clock when Ready is high.

Ready
The Ready signal is an input to the core and indicates completion of the current transfer and that the next transfer can begin in the following clock cycle. It is sampled on the rising edge of the clock. For reads, this signal indicates the Data_Read[0:31] bus is valid, and for writes it indicates that the Data_Write[0:31] bus has been written to local memory.

Wait
The Wait signal is an input to the core and indicates that the current transfer has been accepted, but not yet completed. It is sampled on the rising edge of the clock.

CE
The CE signal is an input to the core and indicates that the current transfer had a correctable error. It is valid on the rising edge of the clock when Ready is high. For reads, this signal indicates that an error has been corrected on the Data_Read[0:31] bus, and for byte and halfword writes it indicates that the corresponding data word in local memory has been corrected before writing the new data.

UE
The UE signal is an input to the core and indicates that the current transfer had an uncorrectable error. It is valid on the rising edge of the clock when Ready is high. For reads, this signal indicates that the value of the Data_Read[0:31] bus is erroneous, and for byte and halfword writes it
indicates that the corresponding data word in local memory was erroneous before writing the new data.

Clk

All operations on the LMB are synchronous to the MicroBlaze core clock.

LMB Transactions

The following diagrams provide examples of LMB bus operations.

Generic Write Operations

![Figure 3-2: LMB Generic Write Operation, 0 Wait States](image-url)
Local Memory Bus (LMB) Interface Description

Figure 3-3: LMB Generic Write Operation, N Wait States

Generic Read Operations

Figure 3-4: LMB Generic Read Operation, 0 Wait States
Chapter 3: MicroBlaze Signal Interface Description

Back-to-Back Write Operation

Clk
Addr
Byte_Enable
Data_Write
AS
Read_Strobe
Write_Strobe
Data_Read
Ready
Wait
CE
UE

D0

Don’t Care

Don’t Care

Don’t Care

Figure 3-5: LMB Generic Read Operation, N Wait States

Figure 3-6: LMB Back-to-Back Write Operation
Back-to-Back Read Operation

<table>
<thead>
<tr>
<th>Clk</th>
<th>Addr</th>
<th>Byte_Enable</th>
<th>Data_Write</th>
<th>AS</th>
<th>Read_Strobe</th>
<th>Write_Strobe</th>
<th>Data_Read</th>
<th>Ready</th>
<th>Wait</th>
<th>CE</th>
<th>UE</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>A0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>D0</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>A1</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>D1</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>D2</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>D3</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>D4</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

*Figure 3-7: LMB Back-to-Back Read Operation*

Back-to-Back Mixed Write/Read Operation

<table>
<thead>
<tr>
<th>Clk</th>
<th>Addr</th>
<th>Byte_Enable</th>
<th>Data_Write</th>
<th>AS</th>
<th>Read_Strobe</th>
<th>Write_Strobe</th>
<th>Data_Read</th>
<th>Ready</th>
<th>Wait</th>
<th>CE</th>
<th>UE</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>A0</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>D0</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>A1</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>D1</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>D2</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>D3</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>D4</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

*Figure 3-8: Back-to-Back Mixed Write/Read Operation, 0 Wait States*
Figure 3-9: Back-to-Back Mixed Write/Read Operation, N Wait States
Read and Write Data Steering

The MicroBlaze data-side bus interface performs the read steering and write steering required to support the following transfers:

- byte, halfword, and word transfers to word devices
- byte and halfword transfers to halfword devices
- byte transfers to byte devices

MicroBlaze does not support transfers that are larger than the addressed device. These types of transfers require dynamic bus sizing and conversion cycles that are not supported by the MicroBlaze bus interface. Data steering for read cycles are shown in Table 3-5 and Table 3-6, and data steering for write cycles are shown in Table 3-7 and Table 3-8.

**Table 3-5: Big Endian Read Data Steering (Load to Register rD)**

<table>
<thead>
<tr>
<th>Address [30:31]</th>
<th>Byte_Enable [0:3]</th>
<th>Transfer Size</th>
<th>Register rD Data</th>
</tr>
</thead>
<tbody>
<tr>
<td>11</td>
<td>0001</td>
<td>byte</td>
<td>Byte3</td>
</tr>
<tr>
<td>10</td>
<td>0010</td>
<td>byte</td>
<td>Byte2</td>
</tr>
<tr>
<td>01</td>
<td>0100</td>
<td>byte</td>
<td>Byte1</td>
</tr>
<tr>
<td>00</td>
<td>1000</td>
<td>byte</td>
<td>Byte0</td>
</tr>
<tr>
<td>10</td>
<td>0011</td>
<td>halfword</td>
<td>Byte2 Byte3</td>
</tr>
<tr>
<td>00</td>
<td>1100</td>
<td>halfword</td>
<td>Byte0 Byte1</td>
</tr>
<tr>
<td>00</td>
<td>1111</td>
<td>word</td>
<td>Byte0 Byte1 Byte2 Byte3</td>
</tr>
</tbody>
</table>

**Table 3-6: Little Endian Read Data Steering (Load to Register rD)**

<table>
<thead>
<tr>
<th>Address [30:31]</th>
<th>Byte_Enable [0:3]</th>
<th>Transfer Size</th>
<th>Register rD Data</th>
</tr>
</thead>
<tbody>
<tr>
<td>11</td>
<td>1000</td>
<td>byte</td>
<td>Byte0</td>
</tr>
<tr>
<td>10</td>
<td>0100</td>
<td>byte</td>
<td>Byte1</td>
</tr>
<tr>
<td>01</td>
<td>0010</td>
<td>byte</td>
<td>Byte2</td>
</tr>
<tr>
<td>00</td>
<td>0001</td>
<td>byte</td>
<td>Byte3</td>
</tr>
<tr>
<td>10</td>
<td>1100</td>
<td>halfword</td>
<td>Byte0 Byte1</td>
</tr>
<tr>
<td>00</td>
<td>0011</td>
<td>halfword</td>
<td>Byte2 Byte3</td>
</tr>
<tr>
<td>00</td>
<td>1111</td>
<td>word</td>
<td>Byte0 Byte1 Byte2 Byte3</td>
</tr>
</tbody>
</table>
### Table 3-7: Big Endian Write Data Steering (Store from Register rD)

<table>
<thead>
<tr>
<th>Address [30:31]</th>
<th>Byte_Enable [0:3]</th>
<th>Transfer Size</th>
<th>Write Data Bus Bytes</th>
</tr>
</thead>
<tbody>
<tr>
<td>11</td>
<td>0001</td>
<td>byte</td>
<td>rD[24:31]</td>
</tr>
<tr>
<td>10</td>
<td>0010</td>
<td>byte</td>
<td>rD[24:31]</td>
</tr>
<tr>
<td>01</td>
<td>0100</td>
<td>byte</td>
<td>rD[24:31]</td>
</tr>
<tr>
<td>00</td>
<td>1000</td>
<td>byte</td>
<td>rD[24:31]</td>
</tr>
<tr>
<td>10</td>
<td>0011</td>
<td>halfword</td>
<td>rD[16:23] rD[24:31]</td>
</tr>
<tr>
<td>00</td>
<td>1100</td>
<td>halfword</td>
<td>rD[16:23] rD[24:31]</td>
</tr>
</tbody>
</table>

### Table 3-8: Little Endian Write Data Steering (Store from Register rD)

<table>
<thead>
<tr>
<th>Address [30:31]</th>
<th>Byte_Enable [0:3]</th>
<th>Transfer Size</th>
<th>Write Data Bus Bytes</th>
</tr>
</thead>
<tbody>
<tr>
<td>11</td>
<td>1000</td>
<td>byte</td>
<td>rD[24:31]</td>
</tr>
<tr>
<td>10</td>
<td>0100</td>
<td>byte</td>
<td>rD[24:31]</td>
</tr>
<tr>
<td>01</td>
<td>0010</td>
<td>byte</td>
<td>rD[24:31]</td>
</tr>
<tr>
<td>00</td>
<td>0001</td>
<td>byte</td>
<td>rD[24:31]</td>
</tr>
<tr>
<td>10</td>
<td>1100</td>
<td>halfword</td>
<td>rD[16:23] rD[24:31]</td>
</tr>
<tr>
<td>00</td>
<td>0011</td>
<td>halfword</td>
<td>rD[16:23] rD[24:31]</td>
</tr>
</tbody>
</table>

**Note:** Other masters may have more restrictive requirements for byte lane placement than those allowed by MicroBlaze. Slave devices are typically attached “left-justified” with byte devices attached to the most-significant byte lane, and halfword devices attached to the most significant halfword lane. The MicroBlaze steering logic fully supports this attachment method.
Fast Simplex Link (FSL) Interface Description

The Fast Simplex Link bus provides a point-to-point communication channel between an output FIFO and an input FIFO. For more information on the generic FSL protocol, see the Fast Simplex Link (FSL) Bus (DS449) data-sheet in the Xilinx EDK IP Documentation.

Master FSL Signal Interface

MicroBlaze may contain up to 16 master FSL interfaces. The master signals are depicted in Table 3-9.

Table 3-9: Master FSL Signals

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Description</th>
<th>VHDL Type</th>
<th>Direction</th>
</tr>
</thead>
<tbody>
<tr>
<td>FSLn_M_Clk</td>
<td>Clock</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>FSLn_M_Write</td>
<td>Write enable signal indicating that data is being written to the output FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>FSLn_M_Data</td>
<td>Data value written to the output FSL</td>
<td>std_logic_vector</td>
<td>output</td>
</tr>
<tr>
<td>FSLn_M_Control</td>
<td>Control bit value written to the output FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>FSLn_M_Full</td>
<td>Full Bit indicating output FSL FIFO is full when set</td>
<td>std_logic</td>
<td>input</td>
</tr>
</tbody>
</table>

Slave FSL Signal Interface

MicroBlaze may contain up to 16 slave FSL interfaces. The slave FSL interface signals are depicted in Table 3-10.

Table 3-10: Slave FSL Signals

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Description</th>
<th>VHDL Type</th>
<th>Direction</th>
</tr>
</thead>
<tbody>
<tr>
<td>FSLn_S_Clk</td>
<td>Clock</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>FSLn_S_Read</td>
<td>Read acknowledge signal indicating that data has been read from the input FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>FSLn_S_Data</td>
<td>Data value currently available at the top of the input FSL</td>
<td>std_logic_vector</td>
<td>input</td>
</tr>
<tr>
<td>FSLn_S_Control</td>
<td>Control Bit value currently available at the top of the input FSL</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>FSLn_S_Exists</td>
<td>Flag indicating that data exists in the input FSL</td>
<td>std_logic</td>
<td>input</td>
</tr>
</tbody>
</table>
FSL Transactions

FSL BUS Write Operation

A write to the FSL bus is performed by MicroBlaze using one of the put or putd instructions. A write operation transfers the register contents to an output FSL bus. The transfer is completed in a single clock cycle for blocking mode writes to the FSL (put and cput instructions) as long as the FSL FIFO does not become full. If the FSL FIFO is full, the processor stalls until the FSL full flag is lowered. The non-blocking instructions (with prefix n), always complete in a single clock cycle even if the FSL was full. If the FSL was full, the write is inhibited and the carry bit is set in the MSR.

FSL BUS Read Operation

A read from the FSL bus is performed by MicroBlaze using one of the get or getd instructions. A read operation transfers the contents of an input FSL to a general purpose register. The transfer is typically completed in 2 clock cycles for blocking mode reads from the FSL as long as data exists in the FSL FIFO. If the FSL FIFO is empty, the processor stalls at this instruction until the FSL exists flag is set. In the non-blocking mode (instructions with prefix n), the transfer is completed in one or two clock cycles irrespective of whether or not the FSL was empty. In the case the FSL was empty, the transfer of data does not take place and the carry bit is set in the MSR.

Direct FSL Connections

A direct FSL connection can be used to avoid the need for the FSL bus. This can be useful in case no buffering is needed between the two connected IP cores, since the FSL bus FIFO is not included with a direct connection. No buffering reduces the communication latency and required implementation resources.

Each of the MicroBlaze FSL interfaces can either use a direct FSL connection or an FSL bus.

A MicroBlaze DWFSL interface is the initiator on a direct FSL connection, which can only be connected to a DWFSL target. The DWFSL initiator and target have exactly the same signal names, identical to the MFSL signals, depicted in Table 3-9. MicroBlaze uses the DWFSL interface to write data to the target with one of the put or putd instructions.

A MicroBlaze DRFSL interface is the target on a direct FSL connection, which can only be connected to a DRFSL initiator. The DRFSL initiator and target have exactly the same signal names, identical to the SFSL signals, depicted in Table 3-10. MicroBlaze uses the DRFSL interface to read data from the initiator with one of the get or getd instructions.

The Xilinx CacheLink (XCL) interface is implemented with direct FSL connections.
Xilinx CacheLink (XCL) Interface Description

Xilinx CacheLink (XCL) is a high performance solution for external memory accesses. The MicroBlaze CacheLink interface is designed to connect directly to a memory controller with integrated FSL buffers, for example, the MPMC. This method has the lowest latency and minimal number of instantiations (see Figure 3-10).

### Schematic

**Figure 3-10:** CacheLink Connection with Integrated FSL Buffers (Only Instruction Cache Used in this Example)

The interface is only available on MicroBlaze when caches are enabled. It is legal to use a CacheLink cache on the instruction side or the data side without caching the other.

How memory locations are accessed depend on the parameter `C_ICACHE_ALWAYS_USED` for the instruction cache and the parameter `C_DCACHE_ALWAYS_USED` for the data cache. If the parameter is 1, the cached memory range is always accessed via the CacheLink. If the parameter is 0, the cached memory range is accessed over PLB whenever the caches are software disabled (that is, MSR[ICE]=0 or MSR[DCE]=0).

Memory locations outside the cacheable range are accessed over PLB or LMB.

The CacheLink cache controllers handle 4 or 8-word cache lines, either using critical word first or linear fetch depending on the selected protocol. At the same time the separation from the PLB bus reduces contention for non-cached memory accesses.

### CacheLink Signal Interface

The CacheLink signals on MicroBlaze are listed in Table 3-11.

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Description</th>
<th>VHDL Type</th>
<th>Direction</th>
</tr>
</thead>
<tbody>
<tr>
<td>ICACHE_FSL_IN_Clk</td>
<td>Clock output to I-side return read data FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>ICACHE_FSL_IN_Read</td>
<td>Read signal to I-side return read data FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Signal Name</td>
<td>Description</td>
<td>VHDL Type</td>
<td>Direction</td>
</tr>
<tr>
<td>-----------------------</td>
<td>-----------------------------------------------------------------------------</td>
<td>--------------------------------</td>
<td>-----------</td>
</tr>
<tr>
<td>ICACHE_FSL_IN_Data</td>
<td>Read data from I-side return read data FSL</td>
<td>std_logic_vector (0 to 31)</td>
<td>input</td>
</tr>
<tr>
<td>ICACHE_FSL_IN_Control</td>
<td>FSL control-bit from I-side return read data FSL. Reserved for future use</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>ICACHE_FSL_IN_Exists</td>
<td>More read data exists in I-side return FSL</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>ICACHE_FSL_OUT_Clk</td>
<td>Clock output to I-side read access FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>ICACHE_FSL_OUT_Write</td>
<td>Write new cache miss access request to I-side read access FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>ICACHE_FSL_OUT_Data</td>
<td>Cache miss access (=address) to I-side read access FSL</td>
<td>std_logic_vector (0 to 31)</td>
<td>output</td>
</tr>
<tr>
<td>ICACHE_FSL_OUT_Control</td>
<td>FSL control-bit to I-side read access FSL. Reserved for future use</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>ICACHE_FSL_OUT_Full</td>
<td>FSL access buffer for I-side read accesses is full</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>DCACHE_FSL_IN_Clk</td>
<td>Clock output to D-side return read data FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>DCACHE_FSL_IN_Read</td>
<td>Read signal to D-side return read data FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>DCACHE_FSL_IN_Data</td>
<td>Read data from D-side return read data FSL</td>
<td>std_logic_vector (0 to 31)</td>
<td>input</td>
</tr>
<tr>
<td>DCACHE_FSL_IN_Control</td>
<td>FSL control bit from D-side return read data FSL</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>DCACHE_FSL_IN_Exists</td>
<td>More read data exists in D-side return FSL</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>DCACHE_FSL_OUT_Clk</td>
<td>Clock output to D-side read access FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>DCACHE_FSL_OUT_Write</td>
<td>Write new cache miss access request to D-side read access FSL</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>DCACHE_FSL_OUT_Data</td>
<td>Cache miss access (read address or write address + write data + byte write enable + burst write encoding) to D-side read access FSL</td>
<td>std_logic_vector (0 to 31)</td>
<td>output</td>
</tr>
</tbody>
</table>
CacheLink Transactions

All individual CacheLink accesses follow the FSL FIFO based transaction protocol:

- Access information is encoded over the FSL data and control signals (e.g. DCACHE_FSL_OUT_Data, DCACHE_FSL_OUT_Control, ICACHE_FSL_IN_Data, and ICACHE_FSL_IN_Control)
- Information is sent (stored) by raising the write enable signal (e.g. DCACHE_FSL_OUT_Write)
- The sender is only allowed to write if the full signal from the receiver is inactive (e.g. DCACHE_FSL_OUT_Full = 0). The full signal is not used by the instruction cache controller.
- The use of ICACHE_FSL_IN_Read and DCACHE_FSL_IN_Read depends on the selected interface protocol:
  - With the IXCL and DXCL protocol, information is received (loaded) by raising the read signal. The signal is low, except when the sender signals that new data exists.
  - With the IXCL2 and DXCL2 protocol, lowering the read signal indicates that the receiver is not able to accept new data. New data is only read when the read signal is high, and the sender signals that data exists. Once a burst read has started, the read signal is not lowered.
- The receiver is only allowed to read as long as the sender signals that new data exists (e.g. ICACHE_FSL_IN_Exists = 1)

For details on the generic FSL protocol, please see the Fast Simplex Link (FSL) Bus (DS449) datasheet in the Xilinx EDK IP Documentation.

The CacheLink solution uses one incoming (slave) and one outgoing (master) FSL per cache controller. The outgoing FSL is used to send access requests, while the incoming FSL is used for receiving the requested cache lines. CacheLink also uses a specific encoding of the transaction information over the FSL data and control signals.

The cache lines used for reads in the CacheLink protocol are 4 or 8 words long. Each cache line is either fetched with the critical word first, or in linear order, depending on the selected interface protocol.

- Critical word first is used by the IXCL and DXCL protocol, selected when C_ICACHE_INTERFACE = 0 (IXCL) and C_DCACHE_INTERFACE = 0 (DXCL), respectively. Each cache line is expected to start with the critical word first (that is, if an access to address 0x348 is a miss with a 4 word cache line, then the returned cache line should have the following address sequence: 0x348, 0x34c, 0x340, 0x344). The cache controller forwards the first word to the execution unit as well as stores it in the cache memory. This allows execution to resume as soon as the first word is back. The cache controller then follows through by filling up the cache line with the remaining 3 or 7 words as they are received.

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Description</th>
<th>VHDL Type</th>
<th>Direction</th>
</tr>
</thead>
<tbody>
<tr>
<td>DCACHE_FSL_OUT_Control</td>
<td>FSL control-bit to D-side read access FSL. Used with address bits [30 to 31] for read/write, byte enable and burst write encoding.</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>DCACHE_FSL_OUT_Full</td>
<td>FSL access buffer for D-side read accesses is full</td>
<td>std_logic</td>
<td>input</td>
</tr>
</tbody>
</table>

### Table 3-11: MicroBlaze Cache Link Signals (Continued)
• Linear fetch is used by the IXCL2 and DXCL2 protocol, selected when \( \text{C\_ICACHE\_INTERFACE} = 1 \) (IXCL2) and \( \text{C\_DCACHE\_INTERFACE} = 1 \) (DXCL2), respectively. The address output on the CacheLink is then aligned to the cache line size (that is, if an access to address 0x348 is a miss with a 4 word cache line, then the address output on the CacheLink is 0x340). The cache controller stores data in the cache memory, and forwards the requested word to the execution unit when it is available.

When the parameter \( \text{C\_DCACHE\_USE\_WRITEBACK} \) is set to 1, write operations can store an entire cache line using burst write, as well as single-words. Each cache line is always stored in linear order, and the address output on the CacheLink is aligned to the cache line size. When the parameter \( \text{C\_DCACHE\_USE\_WRITEBACK} \) is cleared to 0, all write operations on the CacheLink are single-word. \( \text{C\_DCACHE\_INTERFACE} \) must be set to 1 (DXCL2) when write-back is used, since burst write is only available with the DXCL2 protocol.

### Instruction Cache Read Miss

On a read miss the cache controller performs the following sequence:

1. Write the word aligned\(^1\) or cache line aligned missed address to \( \text{ICACHE\_FSL\_OUT\_Data} \), with the control bit set low (\( \text{ICACHE\_FSL\_OUT\_Control} = 0 \)) to indicate a read access
2. Wait until \( \text{ICACHE\_FSL\_IN\_Exists} \) goes high to indicate that data is available

**Note:** There must be at least one clock cycle before \( \text{ICACHE\_FSL\_IN\_Exists} \) goes high (that is, at least one wait state must be used).

With the IXCL protocol (critical word first):

3. Store the word from \( \text{ICACHE\_FSL\_IN\_Data} \) to the cache
4. Forward the critical word to the execution unit in order to resume execution
5. Repeat 3 and 4 for the subsequent 3 or 7 words in the cache line

With the IXCL2 protocol (linear fetch):

3. Store words from \( \text{ICACHE\_FSL\_IN\_Data} \) to the cache
4. Forward the relevant word to the execution unit in order to resume execution
5. Store remaining words from \( \text{ICACHE\_FSL\_IN\_Data} \) to the cache

### Data Cache Read Miss

On a read miss the cache controller will perform the following sequence:

1. If \( \text{DCACHE\_FSL\_OUT\_Full} = 1 \) then stall until it goes low
2. Write the word aligned\(^1\) or cache line aligned missed address to \( \text{DCACHE\_FSL\_OUT\_Data} \), with the control bit set low (\( \text{DCACHE\_FSL\_OUT\_Control} = 0 \)) to indicate a read access
3. Wait until \( \text{DCACHE\_FSL\_IN\_Exists} \) goes high to indicate that data is available

**Note:** There must be at least one clock cycle before \( \text{DCACHE\_FSL\_IN\_Exists} \) goes high (that is, at least one wait state must be used).

With the DXCL protocol (critical word first):

4. Store the word from \( \text{DCACHE\_FSL\_IN\_Data} \) to the cache
5. Forward the critical word to the execution unit in order to resume execution
6. Repeat 4 and 5 for the subsequent 3 or 7 words in the cache line

---

1. Byte and halfword read misses are naturally expected to return complete words, the cache controller then provides the execution unit with the correct bytes.
With the DXCL2 protocol (linear fetch):

4. Store words from DCACHE_FSL_IN_Data to the cache
5. Forward the requested word to the execution unit in order to resume execution
6. Store remaining words from DCACHE_FSL_IN_Data to the cache

Data Cache Write

When C_DCACHE_INTERFACE is set to 1 (DXCL2), the CacheLink can either do burst write or single-word write.

A burst write is used when C_DCACHE_USE_WRITEBACK is set to 1 and an entire cache line is valid. There are two occasions when an entire cache line becomes valid:

- If a cache miss occurs for a load instruction or byte/halfword store instruction, which causes the entire cache line to be read into the cache with a burst read.
- All words in the cache line have been written with word store instructions.

Note that writes to the data cache always are write-through when C_DCACHE_USE_WRITEBACK is cleared to 0, and thus there is a write over the CacheLink regardless of whether there was a hit or miss in the cache.

With the DXCL2 protocol, on a burst cache line write, the cache controller performs the following sequence:

1. If DCACHE_FSL_OUT_Full = 1 then stall until it goes low
2. Write the cache aligned address to DCACHE_FSL_OUT_Data, with the control bit set high (DCACHE_FSL_OUT_Control = 1) to indicate a write access. The two least-significant bits (30:31) of the address are used to encode burst access: 0b10=burst. To separate a burst access from a single byte-write, the control bit for the first data word in step 4 is low for a burst access (DCACHE_FSL_OUT_Control = 0).
3. If DCACHE_FSL_OUT_Full = 1 then stall until it goes low
4. Write the data to be stored to DCACHE_FSL_OUT_Data. The control bit is low (DCACHE_FSL_OUT_Control = 0) for a burst access.
5. Repeat 3 and 4 for the subsequent words in the cache line.

With either the DXCL or DXCL2 protocol, on a single-word write, the cache controller performs the following sequence:

1. If DCACHE_FSL_OUT_Full = 1 then stall until it goes low
2. Write the missed address to DCACHE_FSL_OUT_Data, with the control bit set high (DCACHE_FSL_OUT_Control = 1) to indicate a write access. The two least-significant bits (30:31) of the address are used to encode byte and half-word enables: 0b00=byte0, 0b01=byte1 or halfword0, 0x10=byte2, and 0x11=byte3 or halfword1. The selection of half-word or byte access is based on the control bit for the data word in step 4.
3. If DCACHE_FSL_OUT_Full = 1 then stall until it goes low
4. Write the data to be stored to DCACHE_FSL_OUT_Data. For byte and halfword accesses the data is mirrored onto byte-lanes. Mirroring outputs the byte or halfword to be written on all four byte-lanes or on both halfword-lanes, respectively. The control bit should be low (DCACHE_FSL_OUT_Control = 0) for a word or halfword access, and high for a byte access to separate it from a burst access. Word or halfword accesses can be distinguished by the least significant bit of the address (0=word and 1=halfword).
Lockstep Interface Description

The lockstep interface on MicroBlaze is designed to connect a master and one or more slave MicroBlaze instances. The lockstep signals on MicroBlaze are listed in Table 3-12.

Table 3-12: MicroBlaze Lockstep Signals

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Description</th>
<th>VHDL Type</th>
<th>Direction</th>
</tr>
</thead>
<tbody>
<tr>
<td>Lockstep_Master_Out</td>
<td>Output with signals going from master to slave MicroBlaze. Not</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td></td>
<td>connected on slaves.</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Lockstep_Slave_In</td>
<td>Input with signals coming from master to slave MicroBlaze. Not</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td></td>
<td>connected on master.</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Lockstep_Out</td>
<td>Output with all comparison signals from both master and slaves.</td>
<td>std_logic</td>
<td>output</td>
</tr>
</tbody>
</table>

The comparison signals provided by Lockstep_Out are listed in Table 3-13.

Table 3-13: MicroBlaze Lockstep Comparison Signals

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Bus Index Range</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>MB_Halted</td>
<td>0</td>
<td>std_logic</td>
</tr>
<tr>
<td>MB_Error</td>
<td>1</td>
<td>std_logic</td>
</tr>
<tr>
<td>IFetch_POS</td>
<td>2</td>
<td>std_logic</td>
</tr>
<tr>
<td>I_AS_POS</td>
<td>3</td>
<td>std_logic</td>
</tr>
<tr>
<td>Instr_Addr</td>
<td>4 to 35</td>
<td>std_logic</td>
</tr>
<tr>
<td>Data_Addr</td>
<td>36 to 67</td>
<td>std_logic</td>
</tr>
<tr>
<td>Data_Write</td>
<td>68 to 99</td>
<td>std_logic</td>
</tr>
<tr>
<td>D_AS</td>
<td>100</td>
<td>std_logic</td>
</tr>
<tr>
<td>Read_Strobe</td>
<td>101</td>
<td>std_logic</td>
</tr>
<tr>
<td>Write_Strobe</td>
<td>102</td>
<td>std_logic</td>
</tr>
<tr>
<td>Byte_Enable</td>
<td>103 to 106</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_ABort</td>
<td>107</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_busLock</td>
<td>108</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_lockErr</td>
<td>109</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_rdBurst</td>
<td>110</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_request</td>
<td>111</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_RNW</td>
<td>112</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_wrBurst</td>
<td>113</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_MSize</td>
<td>114 to 115</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_priority</td>
<td>116 to 117</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_ABus</td>
<td>118 to 149</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_UABus</td>
<td>150 to 181</td>
<td>std_logic</td>
</tr>
<tr>
<td>IPLB_M_BE^1</td>
<td>182 to 197</td>
<td>std_logic</td>
</tr>
</tbody>
</table>
Table 3-13: MicroBlaze Lockstep Comparison Signals (Continued)

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Bus Index Range</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>IPLB_M_size</td>
<td>198 to 201</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>IPLB_M_TAttribute</td>
<td>202 to 217</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>IPLB_M_type</td>
<td>218 to 220</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>IPLB_M_wrDBus1</td>
<td>221 to 348</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>DPLB_M_ABORT</td>
<td>349</td>
<td>std_logic</td>
</tr>
<tr>
<td>DPLB_M_busLock</td>
<td>350</td>
<td>std_logic</td>
</tr>
<tr>
<td>DPLB_M_lockErr</td>
<td>351</td>
<td>std_logic</td>
</tr>
<tr>
<td>DPLB_M_rdBurst</td>
<td>352</td>
<td>std_logic</td>
</tr>
<tr>
<td>DPLB_M_request</td>
<td>353</td>
<td>std_logic</td>
</tr>
<tr>
<td>DPLB_M_RNW</td>
<td>354</td>
<td>std_logic</td>
</tr>
<tr>
<td>DPLB_M_wrBurst</td>
<td>355</td>
<td>std_logic</td>
</tr>
<tr>
<td>DPLB_M_ABus</td>
<td>356 to 387</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>DPLB_M_UABus</td>
<td>388 to 419</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>DPLB_M_BE1</td>
<td>420 to 435</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>DPLB_M_MSize</td>
<td>436 to 437</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>DPLB_M_priority</td>
<td>438 to 439</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>DPLB_M_size</td>
<td>440 to 443</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>DPLB_M_TAttribute</td>
<td>444 to 459</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>DPLB_M_type</td>
<td>460 to 462</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>DPLB_M_wrDBus1</td>
<td>463 to 590</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>ICACHE_FSL_IN_Clk</td>
<td>591</td>
<td>std_logic</td>
</tr>
<tr>
<td>ICACHE_FSL_IN_Read</td>
<td>592</td>
<td>std_logic</td>
</tr>
<tr>
<td>ICACHE_FSL_OUT_Clk</td>
<td>593</td>
<td>std_logic</td>
</tr>
<tr>
<td>ICACHE_FSL_OUT_Write</td>
<td>594</td>
<td>std_logic</td>
</tr>
<tr>
<td>ICACHE_FSL_OUT_Data</td>
<td>595 to 626</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>ICACHE_FSL_OUT_Control</td>
<td>627</td>
<td>std_logic</td>
</tr>
<tr>
<td>DCACHE_FSL_IN_Clk</td>
<td>628</td>
<td>std_logic</td>
</tr>
<tr>
<td>DCACHE_FSL_IN_Read</td>
<td>629</td>
<td>std_logic</td>
</tr>
<tr>
<td>DCACHE_FSL_OUT_Clk</td>
<td>630</td>
<td>std_logic</td>
</tr>
<tr>
<td>DCACHE_FSL_OUT_Write</td>
<td>631</td>
<td>std_logic</td>
</tr>
<tr>
<td>DCACHE_FSL_OUT_Data</td>
<td>632 to 663</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>DCACHE_FSL_OUT_Control</td>
<td>664</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IP_AWID</td>
<td>665</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IP_AWADDR</td>
<td>666 to 697</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_AWLEN</td>
<td>698 to 705</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_AWSIZE</td>
<td>706 to 708</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_AWBURST</td>
<td>709 to 710</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_AWLOCK</td>
<td>711</td>
<td>std_logic</td>
</tr>
</tbody>
</table>
### Table 3-13: MicroBlaze Lockstep Comparison Signals (Continued)

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Bus Index Range</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>M_AXI_IP_AWCACHE</td>
<td>712 to 715</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_AWPROT</td>
<td>716 to 718</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_AWQOS</td>
<td>719 to 722</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_AWVALID</td>
<td>723</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IP_WDATA</td>
<td>724 to 755</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_WSTRB</td>
<td>756 to 759</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_WLAST</td>
<td>760</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IP_WVALID</td>
<td>761</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IP_BREADY</td>
<td>762</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IP_ARID</td>
<td>763</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IP_ARADDR</td>
<td>764 to 795</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_ARLEN</td>
<td>796 to 803</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_ARSIZE</td>
<td>804 to 806</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_ARBURST</td>
<td>807 to 808</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_ARLOCK</td>
<td>809</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IP_ARCACHE</td>
<td>810 to 813</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_ARPROT</td>
<td>814 to 816</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_ARQOS</td>
<td>817 to 820</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IP_ARVALID</td>
<td>821</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IP_RREADY</td>
<td>822</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DP_AWID</td>
<td>823</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DP_AWADDR</td>
<td>824 to 855</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_ARLEN</td>
<td>856 to 863</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_AWSIZE</td>
<td>864 to 866</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_AWBURST</td>
<td>867 to 868</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_AWLOCK</td>
<td>869</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DP_AWCACHE</td>
<td>870 to 873</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_AWPROT</td>
<td>874 to 876</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_AWQOS</td>
<td>877 to 880</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_AWVALID</td>
<td>881</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DP_WDATA</td>
<td>882 to 913</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_WSTRB</td>
<td>914 to 917</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_WLAST</td>
<td>918</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DP_WVALID</td>
<td>919</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DP_BREADY</td>
<td>920</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DP_ARID</td>
<td>921</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DP_ARADDR</td>
<td>922 to 953</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_ARLEN</td>
<td>954 to 961</td>
<td>std_logic_vector</td>
</tr>
</tbody>
</table>
### Table 3-13: MicroBlaze Lockstep Comparison Signals (Continued)

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Bus Index Range</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>M_AXI_DP_ARSIZE</td>
<td>962 to 964</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_ARBURST</td>
<td>965 to 966</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_ARLOCK</td>
<td>967</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DP_ARCACHE</td>
<td>968 to 971</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_ARPROT</td>
<td>972 to 974</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_ARQOS</td>
<td>975 to 978</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DP_ARVALID</td>
<td>979</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DP_RREADY</td>
<td>980</td>
<td>std_logic</td>
</tr>
<tr>
<td>FSLn_S_Clk</td>
<td>981 + n * 37</td>
<td>std_logic</td>
</tr>
<tr>
<td>FSLn_S_Read</td>
<td>982 + n * 37</td>
<td>std_logic</td>
</tr>
<tr>
<td>FSLn_M_Clk</td>
<td>983 + n * 37</td>
<td>std_logic</td>
</tr>
<tr>
<td>FSLn_M_Write</td>
<td>984 + n * 37</td>
<td>std_logic</td>
</tr>
<tr>
<td>FSLn_M_Data</td>
<td>985 + n * 37 to 1016 + n * 37</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>FSLn_M_Control</td>
<td>1017 + n * 37</td>
<td>std_logic</td>
</tr>
<tr>
<td>Mn_AXISIS_TLAST</td>
<td>1573 + n * 35</td>
<td>std_logic</td>
</tr>
<tr>
<td>Mn_AXISIS_TDATA</td>
<td>1574 + n * 35 to 1606 + n * 35</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Mn_AXISIS_TVALID</td>
<td>1607 + n * 35</td>
<td>std_logic</td>
</tr>
<tr>
<td>Sn_AXISIS_TREADY</td>
<td>1608 + n * 35</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IC_AWID</td>
<td>2133</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IC_AWADDR</td>
<td>2134 to 2165</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_AWLEN</td>
<td>2166 to 2173</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_AWSIZE</td>
<td>2174 to 2176</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_AWBURST</td>
<td>2177 to 2178</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_AWLOCK</td>
<td>2179</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IC_AWCACHE</td>
<td>2180 to 2183</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_AWPROT</td>
<td>2184 to 2186</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_AWQOS</td>
<td>2187 to 2190</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_AWVALID</td>
<td>2191</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IC_AWUSER</td>
<td>2192 to 2196</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_WDATA(^1)</td>
<td>2197 to 2708</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_WSTRB(^1)</td>
<td>2709 to 2772</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_WLAST</td>
<td>2773</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IC_WVALID</td>
<td>2774</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IC_WUSER</td>
<td>2775</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IC_BREADY</td>
<td>2776</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IC_ARID</td>
<td>2777</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_ARADDR</td>
<td>2778 to 2809</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_ARLEN</td>
<td>2810 to 2817</td>
<td>std_logic_vector</td>
</tr>
</tbody>
</table>
Table 3-13: MicroBlaze Lockstep Comparison Signals (Continued)

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Bus Index Range</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>M_AXI_IC_ARSIZE</td>
<td>2818 to 2820</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_ARBURST</td>
<td>2821 to 2822</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_ARLOCK</td>
<td>2823</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IC_ARCACHE</td>
<td>2824 to 2827</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_ARPROT</td>
<td>2828 to 2830</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_ARQOS</td>
<td>2831 to 2834</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_ARVALID</td>
<td>2835</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_IC_ARUSER</td>
<td>2836 to 2840</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_IC_RREADY</td>
<td>2841</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_AWID</td>
<td>2842</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_AWADDR</td>
<td>2843 to 2874</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_AWLEN</td>
<td>2875 to 2882</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_AWSIZE</td>
<td>2883 to 2885</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_AWBURST</td>
<td>2886 to 2887</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_AWLOCK</td>
<td>2888</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_AWCACHE</td>
<td>2889 to 2892</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_AWPROT</td>
<td>2893 to 2895</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_AWQOS</td>
<td>2896 to 2899</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_AWVALID</td>
<td>2900</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_AWUSER</td>
<td>2901 to 2905</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_WDATA1</td>
<td>2906 to 3417</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_WSTRB1</td>
<td>3418 to 3481</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_WLAST</td>
<td>3482</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_WVALID</td>
<td>3483</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_WUSER</td>
<td>3484</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_BREADY</td>
<td>3485</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_ARID</td>
<td>3486</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_ARADDR</td>
<td>3487 to 3518</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_ARLEN</td>
<td>3519 to 3526</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_ARSIZE</td>
<td>3527 to 3529</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_ARBURST</td>
<td>3530 to 3531</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_ARLOCK</td>
<td>3532</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_ARCACHE</td>
<td>3533 to 3536</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_ARPROT</td>
<td>3537 to 3539</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_ARQOS</td>
<td>3540 to 3543</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_ARVALID</td>
<td>3544</td>
<td>std_logic</td>
</tr>
<tr>
<td>M_AXI_DC_ARUSER</td>
<td>3545 to 3549</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>M_AXI_DC_RREADY</td>
<td>3550</td>
<td>std_logic</td>
</tr>
<tr>
<td>Signal Name</td>
<td>Bus Index Range</td>
<td>VHDL Type</td>
</tr>
<tr>
<td>--------------------------</td>
<td>---------------------</td>
<td>------------------</td>
</tr>
<tr>
<td>Trace_Instruction</td>
<td>3551 to 3582</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Trace_Valid_Instr</td>
<td>3583</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_PC</td>
<td>3584 to 3615</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Trace_Reg_Write</td>
<td>3616</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_Reg.Addr</td>
<td>3617 to 3621</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Trace_MSR_Reg</td>
<td>3622 to 3636</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Trace_PID_Reg</td>
<td>3637 to 3644</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Trace_New_Reg_Value</td>
<td>3645 to 3676</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Trace_Exception_Taken</td>
<td>3677</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_Exception_Kind</td>
<td>3678 to 3682</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Trace_Jump_Taken</td>
<td>3683</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_Delay_Slot</td>
<td>3684</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_Data_Address</td>
<td>3685 to 3716</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Trace_Data_Write_Value</td>
<td>3717 to 3748</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Trace_Data_Byte_Enable</td>
<td>3749 to 3752</td>
<td>std_logic_vector</td>
</tr>
<tr>
<td>Trace_Data_Access</td>
<td>3753</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_Data_Read</td>
<td>3754</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_Data_Write</td>
<td>3755</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_DCache_Req</td>
<td>3756</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_DCache_Hit</td>
<td>3757</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_DCache_Rdy</td>
<td>3758</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_DCache_Read</td>
<td>3759</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_ICache_Req</td>
<td>3760</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_ICache_Hit</td>
<td>3761</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_ICache_Rdy</td>
<td>3762</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_OF_PipeRun</td>
<td>3763</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_EX_PipeRun</td>
<td>3764</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_MEM_PipeRun</td>
<td>3765</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_MB_Halted</td>
<td>3766</td>
<td>std_logic</td>
</tr>
<tr>
<td>Trace_Jump_Hit</td>
<td>3767</td>
<td>std_logic</td>
</tr>
<tr>
<td>Reserved for future use</td>
<td>3768 to 4095</td>
<td></td>
</tr>
</tbody>
</table>

1. This field accommodates the maximum signal width. The part used in the comparison extends from the lowest numbered bit to the actual signal width.
Debug Interface Description

The debug interface on MicroBlaze is designed to work with the Xilinx Microprocessor Debug Module (MDM) IP core. The MDM is controlled by the Xilinx Microprocessor Debugger (XMD) through the JTAG port of the FPGA. The MDM can control multiple MicroBlaze processors at the same time. The debug signals are grouped in the DEBUG bus. The debug signals on MicroBlaze are listed in Table 3-14.

Table 3-14: MicroBlaze Debug Signals

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Description</th>
<th>VHDL Type</th>
<th>Direction</th>
</tr>
</thead>
<tbody>
<tr>
<td>Dbg_Clk</td>
<td>JTAG clock from MDM</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>Dbg_TDI</td>
<td>JTAG TDI from MDM</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>Dbg_TDO</td>
<td>JTAG TDO to MDM</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Dbg_Reg_En</td>
<td>Debug register enable from MDM</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>Dbg_Shift</td>
<td>JTAG BSCAN shift signal from MDM</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>Dbg_Capture</td>
<td>JTAG BSCAN capture signal from MDM</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>Dbg_Update</td>
<td>JTAG BSCAN update signal from MDM</td>
<td>std_logic</td>
<td>input</td>
</tr>
<tr>
<td>Debug_Rst</td>
<td>Reset signal from MDM, active high. Should be held for at least 1 Clk clock cycle.</td>
<td>std_logic</td>
<td>input</td>
</tr>
</tbody>
</table>

1. Updated for MicroBlaze v7.00: Dbg_Shift added and Debug_Rst included in DEBUG bus

Trace Interface Description

The MicroBlaze core exports a number of internal signals for trace purposes. This signal interface is not standardized and new revisions of the processor may not be backward compatible for signal selection or functionality. It is recommended that you not design custom logic for these signals, but rather to use them via Xilinx provided analysis IP. The trace signals are grouped in the TRACE bus. The current set of trace signals were last updated for MicroBlaze v7.30 and are listed in Table 3-15. The Trace exception types are listed in Table 3-16. All unused Trace exception types are reserved.

Table 3-15: MicroBlaze Trace Signals

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Description</th>
<th>VHDL Type</th>
<th>Direction</th>
</tr>
</thead>
<tbody>
<tr>
<td>Trace_Valid_Instr</td>
<td>Valid instruction on trace port.</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Instruction</td>
<td>Instruction code</td>
<td>std_logic_vector (0 to 31)</td>
<td>output</td>
</tr>
<tr>
<td>Trace_PC</td>
<td>Program counter</td>
<td>std_logic_vector (0 to 31)</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Reg_Write</td>
<td>Instruction writes to the register file</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Reg.Addr</td>
<td>Destination register address</td>
<td>std_logic_vector (0 to 4)</td>
<td>output</td>
</tr>
<tr>
<td>Trace_MSR_Reg</td>
<td>Machine status register</td>
<td>std_logic_vector (0 to 14)</td>
<td>output</td>
</tr>
<tr>
<td>Trace_PID_Reg</td>
<td>Process identifier register</td>
<td>std_logic_vector (0 to 7)</td>
<td>output</td>
</tr>
</tbody>
</table>
### Table 3-15: MicroBlaze Trace Signals (Continued)

<table>
<thead>
<tr>
<th>Signal Name</th>
<th>Description</th>
<th>VHDL Type</th>
<th>Direction</th>
</tr>
</thead>
<tbody>
<tr>
<td>Trace_New_Reg_Value</td>
<td>Destination register update value</td>
<td>std_logic_vector (0 to 31)</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Exception_Taken</td>
<td>Instruction result in taken exception</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Exception_Kind</td>
<td>Exception type. The description for the exception type is documented below.</td>
<td>std_logic_vector (0 to 4)</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Jump_Taken</td>
<td>Branch instruction evaluated true, i.e taken</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Jump_Hit</td>
<td>Branch Target Cache hit</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Delay_Slot</td>
<td>Instruction is in delay slot of a taken branch</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Data_Access</td>
<td>Valid D-side memory access</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Data_Address</td>
<td>Address for D-side memory access</td>
<td>std_logic_vector (0 to 31)</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Data_Write_Value</td>
<td>Value for D-side memory write access</td>
<td>std_logic_vector (0 to 31)</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Data_Write_Enable</td>
<td>Byte enables for D-side memory access</td>
<td>std_logic_vector (0 to 3)</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Data_Read</td>
<td>D-side memory access is a read</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_Data_Write</td>
<td>D-side memory access is a write</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_DCache_Req</td>
<td>Data memory address is within D-Cache range</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_DCache_Hit</td>
<td>Data memory address is present in D-Cache</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_DCache_Rdy</td>
<td>Data memory address is within D-Cache range and the access is completed</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_DCache_Read</td>
<td>The D-Cache request is a read</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_ICache_Req</td>
<td>Instruction memory address is within I-Cache range</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_ICache_Hit</td>
<td>Instruction memory address is present in I-Cache</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_ICache_Rdy</td>
<td>Instruction memory address is within I-Cache range and the access is completed</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_OF_PipeRun</td>
<td>Pipeline advance for Decode stage</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_EX_PipeRun</td>
<td>Pipeline advance for Execution stage</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_MEM_PipeRun</td>
<td>Pipeline advance for Memory stage</td>
<td>std_logic</td>
<td>output</td>
</tr>
<tr>
<td>Trace_MB_Halted</td>
<td>Pipeline is halted by debug</td>
<td>std_logic</td>
<td>output</td>
</tr>
</tbody>
</table>

1. Valid only when Trace_Valid_Instr = 1
2. Updated for MicroBlaze v7.00: 4 bits added to Trace_MSR_Reg, Trace_PID_Reg added, Trace_MB_Halted added, and 1 bit added to Trace_Exception_Kind
3. Valid only when Trace_Exception_Taken = 1
4. Updated for MicroBlaze v7.30: Trace_DCache_Rdy, Trace_DCache_Read, Trace_ICache_Rdy, and Trace_Jump_Hit added
5. Valid only when Trace_DCache_Req = 1
6. Not used with area optimization feature
### Table 3-16: Type of Trace Exception

<table>
<thead>
<tr>
<th>Trace_Exception_Kind [0:4]</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>00000</td>
<td>Stream exception(^1)</td>
</tr>
<tr>
<td>00001</td>
<td>Unaligned exception</td>
</tr>
<tr>
<td>00010</td>
<td>Illegal Opcode exception</td>
</tr>
<tr>
<td>00011</td>
<td>Instruction Bus exception</td>
</tr>
<tr>
<td>00100</td>
<td>Data Bus exception</td>
</tr>
<tr>
<td>00101</td>
<td>Divide exception</td>
</tr>
<tr>
<td>00110</td>
<td>FPU exception</td>
</tr>
<tr>
<td>00111</td>
<td>Privileged instruction exception(^1)</td>
</tr>
<tr>
<td>01010</td>
<td>Interrupt</td>
</tr>
<tr>
<td>01011</td>
<td>External non maskable break</td>
</tr>
<tr>
<td>01100</td>
<td>External maskable break</td>
</tr>
<tr>
<td>10000</td>
<td>Data storage exception(^1)</td>
</tr>
<tr>
<td>10001</td>
<td>Instruction storage exception(^1)</td>
</tr>
<tr>
<td>10010</td>
<td>Data TLB miss exception(^1)</td>
</tr>
<tr>
<td>10011</td>
<td>Instruction TLB miss exception(^1)</td>
</tr>
</tbody>
</table>

1. Added for MicroBlaze v7.00
MicroBlaze Core Configurability

The MicroBlaze core has been developed to support a high degree of user configurability. This allows tailoring of the processor to meet specific cost/performance requirements.

Configuration is done via parameters that typically enable, size, or select certain processor features. For example, the instruction cache is enabled by setting the C_USE_ICACHE parameter. The size of the instruction cache, and the cacheable memory range, are all configurable using: C_CACHE_BYTE_SIZE, C_ICACHE_BASEADDR, and C_ICACHE_HIGHADDR respectively.

Parameters valid for MicroBlaze v8.00 are listed in Table 3-17. Not all of these are recognized by older versions of MicroBlaze; however, the configurability is fully backward compatibility.

*Note:* Shaded rows indicate that the parameter has a fixed value and cannot be modified.

<table>
<thead>
<tr>
<th>Parameter Name</th>
<th>Feature/Description</th>
<th>Allowable Values</th>
<th>Default Value</th>
<th>EDK Tool Assigned</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>C_FAMILY</td>
<td>Target Family</td>
<td>Listed in Table 3-18</td>
<td>virtex5</td>
<td>yes</td>
<td>string</td>
</tr>
<tr>
<td>C_DATA_SIZE</td>
<td>Data Size</td>
<td>32</td>
<td>32</td>
<td>NA</td>
<td>integer</td>
</tr>
<tr>
<td>C_DYNAMIC_BUS_SIZING</td>
<td>Legacy</td>
<td>1</td>
<td>1</td>
<td>NA</td>
<td>integer</td>
</tr>
<tr>
<td>C_SCO</td>
<td>Xilinx internal</td>
<td>0</td>
<td>0</td>
<td>NA</td>
<td>integer</td>
</tr>
<tr>
<td>C_AREA_OPTIMIZED</td>
<td>Select implementation to optimize area with lower instruction throughput</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_OPTIMIZATION</td>
<td>Reserved for future use</td>
<td>0</td>
<td>0</td>
<td>NA</td>
<td>integer</td>
</tr>
<tr>
<td>C_INTERCONNECT</td>
<td>Select interconnect</td>
<td>1 = PLBv46</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td></td>
<td>2 = AXI4</td>
<td>2</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ENDIANNESS</td>
<td>Select endianness</td>
<td>0 = Big Endian</td>
<td>0</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1 = Little Endian</td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_BASE_VECTORS(^1)</td>
<td>Configurable base vectors</td>
<td>0x00000000-0xffffffff80</td>
<td>0x00000000</td>
<td>std_logic_vector</td>
<td></td>
</tr>
<tr>
<td>C_FAULT_TOLERANT</td>
<td>Implement fault tolerance</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ECC_USE_CE_EXCEPTION</td>
<td>Generate exception for correctible ECC error</td>
<td>0,1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_LOCKSTEP_SLAVE</td>
<td>Lockstep Slave</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
</tbody>
</table>
### Table 3-17: MPD Parameters (Continued)

<table>
<thead>
<tr>
<th>Parameter Name</th>
<th>Feature/Description</th>
<th>Allowable Values</th>
<th>Default Value</th>
<th>EDK Tool Assigned</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>C_AVOID_PRIMITIVES</td>
<td>Disallow FPGA primitives</td>
<td>0, 1, 2, 3</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>0 = None</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1 = SRL</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>2 = LUTRAM</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>3 = Both</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>C_PVR</td>
<td>Processor version register</td>
<td>0, 1, 2</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>mode selection</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>0 = None</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1 = Basic</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>2 = Full</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>C_PVR_USER1</td>
<td>Processor version register</td>
<td>0x00-0xff</td>
<td>0x00</td>
<td>std_logic_vector</td>
<td>(0 to 7)</td>
</tr>
<tr>
<td></td>
<td>USER1 constant</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>C_PVR_USER2</td>
<td>Processor version register</td>
<td>0x00000000-0xffffffff</td>
<td>0x00000000</td>
<td>std_logic_vector</td>
<td>(0 to 31)</td>
</tr>
<tr>
<td></td>
<td>USER2 constant</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>C_RESET_MSR</td>
<td>Reset value for MSR register</td>
<td>0x00, 0x20, 0x80, 0xa0</td>
<td>0x00</td>
<td>std_logic_vector</td>
<td></td>
</tr>
<tr>
<td>C_INSTANCE</td>
<td>Instance Name</td>
<td>Any instance name</td>
<td>microblaze</td>
<td>yes</td>
<td>string</td>
</tr>
<tr>
<td>C_D_PLB</td>
<td>Data side PLB interface</td>
<td>0, 1</td>
<td>0</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_D_AXI</td>
<td>Data side AXI interface</td>
<td>0, 1</td>
<td>0</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_D_LMB</td>
<td>Data side LMB interface</td>
<td>0, 1</td>
<td>1</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_I_PLB</td>
<td>Instruction side PLB interface</td>
<td>0, 1</td>
<td>0</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_I_AXI</td>
<td>Instruction side AXI interface</td>
<td>0, 1</td>
<td>0</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_I_LMB</td>
<td>Instruction side LMB interface</td>
<td>0, 1</td>
<td>1</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_USE_BARREL</td>
<td>Include barrel shifter</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_USE_DIV</td>
<td>Include hardware divider</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_USE_HW_MUL</td>
<td>Include hardware multiplier</td>
<td>0, 1, 2</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>Parameter Name</td>
<td>Feature/Description</td>
<td>Allowable Values</td>
<td>Default Value</td>
<td>EDK Tool Assigned</td>
<td>VHDL Type</td>
</tr>
<tr>
<td>-------------------------------</td>
<td>--------------------------------------------------------------------------------------</td>
<td>------------------</td>
<td>---------------</td>
<td>------------------</td>
<td>-----------</td>
</tr>
<tr>
<td>C_USE_FPU</td>
<td>Include hardware floating point unit</td>
<td>0, 1, 2</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_USE_MSR_INSTR</td>
<td>Enable use of instructions: MSRSET and MSRCLR</td>
<td>0, 1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_USE_PCP_INSTR</td>
<td>Enable use of instructions: CLZ, PCMPBF, PCMPSEQ, and PCMPNE</td>
<td>0, 1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_USE_REORDER_INSTR</td>
<td>Enable use of instructions: Reverse load, reverse store, and swap</td>
<td>0, 1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_UNALIGNED_EXCEPTIONS</td>
<td>Enable exception handling for unaligned data accesses</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_IILL_OPCODE_EXCEPTION</td>
<td>Enable exception handling for illegal op-code</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_IPLB_BUS_EXCEPTION</td>
<td>Enable exception handling for IPLB bus error</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DPLB_BUS_EXCEPTION</td>
<td>Enable exception handling for DPLB bus error</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_I_BUS_EXCEPTION</td>
<td>Enable exception handling for M_AXI_I bus error</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_D_BUS_EXCEPTION</td>
<td>Enable exception handling for M_AXI_D bus error</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DIV_ZERO_EXCEPTION</td>
<td>Enable exception handling for division by zero or division overflow</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_FPU_EXCEPTION</td>
<td>Enable exception handling for hardware floating point unit exceptions</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_OPCODE_0x0_ILLEGAL</td>
<td>Detect opcode 0x0 as an illegal instruction</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_FSL_EXCEPTION</td>
<td>Enable exception handling for Stream Links</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
</tbody>
</table>
### Table 3-17: MPD Parameters (Continued)

<table>
<thead>
<tr>
<th>Parameter Name</th>
<th>Feature/Description</th>
<th>Allowable Values</th>
<th>Default Value</th>
<th>EDK Tool Assigned</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>C_ECC_USE_CE_EXCEPTION</td>
<td>Generate Bus Error Exceptions for correctable errors</td>
<td>0,1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_USE_STACK_PROTECTION</td>
<td>Generate exception for stack overflow or stack underflow</td>
<td>0,1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DEBUG_ENABLED</td>
<td>MDM Debug interface</td>
<td>0,1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_NUMBER_OF_PC_BRK</td>
<td>Number of hardware breakpoints</td>
<td>0-8</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_NUMBER_OF_RD_ADDR_BRK</td>
<td>Number of read address watchpoints</td>
<td>0-4</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_NUMBER_OF_WR_ADDR_BRK</td>
<td>Number of write address watchpoints</td>
<td>0-4</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_INTERRUPT_IS_EDGE</td>
<td>Level/Edge Interrupt</td>
<td>0, 1</td>
<td>0</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_EDGE_IS_POSITIVE</td>
<td>Negative/Positive Edge Interrupt</td>
<td>0, 1</td>
<td>1</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_FSL_LINKS^2</td>
<td>Number of stream interfaces (FSL or AXI)</td>
<td>0-16</td>
<td>0</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_FSL_DATA_SIZE</td>
<td>FSL data bus size</td>
<td>32</td>
<td>32</td>
<td>NA</td>
<td>integer</td>
</tr>
<tr>
<td>C_USE_EXTENDED_FSL_INSTR</td>
<td>Enable use of extended stream instructions</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ICACHE_BASEADDR</td>
<td>Instruction cache base address</td>
<td>0x0000000000 - 0xFFFFFFFF</td>
<td>0x0000000000</td>
<td>std_logic_vector</td>
<td></td>
</tr>
<tr>
<td>C_ICACHE_HIGHLADDR</td>
<td>Instruction cache high address</td>
<td>0x0000000000 - 0xFFFFFFFF</td>
<td>0x3FFFFF</td>
<td>std_logic_vector</td>
<td></td>
</tr>
<tr>
<td>C_USE_ICACHE</td>
<td>Instruction cache</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ALLOW_ICACHE_WR</td>
<td>Instruction cache write enable</td>
<td>0, 1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ICACHE_LINE_LEN</td>
<td>Instruction cache line length</td>
<td>4, 8</td>
<td>4</td>
<td></td>
<td>integer</td>
</tr>
</tbody>
</table>
### MicroBlaze Core Configurability

<table>
<thead>
<tr>
<th>Parameter Name</th>
<th>Feature/Description</th>
<th>Allowable Values</th>
<th>Default Value</th>
<th>EDK Tool Assigned</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>C_ICACHE_ALWAYS_USED</td>
<td>Instruction cache interface used for all memory accesses in the cacheable range</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ICACHE_INTERFACE</td>
<td>Instruction cache CacheLink interface protocol 0 = IXCL 1 = IXCL2</td>
<td>0, 1</td>
<td>0</td>
<td>yes³</td>
<td>integer</td>
</tr>
<tr>
<td>C_ICACHE_FORCE_TAG_LUTRAM</td>
<td>Instruction cache tag always implemented with distributed RAM</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ICACHE_STREAMS</td>
<td>Instruction cache streams</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ICACHE_VICTIMS</td>
<td>Instruction cache victims</td>
<td>0, 2, 4, 8</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ICACHE_DATA_WIDTH</td>
<td>Instruction cache data width 0 = 32 bits 1 = Full cache line 2 = 512 bits</td>
<td>0, 1, 2</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ADDR_TAG_BITS</td>
<td>Instruction cache address tags</td>
<td>0-25</td>
<td>17</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_CACHE_BYTE_SIZE</td>
<td>Instruction cache size 64, 128, 256, 512, 1024, 2048, 4096, 8192, 16384, 32768, 65536⁴</td>
<td>8192</td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ICACHE_USE_FSL</td>
<td>Cache over CacheLink instead of peripheral bus for instructions</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DCACHE_BASEADDR</td>
<td>Data cache base address 0x0000000000 - 0xFFFFFFFF</td>
<td>0x00000000 - 0xFFFFFFFF</td>
<td>0x0000000000</td>
<td>std_logic_vector</td>
<td></td>
</tr>
<tr>
<td>C_DCACHE_HIGHADDR</td>
<td>Data cache high address 0x0000000000 - 0xFFFFFFFF</td>
<td>0x0000000000 - 0x3FFFFFFFFFF</td>
<td>0x3FFFFFFFFFF</td>
<td>std_logic_vector</td>
<td></td>
</tr>
<tr>
<td>C_USE_DCACHE</td>
<td>Data cache</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_ALLOW_DCACHE_WR</td>
<td>Data cache write enable</td>
<td>0, 1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
</tbody>
</table>

### Table 3-17: MPD Parameters (Continued)
### MPD Parameters (Continued)

<table>
<thead>
<tr>
<th>Parameter Name</th>
<th>Feature/Description</th>
<th>Allowable Values</th>
<th>Default Value</th>
<th>EDK Tool Assigned</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>C_DCACHE_LINE_LEN</td>
<td>Data cache line length</td>
<td>4, 8</td>
<td>4</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DCACHE_ALWAYS_USED</td>
<td>Data cache interface used for all accesses in the cacheable range</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DCACHE_INTERFACE</td>
<td>Data cache CacheLink interface protocol</td>
<td>0, 1</td>
<td>0, yes³</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DCACHE_FORCE_TAG_LUTRAM</td>
<td>Data cache tag always implemented with distributed RAM</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DCACHE_USE_WRITEBACK</td>
<td>Data cache write-back storage policy used</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DCACHE_VICTIMS</td>
<td>Data cache victims</td>
<td>0, 2, 4, 8</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DCACHE_DATA_WIDTH</td>
<td>Data cache data width</td>
<td>0, 1, 2</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DCACHE_ADDR_TAG</td>
<td>Data cache address tags</td>
<td>0-25</td>
<td>17</td>
<td>yes³</td>
<td>integer</td>
</tr>
<tr>
<td>C_DCACHE_BYTE_SIZE</td>
<td>Data cache size</td>
<td>64, 128, 256, 512, 1024, 2048, 4096, 8192, 16384, 32768, 65536⁴</td>
<td>8192</td>
<td>integer</td>
<td></td>
</tr>
<tr>
<td>C_DCACHE_USE_FSL</td>
<td>Cache over CacheLink instead of peripheral bus for data</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DPLB_DWIDTH</td>
<td>Data side PLB data width</td>
<td>32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DPLB_NATIVE_DWIDTH</td>
<td>Data side PLB native data width</td>
<td>32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DPLB_BURST_EN</td>
<td>Data side PLB burst enable</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_DPLB_P2P</td>
<td>Data side PLB Point-to-point</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
</tbody>
</table>
### Table 3-17: MPD Parameters (Continued)

<table>
<thead>
<tr>
<th>Parameter Name</th>
<th>Feature/Description</th>
<th>Allowable Values</th>
<th>Default Value</th>
<th>EDK Tool Assigned</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>C_IPLB_DWIDTH</td>
<td>Instruction side PLB data width</td>
<td>32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_IPLB_NATIVE_DWIDTH</td>
<td>Instruction side PLB native data width</td>
<td>32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_IPLB_BURST_EN</td>
<td>Instruction side PLB burst enable</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_IPLB_P2P</td>
<td>Instruction side PLB Point-to-point</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_USE_MMU5</td>
<td>Memory Management:</td>
<td></td>
<td></td>
<td>0, 1, 2, 3</td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>0 = None</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1 = User Mode</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>2 = Protection</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>3 = Virtual</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>C_MMU_DTLB_SIZE5</td>
<td>Data shadow Translation Look-Aside Buffer size</td>
<td>1, 2, 4, 8</td>
<td>4</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_MMU_ITLB_SIZE5</td>
<td>Instruction shadow Translation Look-Aside Buffer size</td>
<td>1, 2, 4, 8</td>
<td>2</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_MMU_TLB_ACCESS5</td>
<td>Access to memory management special registers:</td>
<td></td>
<td></td>
<td>0, 1, 2, 3</td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>0 = Minimal</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1 = Read</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>2 = Write</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>3 = Full</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>C_MMU_ZONES5</td>
<td>Number of memory protection zones</td>
<td>0-16</td>
<td>16</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_MMU_PRIVILEGED_INSTR5</td>
<td>Privileged instructions</td>
<td>0, 1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>0 = Full protection</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1 = Allow stream instrs</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>C_USE_INTERRUPT</td>
<td>Enable interrupt handling</td>
<td>0, 1, 2</td>
<td>0</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>0 = No interrupt</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>1 = Standard interrupt</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td>2 = Low-latency interrupt</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>C_USE_EXT_BRK</td>
<td>Enable external break handling</td>
<td>0, 1</td>
<td>0</td>
<td>yes</td>
<td>integer</td>
</tr>
</tbody>
</table>
Table 3-17: MPD Parameters (Continued)

<table>
<thead>
<tr>
<th>Parameter Name</th>
<th>Feature/Description</th>
<th>Allowable Values</th>
<th>Default Value</th>
<th>EDK Tool Assigned</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>C_USE_EXT_NM_BRK</td>
<td>Enable external non-maskable break handling</td>
<td>0,1</td>
<td>0</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_USE_BRANCH_TARGET_CACHE</td>
<td>Enable Branch Target Cache</td>
<td>0,1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_BRANCH_TARGET_CACHE_SIZE</td>
<td>Branch Target Cache size:</td>
<td></td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>0 = Default</td>
<td></td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>1 = 8 entries</td>
<td></td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>2 = 16 entries</td>
<td></td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>3 = 32 entries</td>
<td></td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>4 = 64 entries</td>
<td></td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>5 = 512 entries</td>
<td></td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>6 = 1024 entries</td>
<td></td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td></td>
<td>7 = 2048 entries</td>
<td></td>
<td></td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DP_THREAD_ID_WIDTH</td>
<td>Data side AXI thread ID width</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DP_DATA_WIDTH</td>
<td>Data side AXI data width</td>
<td>32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DP_ADDR_WIDTH</td>
<td>Data side AXI address width</td>
<td>32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DP_SUPPORTS_THREADS</td>
<td>Data side AXI uses threads</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DP_SUPPORTS_READ</td>
<td>Data side AXI support for read accesses</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DP_SUPPORTS_WRITE</td>
<td>Data side AXI support for write accesses</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DP_SUPPORTS_NARROW_BURST</td>
<td>Data side AXI narrow burst support</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DP_PROTOCOL</td>
<td>Data side AXI protocol</td>
<td>AXI4, AXI4LITE</td>
<td></td>
<td></td>
<td>string</td>
</tr>
<tr>
<td>C_M_AXI_DP_EXCLUSIVE_ACCESS</td>
<td>Data side AXI exclusive access support</td>
<td>0,1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_INTERCONNECT_M_AXI_DP_READ_ISSUING</td>
<td>Data side AXI read accesses issued</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_INTERCONNECT_M_AXI_DP_WRITE_ISSUING</td>
<td>Data side AXI write accesses issued</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
</tbody>
</table>
### Table 3-17: MPD Parameters (Continued)

<table>
<thead>
<tr>
<th>Parameter Name</th>
<th>Feature/Description</th>
<th>Allowable Values</th>
<th>Default Value</th>
<th>EDK Tool Assigned</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>C_M_AXI_IP_THREAD_ID_WIDTH</td>
<td>Instruction side AXI thread ID width</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IP_DATA_WIDTH</td>
<td>Instruction side AXI data width</td>
<td>32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IP_ADDR_WIDTH</td>
<td>Instruction side AXI address width</td>
<td>32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IP_SUPPORTS_THREADS</td>
<td>Instruction side AXI uses threads</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IP_SUPPORTS_READ</td>
<td>Instruction side AXI support for read accesses</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IP_SUPPORTS_WRITE</td>
<td>Instruction side AXI support for write accesses</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IP_SUPPORTS_NARROW_BURST</td>
<td>Instruction side AXI narrow burst support</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IP_PROTOCOL</td>
<td>Instruction side AXI protocol</td>
<td>AXI4LITE</td>
<td>AXI4LITE</td>
<td></td>
<td>string</td>
</tr>
<tr>
<td>C_INTERCONNECT_M_AXI_IP_READ_ISSUING</td>
<td>Instruction side AXI read accesses issued</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_THREAD_ID_WIDTH</td>
<td>Data cache AXI ID width</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_DATA_WIDTH</td>
<td>Data cache AXI data width</td>
<td>32, 64, 128, 256, 512</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_ADDR_WIDTH</td>
<td>Data cache AXI address width</td>
<td>32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_SUPPORTS_THREADS</td>
<td>Data cache AXI uses threads</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_SUPPORTS_READ</td>
<td>Data cache AXI support for read accesses</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_SUPPORTS_WRITE</td>
<td>Data cache AXI support for write accesses</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_SUPPORTS_NARROW_BURST</td>
<td>Data cache AXI narrow burst support</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
</tbody>
</table>
### Table 3-17: MPD Parameters (Continued)

<table>
<thead>
<tr>
<th>Parameter Name</th>
<th>Feature/Description</th>
<th>Allowable Values</th>
<th>Default Value</th>
<th>EDK Tool Assigned</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>C_M_AXI_DC_SUPPORTS_USER_SIGNALS</td>
<td>Data cache AXI user signal support</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_PROTOCOL</td>
<td>Data cache AXI protocol</td>
<td>AXI4</td>
<td>AXI4</td>
<td></td>
<td>string</td>
</tr>
<tr>
<td>C_M_AXI_DC_AWUSER_WIDTH</td>
<td>Data cache AXI user width</td>
<td>5</td>
<td>5</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_ARUSER_WIDTH</td>
<td>Data cache AXI user width</td>
<td>5</td>
<td>5</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_WUSER_WIDTH</td>
<td>Data cache AXI user width</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_RUSER_WIDTH</td>
<td>Data cache AXI user width</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_BUSER_WIDTH</td>
<td>Data cache AXI user width</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_EXCLUSIVE_ACCESS</td>
<td>Data cache AXI exclusive access support</td>
<td>0,1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_DC_USER_VALUE</td>
<td>Data cache AXI user value</td>
<td>0-31</td>
<td>31</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_INTERCONNECT_M_AXI_DC_READ_ISSUING</td>
<td>Data cache AXI read accesses issued</td>
<td>1,2</td>
<td>2</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_INTERCONNECT_M_AXI_DC_WRITE_ISSUING</td>
<td>Data cache AXI write accesses issued</td>
<td>1,2,4,8,16,32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_THREAD_ID_WIDTH</td>
<td>Instruction cache AXI ID width</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_DATA_WIDTH</td>
<td>Instruction cache AXI data width</td>
<td>32, 64, 128, 256, 512</td>
<td>32</td>
<td>integer</td>
<td></td>
</tr>
<tr>
<td>C_M_AXI_IC_ADDR_WIDTH</td>
<td>Instruction cache AXI address width</td>
<td>32</td>
<td>32</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_SUPPORTS_THREADS</td>
<td>Instruction cache AXI uses threads</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_SUPPORTS_READ</td>
<td>Instruction cache AXI support for read accesses</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_SUPPORTS_WRITE</td>
<td>Instruction cache AXI support for write accesses</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_SUPPORTS_NARROW_BURST</td>
<td>Instruction cache AXI narrow burst support</td>
<td>0</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_SUPPORTS_USER_SIGNALS</td>
<td>Instruction cache AXI user signal support</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
</tbody>
</table>
Table 3-17: MPD Parameters (Continued)

<table>
<thead>
<tr>
<th>Parameter Name</th>
<th>Feature/Description</th>
<th>Allowable Values</th>
<th>Default Value</th>
<th>EDK Tool Assigned</th>
<th>VHDL Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>C_M_AXI_IC_PROTOCOL</td>
<td>Instruction cache AXI protocol</td>
<td>AXI4</td>
<td>AXI4</td>
<td></td>
<td>string</td>
</tr>
<tr>
<td>C_M_AXI_IC_AWUSER_WIDTH</td>
<td>Instruction cache AXI user width</td>
<td>5</td>
<td>5</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_ARUSER_WIDTH</td>
<td>Instruction cache AXI user width</td>
<td>5</td>
<td>5</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_WUSER_WIDTH</td>
<td>Instruction cache AXI user width</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_RUSER_WIDTH</td>
<td>Instruction cache AXI user width</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_BUSER_WIDTH</td>
<td>Instruction cache AXI user width</td>
<td>1</td>
<td>1</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_M_AXI_IC_USER_VALUE</td>
<td>Instruction cache AXI user value</td>
<td>0-31</td>
<td>31</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_INTERCONNECT_M_AXI_IC_READ_ISSUING</td>
<td>Instruction cache AXI read access issued</td>
<td>1,2,4,8</td>
<td>2</td>
<td>yes</td>
<td>integer</td>
</tr>
<tr>
<td>C_STREAM_INTERCONNECT</td>
<td>Select AXI4-Stream interconnect</td>
<td>0,1</td>
<td>0</td>
<td></td>
<td>integer</td>
</tr>
<tr>
<td>C_Mn_AXIS_PROTOCOL</td>
<td>AXI4-Stream protocol</td>
<td>GENERIC</td>
<td>GENERIC</td>
<td></td>
<td>string</td>
</tr>
<tr>
<td>C_Sn_AXIS_PROTOCOL</td>
<td>AXI4-Stream protocol</td>
<td>GENERIC</td>
<td>GENERIC</td>
<td></td>
<td>string</td>
</tr>
<tr>
<td>C_Mn_AXIS_DATA_WIDTH</td>
<td>AXI4-Stream master data width</td>
<td>32</td>
<td>32</td>
<td>NA</td>
<td>integer</td>
</tr>
<tr>
<td>C_Sn_AXIS_DATA_WIDTH</td>
<td>AXI4-Stream slave data width</td>
<td>32</td>
<td>32</td>
<td>NA</td>
<td>integer</td>
</tr>
</tbody>
</table>

1. The 7 least significant bits must all be 0.
2. The number of Stream Links (FSL or AXI4) is assigned by the tool itself if you are using the co-processor wizard. If you add the IP manually, you must update the parameter manually.
3. EDK tool assigned value can be overridden by explicit assignment.
4. Not all sizes are permitted in all architectures. The cache uses between 0 and 32 RAMB primitives (0 if cache size is less than 2048).
5. Not available when C_AREA_OPTIMIZED is set to 1.
### Table 3-18: Parameter C_FAMILY Allowable Values

<table>
<thead>
<tr>
<th></th>
<th>Allowable Values</th>
</tr>
</thead>
<tbody>
<tr>
<td>Artix</td>
<td>aartix7 aartix7 artix7l qartix7 qartix7l</td>
</tr>
<tr>
<td>Kintex</td>
<td>kintex7 kintex7l qkintex7 qkintex7l</td>
</tr>
<tr>
<td>Spartan</td>
<td>aspartan3 aspartan3a aspartan3adsp aspartan3e aspartan6 qaspartan6 qaspartan6l aspartan3 spartan3a spartan3adsp spartan3an spartan3e spartan6 spartan6l</td>
</tr>
<tr>
<td>Virtex</td>
<td>qrvirtex4 qrvirtex5 qrvirtex4 qrvirtex5 qrvirtex6 qrvirtex7 qrvirtex7l qrvirtex4 qrvirtex5 qrvirtex6 qrvirtex7 qrvirtex7l</td>
</tr>
<tr>
<td>Zynq</td>
<td>azynq azynq zynql qazynq</td>
</tr>
</tbody>
</table>
Chapter 4

MicroBlaze Application Binary Interface

This chapter describes MicroBlaze™ Application Binary Interface (ABI), which is important for developing software in assembly language for the soft processor. The MicroBlaze GNU compiler follows the conventions described in this document. Any code written by assembly programmers should also follow the same conventions to be compatible with the compiler generated code. Interrupt and Exception handling is also explained briefly.

Data Types

The data types used by MicroBlaze assembly programs are shown in Table 4-1. Data types such as data8, data16, and data32 are used in place of the usual byte, half-word, and word.

Table 4-1: Data Types in MicroBlaze Assembly Programs

<table>
<thead>
<tr>
<th>MicroBlaze data types (for assembly programs)</th>
<th>Corresponding ANSI C data types</th>
<th>Size (bytes)</th>
</tr>
</thead>
<tbody>
<tr>
<td>data8</td>
<td>char</td>
<td>1</td>
</tr>
<tr>
<td>data16</td>
<td>short</td>
<td>2</td>
</tr>
<tr>
<td>data32</td>
<td>int</td>
<td>4</td>
</tr>
<tr>
<td>data32</td>
<td>long int</td>
<td>4</td>
</tr>
<tr>
<td>data32</td>
<td>float</td>
<td>4</td>
</tr>
<tr>
<td>data32</td>
<td>enum</td>
<td>4</td>
</tr>
<tr>
<td>data16/data32</td>
<td>pointer¹</td>
<td>2/4</td>
</tr>
</tbody>
</table>

1. Pointers to small data areas, which can be accessed by global pointers are data16.
## Register Usage Conventions

The register usage convention for MicroBlaze is given in Table 4-2.

**Table 4-2: Register Usage Conventions**

<table>
<thead>
<tr>
<th>Register</th>
<th>Type</th>
<th>Enforcement</th>
<th>Purpose</th>
</tr>
</thead>
<tbody>
<tr>
<td>R0</td>
<td>Dedicated</td>
<td>HW</td>
<td>Value 0</td>
</tr>
<tr>
<td>R1</td>
<td>Dedicated</td>
<td>SW</td>
<td>Stack Pointer</td>
</tr>
<tr>
<td>R2</td>
<td>Dedicated</td>
<td>SW</td>
<td>Read-only small data area anchor</td>
</tr>
<tr>
<td>R3-R4</td>
<td>Volatile</td>
<td>SW</td>
<td>Return Values/Temporaries</td>
</tr>
<tr>
<td>R5-R10</td>
<td>Volatile</td>
<td>SW</td>
<td>Passing parameters/Temporaries</td>
</tr>
<tr>
<td>R11-R12</td>
<td>Volatile</td>
<td>SW</td>
<td>Temporaries</td>
</tr>
<tr>
<td>R13</td>
<td>Dedicated</td>
<td>SW</td>
<td>Read-write small data area anchor</td>
</tr>
<tr>
<td>R14</td>
<td>Dedicated</td>
<td>HW</td>
<td>Return address for Interrupt</td>
</tr>
<tr>
<td>R15</td>
<td>Dedicated</td>
<td>SW</td>
<td>Return address for Sub-routine</td>
</tr>
<tr>
<td>R16</td>
<td>Dedicated</td>
<td>HW</td>
<td>Return address for Trap (Debugger)</td>
</tr>
<tr>
<td>R17</td>
<td>Dedicated</td>
<td>HW, if configured to support HW exceptions, else SW</td>
<td>Return address for Exceptions</td>
</tr>
<tr>
<td>R18</td>
<td>Dedicated</td>
<td>SW</td>
<td>Reserved for Assembler/Compiler Temporaries</td>
</tr>
<tr>
<td>R19</td>
<td>Non-volatile</td>
<td>SW</td>
<td>Must be saved across function calls. Callee-save</td>
</tr>
<tr>
<td>R20</td>
<td>Dedicated or</td>
<td>SW</td>
<td>Reserved for storing a pointer to the Global Offset Table (GOT) in Position Independent Code (PIC). Non-volatile in non-PIC code. Must be saved across function calls. Callee-save</td>
</tr>
<tr>
<td></td>
<td>Non-volatile</td>
<td></td>
<td></td>
</tr>
<tr>
<td>R21-R31</td>
<td>Non-volatile</td>
<td>SW</td>
<td>Must be saved across function calls. Callee-save</td>
</tr>
<tr>
<td>RPC</td>
<td>Special</td>
<td>HW</td>
<td>Program counter</td>
</tr>
<tr>
<td>RMSR</td>
<td>Special</td>
<td>HW</td>
<td>Machine Status Register</td>
</tr>
<tr>
<td>REAR</td>
<td>Special</td>
<td>HW</td>
<td>Exception Address Register</td>
</tr>
<tr>
<td>RESR</td>
<td>Special</td>
<td>HW</td>
<td>Exception Status Register</td>
</tr>
<tr>
<td>RFSR</td>
<td>Special</td>
<td>HW</td>
<td>Floating Point Status Register</td>
</tr>
<tr>
<td>RBTR</td>
<td>Special</td>
<td>HW</td>
<td>Branch Target Register</td>
</tr>
<tr>
<td>REDR</td>
<td>Special</td>
<td>HW</td>
<td>Exception Data Register</td>
</tr>
<tr>
<td>RPID</td>
<td>Special</td>
<td>HW</td>
<td>Process Identifier Register</td>
</tr>
<tr>
<td>RZPR</td>
<td>Special</td>
<td>HW</td>
<td>Zone Protection Register</td>
</tr>
<tr>
<td>RTLblo</td>
<td>Special</td>
<td>HW</td>
<td>Translation Look-Aside Buffer Low Register</td>
</tr>
<tr>
<td>RTLbhi</td>
<td>Special</td>
<td>HW</td>
<td>Translation Look-Aside Buffer High Register</td>
</tr>
<tr>
<td>RTLbx</td>
<td>Special</td>
<td>HW</td>
<td>Translation Look-Aside Buffer Index Register</td>
</tr>
<tr>
<td>RTLbxsx</td>
<td>Special</td>
<td>HW</td>
<td>Translation Look-Aside Buffer Search Index</td>
</tr>
<tr>
<td>RPVR0-11</td>
<td>Special</td>
<td>HW</td>
<td>Processor Version Register 0 through 11</td>
</tr>
</tbody>
</table>
The architecture for MicroBlaze defines 32 general purpose registers (GPRs). These registers are classified as volatile, non-volatile, and dedicated.

- The volatile registers (also known as caller-save) are used as temporaries and do not retain values across the function calls. Registers R3 through R12 are volatile, of which R3 and R4 are used for returning values to the caller function, if any. Registers R5 through R10 are used for passing parameters between subroutines.

- Registers R19 through R31 retain their contents across function calls and are hence termed as non-volatile registers (a.k.a callee-save). The callee function is expected to save those non-volatile registers, which are being used. These are typically saved to the stack during the prologue and then reloaded during the epilogue.

- Certain registers are used as dedicated registers and programmers are not expected to use them for any other purpose.
  - Registers R14 through R17 are used for storing the return address from interrupts, subroutines, traps, and exceptions in that order. Subroutines are called using the branch and link instruction, which saves the current Program Counter (PC) onto register R15.
  - Small data area pointers are used for accessing certain memory locations with 16-bit immediate value. These areas are discussed in the memory model section of this document. The read only small data area (SDA) anchor R2 (Read-Only) is used to access the constants such as literals. The other SDA anchor R13 (Read-Write) is used for accessing the values in the small data read-write section.
  - Register R1 stores the value of the stack pointer and is updated on entry and exit from functions.
  - Register R18 is used as a temporary register for assembler operations.

- MicroBlaze includes special purpose registers such as: program counter (rpc), machine status register (rmsr), exception status register (resr), exception address register (rear), floating point status register (rfsr), branch target register (rbtr), exception data register (redr), memory management registers (rpid, rzpr, rtlblo, rtlbhi, rtlbx, rtlbsx), and processor version registers (rpvr0-rpvr11). These registers are not mapped directly to the register file and hence the usage of these registers is different from the general purpose registers. The value of a special purpose registers can be transferred to or from a general purpose register by using mts and mfs instructions respectively.

Stack Convention

The stack conventions used by MicroBlaze are detailed in Table 4-3.

The shaded area in Table 4-3 denotes a part of the stack frame for a caller function, while the unshaded area indicates the callee frame function. The ABI conventions of the stack frame define the protocol for passing parameters, preserving non-volatile register values, and allocating space for the local variables in a function.

Functions that contain calls to other subroutines are called as non-leaf functions. These non-leaf functions have to create a new stack frame area for its own use. When the program starts executing, the stack pointer has the maximum value. As functions are called, the stack pointer is decremented by the number of words required by every function for its stack frame. The stack pointer of a caller function always has a higher value as compared to the callee function.
Consider an example where Func1 calls Func2, which in turn calls Func3. The stack representation at different instances is depicted in Figure 4-1. After the call from Func 1 to Func 2, the value of the stack pointer (SP) is decremented. This value of SP is again decremented to accommodate the stack frame for Func3. On return from Func 3 the value of the stack pointer is increased to its original value in the function, Func 2.

Details of how the stack is maintained are shown in Figure 4-1.

### Table 4-3: Stack Convention

<table>
<thead>
<tr>
<th>High Address</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>Function Parameters for called sub-routine (Arg n .. Arg1)</td>
<td>(Optional: Maximum number of arguments required for any called procedure from the current procedure).</td>
</tr>
<tr>
<td>Old Stack Pointer</td>
<td>Link Register (R15)</td>
</tr>
<tr>
<td>Callee Saved Register (R31....R19)</td>
<td>(Optional: Only those registers which are used by the current procedure are saved)</td>
</tr>
<tr>
<td>Local Variables for Current Procedure</td>
<td>(Optional: Present only if Locals defined in the procedure)</td>
</tr>
<tr>
<td>Functional Parameters (Arg n .. Arg 1)</td>
<td>(Optional: Maximum number of arguments required for any called procedure from the current procedure)</td>
</tr>
<tr>
<td>New Stack Pointer</td>
<td>Link Register</td>
</tr>
<tr>
<td>Low Address</td>
<td></td>
</tr>
</tbody>
</table>

![Figure 4-1: Stack Frame](image-url)
Calling Convention

The caller function passes parameters to the callee function using either the registers (R5 through R10) or on its own stack frame. The callee uses the stack area of the caller to store the parameters passed to the callee.

Refer to Figure 4-1. The parameters for Func 2 are stored either in the registers R5 through R10 or on the stack frame allocated for Func 1.

If Func 2 has more than six integer parameters, the first six parameters can be passed in registers R5 through R10, whereas all subsequent parameters must be passed on the stack frame allocated for Func 1, starting at offset SP + 28.

Memory Model

The memory model for MicroBlaze classifies the data into four different parts: Small Data Area, Data Area, Common Un-Initialized Area, and Literals or Constants.

Small Data Area

Global initialized variables which are small in size are stored in this area. The threshold for deciding the size of the variable to be stored in the small data area is set to 8 bytes in the MicroBlaze C compiler (mb-gcc), but this can be changed by giving a command line option to the compiler. Details about this option are discussed in the GNU Compiler Tools chapter. 64 kilobytes of memory is allocated for the small data areas. The small data area is accessed using the read-write small data area anchor (R13) and a 16-bit offset. Allocating small variables to this area reduces the requirement of adding IMM instructions to the code for accessing global variables. Any variable in the small data area can also be accessed using an absolute address.

Data Area

Comparatively large initialized variables are allocated to the data area, which can either be accessed using the read-write SDA anchor R13 or using the absolute address, depending on the command line option given to the compiler.

Common Un-Initialized Area

Un-initialized global variables are allocated in the common area and can be accessed either using the absolute address or using the read-write small data area anchor R13.

Literals or Constants

Constants are placed into the read-only small data area and are accessed using the read-only small data area anchor R2.

The compiler generates appropriate global pointers to act as base pointers. The actual values of the SDA anchors are decided by the linker, in the final linking stages. For more information on the various sections of the memory please refer to MicroBlaze Linker Script Sections in the Embedded System Tools Reference Manual. The compiler generates appropriate sections, depending on the command line options. Please refer to the GNU Compiler Tools chapter in the Embedded System Tools Reference Manual for more information about these options.
Interrupt and Exception Handling

MicroBlaze assumes certain address locations for handling interrupts and exceptions as indicated in Table 4-4. At these locations, code is written to jump to the appropriate handlers.

Table 4-4: Interrupt and Exception Handling

<table>
<thead>
<tr>
<th>On</th>
<th>Hardware jumps to</th>
<th>Software Labels</th>
</tr>
</thead>
<tbody>
<tr>
<td>Start / Reset</td>
<td>C_BASE_VECTORS + 0x0</td>
<td>_start</td>
</tr>
<tr>
<td>User exception</td>
<td>C_BASE_VECTORS + 0x8</td>
<td>_exception_handler</td>
</tr>
<tr>
<td>Interrupt</td>
<td>C_BASE_VECTORS + 0x10(^1)</td>
<td>_interrupt_handler</td>
</tr>
<tr>
<td>Break (HW/SW)</td>
<td>C_BASE_VECTORS + 0x18</td>
<td></td>
</tr>
<tr>
<td>Hardware exception</td>
<td>C_BASE_VECTORS + 0x20</td>
<td>_hw_exception_handler</td>
</tr>
<tr>
<td>Reserved by Xilinx for</td>
<td>C_BASE_VECTORS + 0x28-</td>
<td></td>
</tr>
<tr>
<td>future use</td>
<td>C_BASE_VECTORS + 0x4F</td>
<td></td>
</tr>
</tbody>
</table>

1. With low-latency interrupt mode, the vector address is supplied by the Interrupt Controller.

The code expected at these locations is as shown below. For programs compiled without the `-xl-mode-xmdstub` compiler option, the `crt0.o` initialization file is passed by the `mb-gcc` compiler to the `mb-ld` linker for linking. This file sets the appropriate addresses of the exception handlers.

For programs compiled with the `-xl-mode-xmdstub` compiler option, the `crt1.o` initialization file is linked to the output program. This program has to be run with the xmdstub already loaded in the memory at address location 0x0. Hence at run-time, the initialization code in `crt1.o` writes the appropriate instructions to location 0x8 through 0x14 depending on the address of the exception and interrupt handlers.

The following is code for passing control to Exception and Interrupt handlers, assuming the default `C_BASE_VECTORS` value of 0x00000000:

```
0x00: bri _start1
0x04: nop
0x08: imm high bits of address (user exception handler)
0x0c: bri _exception_handler
0x10: imm high bits of address (interrupt handler)
0x14: bri _interrupt_handler
0x20: imm high bits of address (HW exception handler)
0x24: bri _hw_exception_handler
```

With low-latency interrupt mode, control is directly passed to the interrupt handler for each individual interrupt utilizing this mode. In this case, it is the responsibility of each handler to save and restore used registers. The MicroBlaze C compiler (mb-gcc) attribute `fast_interrupt` is available to allow this task to be performed by the compiler:

```c
void interrupt_handler_name() __attribute__(({fast_interrupt}));
```

MicroBlaze allows exception and interrupt handler routines to be located at any address location addressable using 32 bits.

The user exception handler code starts with the label `_exception_handler`, the hardware exception handler starts with `_hw_exception_handler`, while the interrupt handler code starts with the label `_interrupt_handler` for interrupts that do not use low-latency handlers.
In the current MicroBlaze system, there are dummy routines for interrupt and exception handling, which you can change. In order to override these routines and link your interrupt and exception handlers, you must define the interrupt handler code with the attribute `interrupt_handler:

```c
void function_name() __attribute__((interrupt_handler));
```

For more details about the use and syntax of the interrupt handler attribute, please refer to the GNU Compiler Tools chapter in the *Embedded System Tools Reference Guide*.

When software breakpoints are used in the Xilinx Microprocessor Debug (XMD) tool, the Break (HW/SW) address location is reserved for handling the software breakpoint.
This chapter provides a detailed guide to the Instruction Set Architecture of MicroBlaze™.

**Notation**

The symbols used throughout this chapter are defined in Table 5-1.

<table>
<thead>
<tr>
<th>Symbol</th>
<th>Meaning</th>
</tr>
</thead>
<tbody>
<tr>
<td>+</td>
<td>Add</td>
</tr>
<tr>
<td>-</td>
<td>Subtract</td>
</tr>
<tr>
<td>×</td>
<td>Multiply</td>
</tr>
<tr>
<td>/</td>
<td>Divide</td>
</tr>
<tr>
<td>∧</td>
<td>Bitwise logical AND</td>
</tr>
<tr>
<td>∨</td>
<td>Bitwise logical OR</td>
</tr>
<tr>
<td>⊕</td>
<td>Bitwise logical XOR</td>
</tr>
<tr>
<td>x</td>
<td>Bitwise logical complement of x</td>
</tr>
<tr>
<td>←</td>
<td>Assignment</td>
</tr>
<tr>
<td>&gt;&gt;</td>
<td>Right shift</td>
</tr>
<tr>
<td>&lt;&lt;</td>
<td>Left shift</td>
</tr>
<tr>
<td>rx</td>
<td>Register x</td>
</tr>
<tr>
<td>x[i]</td>
<td>Bit i in register x</td>
</tr>
<tr>
<td>x[i:j]</td>
<td>Bits i through j in register x</td>
</tr>
<tr>
<td>=</td>
<td>Equal comparison</td>
</tr>
<tr>
<td>≠</td>
<td>Not equal comparison</td>
</tr>
<tr>
<td>&gt;</td>
<td>Greater than comparison</td>
</tr>
<tr>
<td>&gt;=</td>
<td>Greater than or equal comparison</td>
</tr>
<tr>
<td>&lt;</td>
<td>Less than comparison</td>
</tr>
<tr>
<td>&lt;=</td>
<td>Less than or equal comparison</td>
</tr>
<tr>
<td></td>
<td></td>
</tr>
</tbody>
</table>
### Table 5-1: Symbol Notation (Continued)

<table>
<thead>
<tr>
<th>Symbol</th>
<th>Meaning</th>
</tr>
</thead>
<tbody>
<tr>
<td>sext(x)</td>
<td>Sign-extend x</td>
</tr>
<tr>
<td>Mem(x)</td>
<td>Memory location at address x</td>
</tr>
<tr>
<td>FSLx</td>
<td>Stream interface x (FSL or AXI)</td>
</tr>
<tr>
<td>LSW(x)</td>
<td>Least Significant Word of x</td>
</tr>
<tr>
<td>isDnz(x)</td>
<td>Floating point: true if x is denormalized</td>
</tr>
<tr>
<td>isInfinite(x)</td>
<td>Floating point: true if x is +\infty or -\infty</td>
</tr>
<tr>
<td>isPosInfinite(x)</td>
<td>Floating point: true if x is +\infty</td>
</tr>
<tr>
<td>isNegInfinite(x)</td>
<td>Floating point: true if x is -\infty</td>
</tr>
<tr>
<td>isNaN(x)</td>
<td>Floating point: true if x is a quiet or signalling NaN</td>
</tr>
<tr>
<td>isZero(x)</td>
<td>Floating point: true if x is +0 or -0</td>
</tr>
<tr>
<td>isQuietNaN(x)</td>
<td>Floating point: true if x is a quiet NaN</td>
</tr>
<tr>
<td>isSigNaN(x)</td>
<td>Floating point: true if x is a signaling NaN</td>
</tr>
<tr>
<td>signZero(x)</td>
<td>Floating point: return +0 for x &gt; 0, and -0 if x &lt; 0</td>
</tr>
<tr>
<td>signInfinite(x)</td>
<td>Floating point: return +\infty for x &gt; 0, and -\infty if x &lt; 0</td>
</tr>
</tbody>
</table>
Formats

MicroBlaze uses two instruction formats: Type A and Type B.

Type A

Type A is used for register-register instructions. It contains the opcode, one destination and two source registers.

<table>
<thead>
<tr>
<th>Opcode</th>
<th>Destination Reg</th>
<th>Source Reg A</th>
<th>Source Reg B</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
<td>21</td>
<td>31</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Type B

Type B is used for register-immediate instructions. It contains the opcode, one destination and one source registers, and a source 16-bit immediate value.

<table>
<thead>
<tr>
<th>Opcode</th>
<th>Destination Reg</th>
<th>Source Reg A</th>
<th>Source Reg B</th>
<th>Immediate Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
<td>31</td>
</tr>
</tbody>
</table>

Instructions

This section provides descriptions of MicroBlaze instructions. Instructions are listed in alphabetical order. For each instruction Xilinx provides the mnemonic, encoding, a description, pseudocode of its semantics, and a list of registers that it modifies.
add  

**Arithmetic Add**

- `add rD, rA, rB`  Add
- `adde rD, rA, rB`  Add with Carry
- `addk rD, rA, rB`  Add and Keep Carry
- `addkc rD, rA, rB`  Add with Carry and Keep Carry

| 0 | 0 | 0 | K | C | 0 | rD | rA | rB | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|---|---|---|---|---|---|----|----|----|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 6 | 1 | 1 | 2 | 3 | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  |

**Description**

The sum of the contents of registers rA and rB, is placed into register rD.

Bit 3 of the instruction (labeled as K in the figure) is set to one for the mnemonic addk. Bit 4 of the instruction (labeled as C in the figure) is set to one for the mnemonic adde. Both bits are set to one for the mnemonic addkc.

When an add instruction has bit 3 set (addk, addkc), the carry flag will Keep its previous value regardless of the outcome of the execution of the instruction. If bit 3 is cleared (add, adde), then the carry flag will be affected by the execution of the instruction.

When bit 4 of the instruction is set to one (adde, addkc), the content of the carry flag (MSR[C]) affects the execution of the instruction. When bit 4 is cleared (add, addk), the content of the carry flag does not affect the execution of the instruction (providing a normal addition).

**Pseudocode**

```plaintext
if C = 0 then
    (rD) \leftarrow (rA) + (rB)
else
    (rD) \leftarrow (rA) + (rB) + MSR[C]
if K = 0 then
    MSR[C] \leftarrow CarryOut
```

**Registers Altered**

- rD
- MSR[C]

**Latency**

1 cycle

**Note**

The C bit in the instruction opcode is not the same as the carry bit in the MSR.

The “add r0, r0, r0” (= 0x00000000) instruction is never used by the compiler and usually indicates uninitialized memory. If you are using illegal instruction exceptions you can trap these instructions by setting the MicroBlaze parameter C_OPCODE_0x0_ILLEGAL=1.
addi

Arithmetic Add Immediate

<table>
<thead>
<tr>
<th>addi</th>
<th>rD, rA, IMM</th>
<th>Add Immediate</th>
</tr>
</thead>
<tbody>
<tr>
<td>addic</td>
<td>rD, rA, IMM</td>
<td>Add Immediate with Carry</td>
</tr>
<tr>
<td>addik</td>
<td>rD, rA, IMM</td>
<td>Add Immediate and Keep Carry</td>
</tr>
<tr>
<td>addikc</td>
<td>rD, rA, IMM</td>
<td>Add Immediate with Carry and Keep Carry</td>
</tr>
</tbody>
</table>

```
\begin{array}{c|c|c|c|c}
0 & 0 & 1 & K & C & 0 \\
0 & 6 & 1 & 1 & & \\
1 & 6 & 1 & 1 & & \\
\end{array}
```

**Description**

The sum of the contents of registers \( rA \) and the value in the IMM field, sign-extended to 32 bits, is placed into register \( rD \). Bit 3 of the instruction (labeled as \( K \) in the figure) is set to one for the mnemonic addik. Bit 4 of the instruction (labeled as \( C \) in the figure) is set to one for the mnemonic addic. Both bits are set to one for the mnemonic addikc.

When an addi instruction has bit 3 set (addik, addikc), the carry flag will keep its previous value regardless of the outcome of the execution of the instruction. If bit 3 is cleared (addi, addic), then the carry flag will be affected by the execution of the instruction.

When bit 4 of the instruction is set to one (addic, addikc), the content of the carry flag (MSR[\( C \)]) affects the execution of the instruction. When bit 4 is cleared (addi, addik), the content of the carry flag does not affect the execution of the instruction (providing a normal addition).

**Pseudocode**

```plaintext
if C = 0 then
   (rD) \leftarrow (rA) + \text{sext}(\text{IMM})
else
   (rD) \leftarrow (rA) + \text{sext}(\text{IMM}) + \text{MSR}[C]
if K = 0 then
   \text{MSR}[C] \leftarrow \text{CarryOut}
```

**Registers Altered**

- \( rD \)
- \( \text{MSR}[C] \)

**Latency**

1 cycle

**Notes**

The C bit in the instruction opcode is not the same as the carry bit in the MSR.

By default, Type B Instructions take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
**and**

**Logical AND**

\[ \text{and} \quad rD, rA, rB \]

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>2</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>6</td>
<td>3</td>
</tr>
</tbody>
</table>

**Description**

The contents of register \( rA \) are ANDed with the contents of register \( rB \); the result is placed into register \( rD \).

**Pseudocode**

\[(rD) \leftarrow (rA) \land (rB)\]

**Registers Altered**

- \( rD \)

**Latency**

1 cycle
### andi

**Logical AND with Immediate**

```
andi rD, rA, IMM
```

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>6</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>3</td>
</tr>
</tbody>
</table>

#### Description

The contents of register `rA` are ANDed with the value of the `IMM` field, sign-extended to 32 bits; the result is placed into register `rD`.

#### Pseudocode

```
(rD) ← (rA) ∧ sext(IMM)
```

#### Registers Altered

- `rD`

#### Latency

1 cycle

#### Note

By default, Type B Instructions will take the 16-bit `IMM` field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an `imm` instruction. See the instruction “`imm`,” page 198 for details on using 32-bit immediate values.
**andn**

**Logical AND NOT**

\[
\text{andn} \quad r_D, r_A, r_B
\]

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>2</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
</tr>
</tbody>
</table>

**Description**

The contents of register rA are ANDed with the logical complement of the contents of register rB; the result is placed into register rD.

**Pseudocode**

\[
(r_D) \leftarrow (r_A) \land (\bar{r_B})
\]

**Registers Altered**

- rD

**Latency**

1 cycle
andni 

Logical AND NOT with Immediate

\[
\text{andni} \quad rD, rA, \text{IMM}
\]

<table>
<thead>
<tr>
<th>1</th>
<th>0</th>
<th>1</th>
<th>0</th>
<th>1</th>
<th>rD</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>3</td>
<td>1</td>
<td>6</td>
<td>1</td>
</tr>
</tbody>
</table>

Description

The IMM field is sign-extended to 32 bits. The contents of register rA are ANDed with the logical complement of the extended IMM field; the result is placed into register rD.

Pseudocode

\[
(rD) \leftarrow (rA) \land (\text{sext(IMM)})
\]

Registers Altered

- rD

Latency

1 cycle

Note

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
beq

Branch if Equal

beq rA, rB Branch if Equal
beqd rA, rB Branch if Equal with Delay

```
 1 0 0 1 1 1  D 0 0 0 0 |  rA |  rB | 0 0 0 0 0 0 0 0 0 0 0
   0     6       1  1 2  |  3
   1     1       6  1 1 1
```

Description

Branch if rA is equal to 0, to the instruction located in the offset value of rB. The target of the branch will be the instruction at address PC + rB.

The mnemonic beqd will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

Pseudocode

```
If rA = 0 then
  PC ← PC + rB
else
  PC ← PC + 4
if D = 1 then
  allow following instruction to complete execution
```

Registers Altered

- PC

Latency

1 cycle (if branch is not taken)
2 cycles (if branch is taken and the D bit is set)
3 cycles (if branch is taken and the D bit is not set)

Note

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
beqi

Branch Immediate if Equal

beqi rA, IMM Branch Immediate if Equal
beqid rA, IMM Branch Immediate if Equal with Delay

<table>
<thead>
<tr>
<th>1</th>
<th>0</th>
<th>1</th>
<th>1</th>
<th>1</th>
<th>D</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>3</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Description

Branch if rA is equal to 0, to the instruction located in the offset value of IMM. The target of the branch will be the instruction at address PC + IMM.

The mnemonic beqid will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

Pseudocode

If rA = 0 then
   PC ← PC + sext(IMM)
else
   PC ← PC + 4
if D = 1 then
   allow following instruction to complete execution

Registers Altered

- PC

Latency

1 cycle (if branch is not taken, or successful branch prediction occurs)
2 cycles (if branch is taken and the D bit is set)
3 cycles (if branch is taken and the D bit is not set, or a branch prediction mispredict occurs)

Note

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
Branch if Greater or Equal

**bge**  
**bged**

### Description

Branch if rA is greater or equal to 0, to the instruction located in the offset value of rB. The target of the branch will be the instruction at address PC + rB.

The mnemonic `bged` will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

### Pseudocode

```
If rA >= 0 then
    PC ← PC + rB
else
    PC ← PC + 4
if D = 1 then
    allow following instruction to complete execution
```

### Registers Altered

- PC

### Latency

- 1 cycle (if branch is not taken)
- 2 cycles (if branch is taken and the D bit is set)
- 3 cycles (if branch is taken and the D bit is not set)

### Note

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
**bgei**  
**Branch Immediate if Greater or Equal**

![Instruction Format](101111D0101 r A I M M)

<table>
<thead>
<tr>
<th>D</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>3</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>3</td>
</tr>
</tbody>
</table>

**Description**

Branch if rA is greater or equal to 0, to the instruction located in the offset value of IMM. The target of the branch will be the instruction at address PC + IMM.

The mnemonic bgeid will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

**Pseudocode**

If rA >= 0 then
   PC ← PC + sext(IMM)
else
   PC ← PC + 4
   if D = 1 then
      allow following instruction to complete execution

**Registers Altered**

- PC

**Latency**

- 1 cycle (if branch is not taken, or successful branch prediction occurs)
- 2 cycles (if branch is taken and the D bit is set)
- 3 cycles (if branch is taken and the D bit is not set, or a branch prediction mispredict occurs)

**Note**

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
Chapter 5: MicroBlaze Instruction Set Architecture

bgt

Branch if Greater Than

bgt rA, rB Branch if Greater Than
bgtd rA, rB Branch if Greater Than with Delay

<table>
<thead>
<tr>
<th>1 0 0 1 1 1</th>
<th>D 0 1 0 0</th>
<th>rA</th>
<th>rB</th>
<th>0 0 0 0 0 0 0 0 0 0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>1</td>
<td></td>
<td>3</td>
</tr>
</tbody>
</table>

Description

Branch if rA is greater than 0, to the instruction located in the offset value of rB. The target of the branch will be the instruction at address PC + rB.

The mnemonic bgtd will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

Pseudocode

If rA > 0 then
    PC ← PC + rB
else
    PC ← PC + 4
if D = 1 then
    allow following instruction to complete execution

Registers Altered

- PC

Latency

- 1 cycle (if branch is not taken)
- 2 cycles (if branch is taken and the D bit is set)
- 3 cycles (if branch is taken and the D bit is not set)

Note

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
**bgti**

**Branch Immediate if Greater Than**

| bgti | rA, IMM | Branch Immediate if Greater Than |
| bgtid | rA, IMM | Branch Immediate if Greater Than with Delay |

<table>
<thead>
<tr>
<th>1 0 1 1 1 1</th>
<th>D 0 1 0 0</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0 6 1 1 1</td>
<td>0 1</td>
<td>1 6</td>
<td></td>
</tr>
<tr>
<td>1 3</td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

**Description**

Branch if rA is greater than 0, to the instruction located in the offset value of IMM. The target of the branch will be the instruction at address PC + IMM.

The mnemonic bgtid will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

**Pseudocode**

If rA > 0 then
  PC ← PC + sext(IMM)
else
  PC ← PC + 4
if D = 1 then
  allow following instruction to complete execution

**Registers Altered**

- PC

**Latency**

- 1 cycle (if branch is not taken, or successful branch prediction occurs)
- 2 cycles (if branch is taken and the D bit is set)
- 3 cycles (if branch is taken and the D bit is not set, or a branch prediction mispredict occurs)

**Note**

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
ble

**Branch if Less or Equal**

```
ble      rA, rB       Branch if Less or Equal
bled     rA, rB       Branch if Less or Equal with Delay
```

<table>
<thead>
<tr>
<th>100111</th>
<th>D 0 0 1 1</th>
<th>rA</th>
<th>rB</th>
<th>0 0 0 0 0 0 0 0 0</th>
<th>3</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td></td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td></td>
</tr>
</tbody>
</table>

**Description**

Branch if rA is less or equal to 0, to the instruction located in the offset value of rB. The target of the branch will be the instruction at address PC + rB.

The mnemonic bled will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

**Pseudocode**

```
If rA <= 0 then
    PC ← PC + rB
else
    PC ← PC + 4
if D = 1 then
    allow following instruction to complete execution
```

**Registers Altered**

- PC

**Latency**

- 1 cycle (if branch is not taken)
- 2 cycles (if branch is taken and the D bit is set)
- 3 cycles (if branch is taken and the D bit is not set)

**Note**

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
blei

Branch Immediate if Less or Equal

blei \ rA, IMM Branch Immediate if Less or Equal
bleid \ rA, IMM Branch Immediate if Less or Equal with Delay

<table>
<thead>
<tr>
<th>1 0 1 1 1 1</th>
<th>D 0 0 1 1</th>
<th>\ rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>\ rA</td>
<td>1</td>
<td>6</td>
<td>3</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>6</td>
<td>1</td>
</tr>
</tbody>
</table>

Description

Branch if rA is less or equal to 0, to the instruction located in the offset value of IMM. The target of the branch will be the instruction at address PC + IMM.

The mnemonic bleid will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

Pseudocode

If rA <= 0 then
    PC <- PC + sext(IMM)
elser
    PC <- PC + 4
if D = 1 then
    allow following instruction to complete execution

Registers Altered

- PC

Latency

- 1 cycle (if branch is not taken, or successful branch prediction occurs)
- 2 cycles (if branch is taken and the D bit is set)
- 3 cycles (if branch is taken and the D bit is not set, or a branch prediction mispredict occurs)

Note

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
**blt**

**Branch if Less Than**

|  1 0 0 1 1 1 | D 0 0 1 0 | rA | rB | 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
<table>
<thead>
<tr>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
</tr>
</tbody>
</table>

**Description**

Branch if rA is less than 0, to the instruction located in the offset value of rB. The target of the branch will be the instruction at address PC + rB.

The mnemonic bltd will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

**Pseudocode**

If rA < 0 then
  PC ← PC + rB
else
  PC ← PC + 4
if D = 1 then
  allow following instruction to complete execution

**Registers Altered**

- PC

**Latency**

- 1 cycle (if branch is not taken)
- 2 cycles (if branch is taken and the D bit is set)
- 3 cycles (if branch is taken and the D bit is not set)

**Note**

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
**blti**

**Branch Immediate if Less Than**

<table>
<thead>
<tr>
<th>blti</th>
<th>rA, IMM</th>
<th>Branch Immediate if Less Than</th>
</tr>
</thead>
<tbody>
<tr>
<td>bltid</td>
<td>rA, IMM</td>
<td>Branch Immediate if Less Than with Delay</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>D 0 0 1 0</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0 6 1 1 1</td>
<td>1</td>
<td>6</td>
</tr>
<tr>
<td>1 1 1 1 0</td>
<td>3</td>
<td>1</td>
</tr>
</tbody>
</table>

**Description**

Branch if rA is less than 0, to the instruction located in the offset value of IMM. The target of the branch will be the instruction at address PC + IMM.

The mnemonic bltid will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

**Pseudocode**

```plaintext
If rA < 0 then
  PC ← PC + sext(IMM)
else
  PC ← PC + 4
if D = 1 then
  allow following instruction to complete execution
```

**Registers Altered**

- PC

**Latency**

- 1 cycle (if branch is not taken, or successful branch prediction occurs)
- 2 cycles (if branch is taken and the D bit is set)
- 3 cycles (if branch is taken and the D bit is not set, or a branch prediction mispredict occurs)

**Note**

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
bne

Branch if Not Equal

bne rA, rB Branch if Not Equal
bned rA, rB Branch if Not Equal with Delay

| 1 0 0 1 1 1 | D 0 0 0 1 | rA | rB | 0 0 0 0 0 0 0 0 0 | 0 6 1 2 3 1 6 1 1 |

Description

Branch if rA not equal to 0, to the instruction located in the offset value of rB. The target of the branch will be the instruction at address PC + rB.

The mnemonic bned will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

Pseudocode

If rA ≠ 0 then
   PC ← PC + rB
else
   PC ← PC + 4
if D = 1 then
   allow following instruction to complete execution

Registers Altered

- PC

Latency

1 cycle (if branch is not taken)
2 cycles (if branch is taken and the D bit is set)
3 cycles (if branch is taken and the D bit is not set)

Note

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
bnei

Branch Immediate if Not Equal

\[
\text{bnei} \quad \text{rA, IMM} \quad \text{Branch Immediate if Not Equal}
\]

\[
\text{bneid} \quad \text{rA, IMM} \quad \text{Branch Immediate if Not Equal with Delay}
\]

<table>
<thead>
<tr>
<th>1 0 1 1 1 1</th>
<th>D 0 0 0 1</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td></td>
<td>6</td>
<td>3</td>
</tr>
</tbody>
</table>

Description

Branch if rA not equal to 0, to the instruction located in the offset value of IMM. The target of the branch will be the instruction at address PC + IMM.

The mnemonic bneid will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

Pseudocode

\[
\text{If rA} \neq 0 \text{ then}
\]
\[
\quad \text{PC} \leftarrow \text{PC} + \text{sext( IMM)}
\]
\[
\text{else}
\]
\[
\quad \text{PC} \leftarrow \text{PC} + 4
\]
\[
\text{if D} = 1 \text{ then}
\]
\[
\quad \text{allow following instruction to complete execution}
\]

Registers Altered

- PC

Latency

- 1 cycle (if branch is not taken, or successful branch prediction occurs)
- 2 cycles (if branch is taken and the D bit is set)
- 3 cycles (if branch is taken and the D bit is not set, or a branch prediction mispredict occurs)

Note

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
Chapter 5: MicroBlaze Instruction Set Architecture

Unconditional Branch

| mnemonic | rD | D | A | L | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| br       | 0  | 1 | 0 | 1 | 1 | 0 | rD | 1 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| bra      | 0  | 1 | 0 | 1 | 1 | 0 | rB | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| brd      | 0  | 1 | 0 | 1 | 1 | 0 | rB | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| bra d    | 0  | 1 | 0 | 1 | 1 | 0 | rB | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| brd      | 0  | 1 | 0 | 1 | 1 | 0 | rB | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| bra d    | 0  | 1 | 0 | 1 | 1 | 0 | rB | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |

Description

Branch to the instruction located at address determined by rB.

The mnemonics brld and brald will set the L bit. If the L bit is set, linking will be performed. The current value of PC will be stored in rD.

The mnemonics bra, brad and brald will set the A bit. If the A bit is set, it means that the branch is to an absolute value and the target is the value in rB, otherwise, it is a relative branch and the target will be PC + rB.

The mnemonics brd, brad, brld and brald will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction.

If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

Pseudocode

```plaintext
if L = 1 then
    (rD) ← PC
if A = 1 then
    PC ← (rB)
else
    PC ← PC + (rB)
if D = 1 then
    allow following instruction to complete execution
```

Registers Altered

- rD
- PC

Latency

- 2 cycles (if the D bit is set)
- 3 cycles (if the D bit is not set)
Note

The instructions brl and bral are not available. A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
### bri

**Unconditional Branch Immediate**

<table>
<thead>
<tr>
<th>bri</th>
<th>IMM</th>
<th>Branch Immediate</th>
</tr>
</thead>
<tbody>
<tr>
<td>brai</td>
<td>IMM</td>
<td>Branch Absolute Immediate</td>
</tr>
<tr>
<td>brid</td>
<td>IMM</td>
<td>Branch Immediate with Delay</td>
</tr>
<tr>
<td>braid</td>
<td>IMM</td>
<td>Branch Absolute Immediate with Delay</td>
</tr>
<tr>
<td>brlid</td>
<td>rD, IMM</td>
<td>Branch and Link Immediate with Delay</td>
</tr>
<tr>
<td>bralid</td>
<td>rD, IMM</td>
<td>Branch Absolute and Link Immediate with Delay</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>1 0 1 1 1 0</th>
<th>rD</th>
<th>D A L 0 0</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0 6 1 1 1 0</td>
<td></td>
<td>1 1</td>
<td>3</td>
</tr>
<tr>
<td>1 1 6 1 1 0</td>
<td></td>
<td></td>
<td>1</td>
</tr>
</tbody>
</table>

**Description**

Branch to the instruction located at address determined by IMM, sign-extended to 32 bits.

The mnemonics brlid and bralid will set the L bit. If the L bit is set, linking will be performed. The current value of PC will be stored in rD.

The mnemonics brai, braid and bralid will set the A bit. If the A bit is set, it means that the branch is to an absolute value and the target is the value in IMM, otherwise, it is a relative branch and the target will be PC + IMM.

The mnemonics brid, braid, brlid and bralid will set the D bit. The D bit determines whether there is a branch delay slot or not. If the D bit is set, it means that there is a delay slot and the instruction following the branch (that is, in the branch delay slot) is allowed to complete execution before executing the target instruction. If the D bit is not set, it means that there is no delay slot, so the instruction to be executed after the branch is the target instruction.

As a special case, when MicroBlaze is configured to use an MMU (C_USE_MMU >= 1) and “bralid rD, C_BASE_VECTORS+0x8” is used to perform a User Vector Exception, the Machine Status Register bits User Mode and Virtual Mode are cleared.

**Pseudocode**

```plaintext
if L = 1 then
    (rD) ← PC
if A = 1 then
    PC ← sext(IMM)
else
    PC ← PC + sext(IMM)
if D = 1 then
    allow following instruction to complete execution
if D = 1 and A = 1 and L = 1 and IMM = C_BASE_VECTORS+0x8 then
    MSR[UMS] ← MSR[UM]
    MSR[VMS] ← MSR[VM]
    MSR[UM] ← 0
    MSR[VM] ← 0
```
Registers Altered

- rD
- PC
- MSR[UM], MSR[VM]

Latency

- 1 cycle (if successful branch prediction occurs)
- 2 cycles (if the D bit is set)
- 3 cycles (if the D bit is not set, or a branch prediction mispredict occurs)

Notes

The instructions brli and brali are not available.

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
**brk**

Break

\[ \text{brk rD, rB} \]

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th></th>
<th>rB</th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
<td>21</td>
<td>31</td>
<td></td>
</tr>
</tbody>
</table>

### Description

Branch and link to the instruction located at address value in \( rB \). The current value of \( PC \) will be stored in \( rD \). The BIP flag in the MSR will be set, and the reservation bit will be cleared.

When MicroBlaze is configured to use an MMU (\( \text{C\_USE\_MMU} \geq 1 \)) this instruction is privileged. This means that if the instruction is attempted in User Mode (\( \text{MSR}[\text{UM}] = 1 \)) a Privileged Instruction exception occurs.

### Pseudocode

\[
\text{if MSR}[\text{UM}] = 1 \text{ then}
\quad \text{ESR}[\text{EC}] \leftarrow 00111 \\
\text{else}
\quad (rD) \leftarrow PC \\
\quad PC \leftarrow (rB) \\
\quad \text{MSR}[\text{BIP}] \leftarrow 1 \\
\quad \text{Reservation} \leftarrow 0
\]

### Registers Altered

- \( rD \)
- \( PC \)
- \( \text{MSR}[\text{BIP}] \)
- \( \text{ESR}[\text{EC}] \), in case a privileged instruction exception is generated

### Latency

- 3 cycles
Instructions

brki

Break Immediate

brki  rD, IMM

<table>
<thead>
<tr>
<th>1 0 1 1 1 0</th>
<th>rD</th>
<th>0 1 1 0 0</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
</tr>
</tbody>
</table>

Description

Branch and link to the instruction located at address value in IMM, sign-extended to 32 bits. The current value of PC will be stored in rD. The BIP flag in the MSR will be set, and the reservation bit will be cleared.

When MicroBlaze is configured to use an MMU (C_USE_MPU >= 1) this instruction is privileged, except as a special case when “brki rD, C_BASE_VECTORS+0x8” or “brki rD, C_BASE_VECTORS+0x18” is used to perform a Software Break. This means that, apart from the special case, if the instruction is attempted in User Mode (MSR[UM] = 1) a Privileged Instruction exception occurs.

As a special case, when MicroBlaze is configured to use an MMU (C_USE_MPU >= 1) and “brki rD, C_BASE_VECTORS+0x8” or “brki rD, C_BASE_VECTORS+0x18” is used to perform a Software Break, the Machine Status Register bits User Mode and Virtual Mode are cleared.

Pseudocode

if MSR[UM] and IMM ≠ C_BASE_VECTORS+0x8 and IMM ≠ C_BASE_VECTORS+0x18 then
    ESR[EC] ← 00111
else
    (rD) ← PC
    PC ← sext(IMM)
    MSR[BIP] ← 1
    Reservation ← 0
    if IMM = C_BASE_VECTORS+0x8 or IMM = C_BASE_VECTORS+0x18 then
        MSR[UMS] ← MSR[UM] MSR[UM] ← 0
        MSR[VMS] ← MSR[VM] MSR[VM] ← 0

Registers Altered

- rD, unless an exception is generated, in which case the register is unchanged
- PC
- MSR[BIP], MSR[UM], MSR[VM]
- ESR[EC], in case a privileged instruction exception is generated

Latency

- 3 cycles

Note

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.

As a special case, the imm instruction does not override a Software Break “brki rD, 0x18” when C_USE_DEBUG is set, irrespective of the value of C_BASE_VECTORS, to allow Software Break after an imm instruction.
**Barrel Shift**

<table>
<thead>
<tr>
<th>bs</th>
<th>barrel shift</th>
<th>rD, rA, rB</th>
<th>barrel shift right logical</th>
</tr>
</thead>
<tbody>
<tr>
<td>bsr</td>
<td>rD, rA, rB</td>
<td>Barrel Shift Right Logical</td>
<td></td>
</tr>
<tr>
<td>bsra</td>
<td>rD, rA, rB</td>
<td>Barrel Shift Right Arithmetical</td>
<td></td>
</tr>
<tr>
<td>bsll</td>
<td>rD, rA, rB</td>
<td>Barrel Shift Left Logical</td>
<td></td>
</tr>
</tbody>
</table>

**Description**

Shifts the contents of register rA by the amount specified in register rB and puts the result in register rD.

The mnemonic bsll sets the S bit (Side bit). If the S bit is set, the barrel shift is done to the left. The mnemonics bsr and bsra clear the S bit and the shift is done to the right.

The mnemonic bsra will set the T bit (Type bit). If the T bit is set, the barrel shift performed is Arithmetical. The mnemonics bsr and bsll clear the T bit and the shift performed is Logical.

**Pseudocode**

```plaintext
if S = 1 then
    (rD) ← (rA) << (rB)[27:31]
else
    if T = 1 then
        if ((rB)[27:31]) ≠ 0 then
            (rD)[0:(rB)[27:31]-1] ← (rA)[0]
            (rD)[(rB)[27:31]:31] ← (rA) >> (rB)[27:31]
        else
            (rD) ← (rA)
    else
        (rD) ← (rA) >> (rB)[27:31]
```

**Registers Altered**

- rD

**Latency**

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

**Note**

These instructions are optional. To use them, MicroBlaze has to be configured to use barrel shift instructions (C_USE_BARREL=1).
Instructions

bsi

Barrel Shift Immediate

bsrlf rD, rA, IMM  Barrel Shift Right Logical Immediate
bsra rD, rA, IMM  Barrel Shift Right Arithmetical Immediate
bslli rD, rA, IMM  Barrel Shift Left Logical Immediate

Description

Shifts the contents of register rA by the amount specified by IMM and puts the result in register rD.

The mnemonic bsll sets the S bit (Side bit). If the S bit is set, the barrel shift is done to the left. The mnemonics bsrl and bsra clear the S bit and the shift is done to the right.

The mnemonic bsra will set the T bit (Type bit). If the T bit is set, the barrel shift performed is Arithmetical. The mnemonics bsrl and bsll clear the T bit and the shift performed is Logical.

Pseudocode

\[
\begin{align*}
\text{if } S = 1 \text{ then} \\
(rD) & \leftarrow (rA) \ll \text{IMM} \\
\text{else} \\
\text{if } T = 1 \text{ then} \\
\text{if } \text{IMM} \neq 0 \text{ then} \\
(rD)[0:\text{IMM}-1] & \leftarrow (rA)[0] \\
(rD)[\text{IMM}:31] & \leftarrow (rA) \gg \text{IMM} \\
\text{else} \\
(rD) & \leftarrow (rA) \\
\text{else} \\
(rD) & \leftarrow (rA) \gg \text{IMM}
\end{align*}
\]

Registers Altered

- rD

Latency

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

Notes

These are not Type B Instructions. There is no effect from a preceding imm instruction.

These instructions are optional. To use them, MicroBlaze has to be configured to use barrel shift instructions (C_USE_BARREL=1).
clz

**Count Leading Zeros**

```
clz  rD, rA  Count leading zeros in rA
```

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td>1</td>
</tr>
</tbody>
</table>

**Description**

This instruction counts the number of leading zeros in register rA starting from the most significant bit. The result is a number between 0 and 32, stored in register rD.

The result in rD is 32 when rA is 0, and it is 0 if rA is 0xFFFFFFFF.

**Pseudocode**

```
n ← 0
while (rA)[n] = 0
    n ← n + 1
(rD) ← n
```

**Registers Altered**

- rD

**Latency**

- 1 cycle

**Notes**

This instruction is only available when the parameter C_USE_PCMP_INSTR is set to 1.
**cmp**  

**Integer Compare**

```
cmp     rD, rA, rB    compare rB with rA (signed)
cmpu    rD, rA, rB    compare rB with rA (unsigned)
```

<table>
<thead>
<tr>
<th>0 0 0 1 0 1</th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
<th>0 0 0 0 0 0 0 0 U 1</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
</tr>
</tbody>
</table>

**Description**

The contents of register rA is subtracted from the contents of register rB and the result is placed into register rD.

The MSB bit of rD is adjusted to shown true relation between rA and rB. If the U bit is set, rA and rB is considered unsigned values. If the U bit is clear, rA and rB is considered signed values.

**Pseudocode**

```
(rD) ← (rB) + (rA) + 1
(rD) (MSB) ← (rA) > (rB)
```

**Registers Altered**

- rD

**Latency**

- 1 cycle
Chapter 5: MicroBlaze Instruction Set Architecture

**fadd**

**Floating Point Arithmetic Add**

```
fadd      rD, rA, rB      Add
```

<table>
<thead>
<tr>
<th>0</th>
<th>1</th>
<th>0</th>
<th>1</th>
<th>1</th>
<th>0</th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
<td>21</td>
<td>31</td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

**Description**

The floating point sum of registers rA and rB, is placed into register rD.

**Pseudocode**

```plaintext
if isDnz(rA) or isDnz(rB) then
  (rD) ← 0xFFC00000
  FSR[DO] ← 1
  ESR[EC] ← 00110
else if isSigNaN(rA) or isSigNaN(rB) or
  (isPosInfinite(rA) and isNegInfinite(rB)) or
  (isNegInfinite(rA) and isPosInfinite(rB)) then
  (rD) ← 0xFFC00000
  FSR[IO] ← 1
  ESR[EC] ← 00110
else if isQuietNaN(rA) or isQuietNaN(rB) then
  (rD) ← 0xFFC00000
else if isDnz((rA)+(rB)) then
  (rD) ← signZero((rA)+(rB))
  FSR[UF] ← 1
  ESR[EC] ← 00110
else if isNaN((rA)+(rB)) then
  (rD) ← signInfinite((rA)+(rB))
  FSR[OF] ← 1
  ESR[EC] ← 00110
else
  (rD) ← (rA) + (rB)
```

**Registers Altered**

- rD, unless an FP exception is generated, in which case the register is unchanged
- ESR[EC], if an FP exception is generated
- FSR[IO,UF,OF,DO]

**Latency**

- 4 cycles with C_AREA_OPTIMIZED=0
- 6 cycles with C_AREA_OPTIMIZED=1

**Note**

This instruction is only available when the MicroBlaze parameter C_USE_FPU is greater than 0.
frsub

Reverse Floating Point Arithmetic Subtraction

frsub rD, rA, rB Reverse subtract

<table>
<thead>
<tr>
<th>0 1 0 1 1 0</th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
<th>0 0 0 1 0 0 0 0 0 0 0 0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
<td>21</td>
</tr>
<tr>
<td>31</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Description

The floating point value in rA is subtracted from the floating point value in rB and the result is placed into register rD.

Pseudocode

if isDnz(rA) or isDnz(rB) then
  (rD) ← 0xFFC00000
  FSR[DO] ← 1
  ESR[EC] ← 00110
else if (isSigNaN(rA) or isSigNaN(rB) or
  (isPosInfinite(rA) and isPosInfinite(rB)) or
  (isNegInfinite(rA) and isNegInfinite(rB))) then
  (rD) ← 0xFFC00000
  FSR[IO] ← 1
  ESR[EC] ← 00110
else if isQuietNaN(rA) or isQuietNaN(rB) then
  (rD) ← 0xFFC00000
else if isDnz((rB)-(rA)) then
  (rD) ← signZero((rB)-(rA))
  FSR[UF] ← 1
  ESR[EC] ← 00110
else if isNaN((rB)-(rA)) then
  (rD) ← signInfinite((rB)-(rA))
  FSR[OF] ← 1
  ESR[EC] ← 00110
else
  (rD) ← (rB) - (rA)

Registers Altered

- rD, unless an FP exception is generated, in which case the register is unchanged
- ESR[EC], if an FP exception is generated
- FSR[IO,UF,OF,DO]

Latency

- 4 cycles with C_AREA_OPTIMIZED=0
- 6 cycles with C_AREA_OPTIMIZED=1

Note

This instruction is only available when the MicroBlaze parameter C_USE_FPU is greater than 0.
The floating point value in rA is multiplied with the floating point value in rB and the result is placed into register rD.

Pseudocode

```plaintext
if isDnz(rA) or isDnz(rB) then
  (rD) ← 0xFFC00000
  FSR[DO] ← 1
  ESR[EC] ← 00110
else
  if isSigNaN(rA) or isSigNaN(rB) or (isZero(rA) and isInfinite(rB)) or
    (isZero(rB) and isInfinite(rA)) then
    (rD) ← 0xFFC00000
    FSR[IO] ← 1
    ESR[EC] ← 00110
  else if isQuietNaN(rA) or isQuietNaN(rB) then
    (rD) ← 0xFFC00000
  else if isDnz((rB)*(rA)) then
    (rD) ← signZero((rA)*(rB))
    FSR[UF] ← 1
    ESR[EC] ← 00110
  else if isNaN((rB)*(rA)) then
    (rD) ← signInfinite((rB)*(rA))
    FSR[OF] ← 1
    ESR[EC] ← 00110
  else
    (rD) ← (rB) * (rA)
```

Registers Altered
- rD, unless an FP exception is generated, in which case the register is unchanged
- ESR[EC], if an FP exception is generated
- FSR[IO,UF,OF,DO]

Latency
- 4 cycles with C_AREA_OPTIMIZED=0
- 6 cycles with C_AREA_OPTIMIZED=1

Note
This instruction is only available when the MicroBlaze parameter C_USE_FPU is greater than 0.
Instructions

fdiv
Floating Point Arithmetic Division

fdiv rD, rA, rB Divide

Description
The floating point value in rB is divided by the floating point value in rA and the result is placed into register rD.

Pseudocode

if isDnz(rA) or isDnz(rB) then
    (rD) \leftarrow 0xFFC00000
    FSR[DO] \leftarrow 1
    ESR[EC] \leftarrow 00110
else
    if isSigNaN(rA) or isSigNaN(rB) or (isZero(rA) and isZero(rB)) or
       (isInfinite(rA) and isInfinite(rB)) then
        (rD) \leftarrow 0xFFC00000
        FSR[IO] \leftarrow 1
        ESR[EC] \leftarrow 00110
    else if isQuietNaN(rA) or isQuietNaN(rB) then
        (rD) \leftarrow \text{signInfinite}((rB)/(rA))
        FSR[DZ] \leftarrow 1
        ESR[EC] \leftarrow 00110
    else if isZero(rA) and not isInfinite(rB) then
        (rD) \leftarrow \text{signZero}((rB)/(rA))
        FSR[UF] \leftarrow 1
        ESR[EC] \leftarrow 00110
    else if isDnz((rB)/(rA)) then
        (rD) \leftarrow \text{signNaN}((rB)/(rA))
        FSR[OF] \leftarrow 1
        ESR[EC] \leftarrow 00110
    else if isNaN((rB)/(rA)) then
        (rD) \leftarrow \text{signInfinite}((rB)/(rA))
        FSR[OF] \leftarrow 1
        ESR[EC] \leftarrow 00110
    else
        (rD) \leftarrow (rB) / (rA)

Registers Altered
- rD, unless an FP exception is generated, in which case the register is unchanged
- ESR[EC], if an FP exception is generated
- FSR[IO,UF,OF,DO,DZ]

Latency
- 28 cycles with C\_AREA\_OPTIMIZED=0, 30 cycles with C\_AREA\_OPTIMIZED=1

Note
This instruction is only available when the MicroBlaze parameter C\_USE\_FPU is greater than 0.
**fcmp**

**Floating Point Number Comparison**

- *fcmp.un* $rD, rA, rB$  
  Unordered floating point comparison
- *fcmp.lt* $rD, rA, rB$  
  Less-than floating point comparison
- *fcmp.eq* $rD, rA, rB$  
  Equal floating point comparison
- *fcmp.le* $rD, rA, rB$  
  Less-or-Equal floating point comparison
- *fcmp.gt* $rD, rA, rB$  
  Greater-than floating point comparison
- *fcmp.ne* $rD, rA, rB$  
  Not-Equal floating point comparison
- *fcmp.ge* $rD, rA, rB$  
  Greater-or-Equal floating point comparison

<table>
<thead>
<tr>
<th>0</th>
<th>1</th>
<th>0</th>
<th>1</th>
<th>1</th>
<th>0</th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
<th>0</th>
<th>1</th>
<th>0</th>
<th>0</th>
<th>OpSel</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
<td>21</td>
<td>25</td>
<td>28</td>
<td>31</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

**Description**

The floating point value in $rB$ is compared with the floating point value in $rA$ and the comparison result is placed into register $rD$. The OpSel field in the instruction code determines the type of comparison performed.

**Pseudocode**

```plaintext
if isDnz(rA) or isDnz(rB) then
    (rD) ← 0
    FSR[DO] ← 1
    ESR[EC] ← 00110
else
    {read out behavior from Table 5-2}
```

**Registers Altered**

- $rD$, unless an FP exception is generated, in which case the register is unchanged
- ESR[EC], if an FP exception is generated
- FSR[IO,DO]

**Latency**

- 1 cycle with C_AREA_OPTIMIZED=0
- 3 cycles with C_AREA_OPTIMIZED=1

**Note**

These instructions are only available when the MicroBlaze parameter C_USE_FPU is greater than 0.

Table 5-2, page 189 lists the floating point comparison operations.
### Table 5-2: Floating Point Comparison Operation

<table>
<thead>
<tr>
<th>Comparison Type</th>
<th>Operand Relationship</th>
<th>Description</th>
<th>OpSel</th>
<th>(rB) &gt; (rA)</th>
<th>(rB) &lt; (rA)</th>
<th>(rB) = (rA)</th>
<th>isSigNaN(rA) or isSigNaN(rB)</th>
<th>isQuietNaN(rA) or isQuietNaN(rB)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Unordered</td>
<td></td>
<td></td>
<td>000</td>
<td>(rD) ← 0</td>
<td>(rD) ← 0</td>
<td>(rD) ← 0</td>
<td>(rD) ← 1</td>
<td>FSR[IO] ← 0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>ESR[EC] ← 00110</td>
<td></td>
</tr>
<tr>
<td>Less-than</td>
<td></td>
<td></td>
<td>001</td>
<td>(rD) ← 0</td>
<td>(rD) ← 1</td>
<td>(rD) ← 0</td>
<td>(rD) ← 0</td>
<td>FSR[IO] ← 0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>ESR[EC] ← 00110</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>ESR[EC] ← 00110</td>
<td></td>
</tr>
<tr>
<td>Equal</td>
<td></td>
<td></td>
<td>010</td>
<td>(rD) ← 0</td>
<td>(rD) ← 0</td>
<td>(rD) ← 1</td>
<td>(rD) ← 0</td>
<td>FSR[IO] ← 0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>ESR[EC] ← 00110</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>rD ← 0</td>
</tr>
<tr>
<td>Less-or-equal</td>
<td></td>
<td></td>
<td>011</td>
<td>(rD) ← 0</td>
<td>(rD) ← 1</td>
<td>(rD) ← 1</td>
<td>(rD) ← 0</td>
<td>FSR[IO] ← 0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>ESR[EC] ← 00110</td>
<td></td>
</tr>
<tr>
<td>Greater-than</td>
<td></td>
<td></td>
<td>100</td>
<td>(rD) ← 1</td>
<td>(rD) ← 0</td>
<td>(rD) ← 0</td>
<td>(rD) ← 0</td>
<td>FSR[IO] ← 0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>ESR[EC] ← 00110</td>
<td></td>
</tr>
<tr>
<td>Not-equal</td>
<td></td>
<td></td>
<td>101</td>
<td>(rD) ← 1</td>
<td>(rD) ← 1</td>
<td>(rD) ← 0</td>
<td>(rD) ← 1</td>
<td>FSR[IO] ← 0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>ESR[EC] ← 00110</td>
<td></td>
</tr>
<tr>
<td>Greater-or-equal</td>
<td></td>
<td></td>
<td>110</td>
<td>(rD) ← 1</td>
<td>(rD) ← 0</td>
<td>(rD) ← 1</td>
<td>(rD) ← 0</td>
<td>FSR[IO] ← 0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td>ESR[EC] ← 00110</td>
<td></td>
</tr>
</tbody>
</table>
flt

**Floating Point Convert Integer to Float**

```plaintext
flt rD, rA
```

| 0 1 0 1 1 0 | 0 1 0 1 0 0 0 0 0 0 |
| 0 6 11 16 21 31 |

**Description**

Converts the signed integer in register rA to floating point and puts the result in register rD. This is a 32-bit rounding signed conversion that will produce a 32-bit floating point result.

**Pseudocode**

```
(rD) ← float ((rA))
```

**Registers Altered**

- rD

**Latency**

- 4 cycles with C_AREA_OPTIMIZED=0
- 6 cycles with C_AREA_OPTIMIZED=1

**Note**

This instruction is only available when the MicroBlaze parameter C_USE_FPU is set to 2 (Extended).
fint

Floating Point Convert Float to Integer

\[ \text{fint} \quad \text{rD, rA} \]

| 0 | 1 | 0 | 1 | 1 | 0 | rD | rA | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|---|---|---|---|---|---|----|----|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 6 | 11 | 16 | 21 | 31 |

Description

Converts the floating point number in register rA to a signed integer and puts the result in register rD. This is a 32-bit signed conversion that will produce a 32-bit integer result.

Pseudocode

\[
\text{if } \text{isDnz}(rA) \text{ then}
\]
\[
\text{if } \text{isNaN}(rA) \text{ then}
\]
\[
\text{else if } \text{isInf}(rA) \text{ or } (rA) < -2^{31} \text{ or } (rA) > 2^{31} - 1 \text{ then}
\]
\[
\text{else}
\]
\[
(rD) \leftarrow \text{int} \left( (rA) \right)
\]

Registers Altered

- rD, unless an FP exception is generated, in which case the register is unchanged
- ESR[EC], if an FP exception is generated
- FSR[IO, DO]

Latency

- 5 cycles with C\_AREA\_OPTIMIZED=0
- 7 cycles with C\_AREA\_OPTIMIZED=1

Note

This instruction is only available when the MicroBlaze parameter C\_USE\_FPU is set to 2 (Extended).
**fsqrt**

Floating Point Arithmetic Square Root

<table>
<thead>
<tr>
<th>Instruction</th>
<th>rD, rA</th>
<th>Square Root</th>
</tr>
</thead>
<tbody>
<tr>
<td>fsqrt</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

**Description**

Performs a floating point square root on the value in rA and puts the result in register rD.

**Pseudocode**

```plaintext
if isDnz(rA) then
    (rD) ← 0xFFFCD0000
    FSR[DO] ← 1
    ESR[EC] ← 00110
else if isSigNaN(rA) then
    (rD) ← 0xFFFCD0000
    FSR[IO] ← 1
    ESR[EC] ← 00110
else if isQuietNaN(rA) then
    (rD) ← 0xFFFCD0000
else if (rA) < 0 then
    (rD) ← 0xFFFCD0000
    FSR[IO] ← 1
    ESR[EC] ← 00110
else if (rA) = -0 then
    (rD) ← -0
else
    (rD) ← sqrt ((rA))
```

**Registers Altered**

- rD, unless an FP exception is generated, in which case the register is unchanged
- ESR[EC], if an FP exception is generated
- FSR[IO,DO]

**Latency**

- 27 cycles with C_AREA_OPTIMIZED=0
- 29 cycles with C_AREA_OPTIMIZED=1

**Note**

This instruction is only available when the MicroBlaze parameter C_USE_FPU is set to 2 (Extended).
**get**

**get from stream interface**

<table>
<thead>
<tr>
<th>Instruction</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>tneaget</code> rD, FSLx</td>
<td>get data from link x</td>
</tr>
<tr>
<td>t = test-only</td>
<td></td>
</tr>
<tr>
<td>n = non-blocking</td>
<td></td>
</tr>
<tr>
<td>e = exception if control bit set</td>
<td></td>
</tr>
<tr>
<td>a = atomic</td>
<td></td>
</tr>
<tr>
<td><code>tneaget</code> rD, FSLx</td>
<td>get control from link x</td>
</tr>
<tr>
<td>t = test-only</td>
<td></td>
</tr>
<tr>
<td>n = non-blocking</td>
<td></td>
</tr>
<tr>
<td>e = exception if control bit not set</td>
<td></td>
</tr>
<tr>
<td>a = atomic</td>
<td></td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>Bit</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0-6</td>
<td>rD</td>
</tr>
<tr>
<td>7-11</td>
<td>0</td>
</tr>
<tr>
<td>12-16</td>
<td>n</td>
</tr>
<tr>
<td>17-19</td>
<td>c</td>
</tr>
<tr>
<td>20-21</td>
<td>t</td>
</tr>
<tr>
<td>22-23</td>
<td>e</td>
</tr>
<tr>
<td>24-27</td>
<td>a</td>
</tr>
<tr>
<td>28-31</td>
<td>FSLx</td>
</tr>
</tbody>
</table>

**Description**

MicroBlaze will read from the link x interface and place the result in register rD. If the available number of links set by `C_FSL_LINKS` is less than or equal to FSLx, link 0 is used.

The get instruction has 32 variants.

The blocking versions (when ‘n’ bit is ‘0’) will stall MicroBlaze until the data from the interface is valid. The non-blocking versions will not stall micro Blaze and will set carry to ‘0’ if the data was valid and to ‘1’ if the data was invalid. In case of an invalid access the destination register contents is undefined.

All data get instructions (when ‘c’ bit is ‘0’) expect the control bit from the interface to be ‘0’. If this is not the case, the instruction will set MSR[FSL] to ‘1’. All control get instructions (when ‘c’ bit is ‘1’) expect the control bit from the interface to be ‘1’. If this is not the case, the instruction will set MSR[FSL] to ‘1’.

The exception versions (when ‘e’ bit is ‘1’) will generate an exception if there is a control bit mismatch. In this case ESR is updated with EC set to the exception cause and ESS set to the link index. The target register, rD, is not updated when an exception is generated, instead the data is stored in EDR.

The test versions (when ‘t’ bit is ‘1’) will be handled as the normal case, except that the read signal to the link is not asserted.

Atomic versions (when ‘a’ bit is ‘1’) are not interruptible. This means that a sequence of atomic instructions can be grouped together without an interrupt breaking the program flow. However, note that exceptions may still occur.

When MicroBlaze is configured to use an MMU (`C_USE_MMU >= 1`) and not explicitly allowed by setting `C_MMU_PRIVILEGED_INSTR` to 1 these instructions are privileged. This means that if these instructions are attempted in User Mode (`MSR[UM] = 1`) a Privileged Instruction exception occurs.
Pseudocode

if MSR[UM] = 1 then
    ESR[EC] ← 00111
else
    x ← FSLx
    if x >= C_FSL_LINKS then
        x ← 0
    (rD) ← FSLx_S_DATA | Sx_AXIS_TDATA
    if (n = 1) then
        MSR[Carry] ← (FSLx_S_EXISTS | Sx_AXIS_TVALID)
    if (FSLx_S_CONTROL | Sx_AXIS_TLAST ≠ c) and
        (FSLx_S_EXISTS | Sx_AXIS_TVALID) then
        MSR[FSL] ← 1
    if (e = 1) then
        ESR[EC] ← 00000
        ESR[ESS] ← instruction bits [28:31]
        EDR ← FSLx_S_DATA | Sx_AXIS_TDATA

Registers Altered

- rD, unless an exception is generated, in which case the register is unchanged
- MSR[FSL]
- MSR[Carry]
- ESR[EC], in case a stream exception or a privileged instruction exception is generated
- ESR[ESS], in case a stream exception is generated
- EDR, in case a stream exception is generated

Latency

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

The blocking versions of this instruction will stall the pipeline of MicroBlaze until the instruction can be completed. Interrupts are served when the parameter C_USE_EXTENDED_FSL_INSTR is set to 1, and the instruction is not atomic.

Note

To refer to an FSLx interface in assembly language, use rfsl0, rfsl1, ... rfsl15.

The blocking versions of this instruction should not be placed in a delay slot when the parameter C_USE_EXTENDED_FSL_INSTR is set to 1, since this prevents interrupts from being served.

For non-blocking versions, an rsubc instruction can be used to decrement an index variable.

The ‘e’ bit does not have any effect unless C_FSL_EXCEPTION is set to 1.

These instructions are only available when the MicroBlaze parameter C_FSL_LINKS is greater than 0.

The extended instructions (exception, test and atomic versions) are only available when the MicroBlaze parameter C_USE_EXTENDED_FSL_INSTR is set to 1.

It is not recommended to allow these instructions in user mode, unless absolutely necessary for performance reasons, since that removes all hardware protection preventing incorrect use of a link.
Instructions

getd

get from stream interface dynamic

tneagetd  rD, rB  get data from link rB[28:31]
t = test-only
n = non-blocking
e = exception if control bit set
a = atomic

tneagetd  rD, rB  get control from link rB[28:31]
t = test-only
n = non-blocking
e = exception if control bit not set
a = atomic

<table>
<thead>
<tr>
<th>0 1 0 0 1 1</th>
<th>rD</th>
<th>0 0 0 0 0</th>
<th>rB</th>
<th>0 n c t a e 0 0 0 0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
<td>21</td>
</tr>
</tbody>
</table>

Description

MicroBlaze will read from the interface defined by the four least significant bits in rB and place the result in register rD. If the available number of links set by C_FSL_LINKS is less than or equal to the four least significant bits in rB, link 0 is used.

The getd instruction has 32 variants.

The blocking versions (when ‘n’ bit is ‘0’) will stall MicroBlaze until the data from the interface is valid. The non-blocking versions will not stall microblaze and will set carry to ‘0’ if the data was valid and to ‘1’ if the data was invalid. In case of an invalid access the destination register contents is undefined.

All data get instructions (when ‘c’ bit is ‘0’) expect the control bit from the interface to be ‘0’. If this is not the case, the instruction will set MSR[FSL] to ‘1’. All control get instructions (when ‘c’ bit is ‘1’) expect the control bit from the interface to be ‘1’. If this is not the case, the instruction will set MSR[FSL] to ‘1’.

The exception versions (when ‘e’ bit is ‘1’) will generate an exception if there is a control bit mismatch. In this case ESR is updated with EC set to the exception cause and ESS set to the link index. The target register, rD, is not updated when an exception is generated, instead the data is stored in EDR.

The test versions (when ‘t’ bit is ‘1’) will be handled as the normal case, except that the read signal to the link is not asserted.

Atomic versions (when ‘a’ bit is ‘1’) are not interruptible. This means that a sequence of atomic instructions can be grouped together without an interrupt breaking the program flow. However, note that exceptions may still occur.

When MicroBlaze is configured to use an MMU (C_USE_MMU >= 1) and not explicitly allowed by setting C_MU_PRIVILEGED_INSTR to 1 these instructions are privileged. This means that if these instructions are attempted in User Mode (MSR[UM] = 1) a Privileged Instruction exception occurs.
**Chapter 5: MicroBlaze Instruction Set Architecture**

**Pseudocode**

```plaintext
if MSR[UM] = 1 then
    ESR[EC] <- 00111
else
    x <- rB[28:31]
    if x >= C_FSL_LINKS then
        x <- 0
    (rD) <- FSLx_S_DATA | Sx_AXIS_TDATA
    if (n = 1) then
        MSR[Carry] <- (FSLx_S_EXISTS | Sx_AXIS_TVALID)
    if (FSLx_S_CONTROL | Sx_AXIS_TLAST ≠ c) and
       (FSLx_S_EXISTS | Sx_AXIS_TVALID) then
        MSR[FSL] <- 1
    if (e = 1) then
        ESR[EC] <- 00000
        ESR[ESS] <- rB[28:31]
        EDR <- FSLx_S_DATA | Sx_AXIS_TDATA
```

**Registers Altered**

- rD, unless an exception is generated, in which case the register is unchanged
- MSR[FSL]
- MSR[Carry]
- ESR[EC], in case a stream exception or a privileged instruction exception is generated
- ESR[ESS], in case a stream exception is generated
- EDR, in case a stream exception is generated

**Latency**

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

The blocking versions of this instruction will stall the pipeline of MicroBlaze until the instruction can be completed. Interrupts are served unless the instruction is atomic, which ensures that the instruction cannot be interrupted.

**Note**

The blocking versions of this instruction should not be placed in a delay slot, since this prevents interrupts from being served.

For non-blocking versions, an rsubc instruction can be used to decrement an index variable.

The ‘e’ bit does not have any effect unless C_FSL_EXCEPTION is set to 1.

These instructions are only available when the MicroBlaze parameter C_FSL_LINKS is greater than 0 and the parameter C_USE_EXTENDED_FSL_INSTR is set to 1.

It is not recommended to allow these instructions in user mode, unless absolutely necessary for performance reasons, since that removes all hardware protection preventing incorrect use of a link.
### idiv

**Integer Divide**

idiv rD, rA, rB  
divide rB by rA (signed)

idivu rD, rA, rB  
divide rB by rA (unsigned)

<table>
<thead>
<tr>
<th>0</th>
<th>1</th>
<th>0</th>
<th>0</th>
<th>1</th>
<th>0</th>
<th>1</th>
<th>1</th>
<th>2</th>
<th>1</th>
<th>1</th>
<th>3</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
</tbody>
</table>

**Description**

The contents of register rB is divided by the contents of register rA and the result is placed into register rD.

If the U bit is set, rA and rB are considered unsigned values. If the U bit is clear, rA and rB are considered signed values.

If the value of rA is 0, the DZO bit in MSR will be set and the value in rD will be 0, unless an exception is generated.

If the U bit is clear, the value of rA is -1, and the value of rB is -2147483648, the DZO bit in MSR will be set and the value in rD will be -2147483648, unless an exception is generated.

**Pseudocode**

```plaintext
if (rA) = 0 then
    (rD)  <- 0
    MSR[DZO]<- 1
    ESR[EC]  <- 00101
    ESR[DEC] <- 0
else if U = 0 and (rA) = -1 and (rB) = -2147483648 then
    (rD)  <- -2147483648
    MSR[DZO]<- 1
    ESR[EC]  <- 00101
    ESR[DEC] <- 1
else
    (rD)  <- (rB) / (rA)
```

**Registers Altered**

- rD, unless a divide exception is generated, in which case the register is unchanged
- MSR[DZO], if the value in rA is zero
- ESR[EC], if the value in rA is zero

**Latency**

- 1 cycle if (rA) = 0, otherwise 32 cycles with C_AREA_OPTIMIZED=0
- 1 cycle if (rA) = 0, otherwise 34 cycles with C_AREA_OPTIMIZED=1

**Note**

This instruction is only valid if MicroBlaze is configured to use a hardware divider (C_USE_DIV = 1).
imm

<table>
<thead>
<tr>
<th>imm</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>1 0 1 1 0 0</td>
<td>0 0 0 0 0 0</td>
</tr>
</tbody>
</table>

**Description**

The instruction imm loads the IMM value into a temporary register. It also locks this value so it can be used by the following instruction and form a 32-bit immediate value.

The instruction imm is used in conjunction with Type B instructions. Since Type B instructions have only a 16-bit immediate value field, a 32-bit immediate value cannot be used directly. However, 32-bit immediate values can be used in MicroBlaze. By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. The imm instruction locks the 16-bit IMM value temporarily for the next instruction. A Type B instruction that immediately follows the imm instruction will then form a 32-bit immediate value from the 16-bit IMM value of the imm instruction (upper 16 bits) and its own 16-bit immediate value field (lower 16 bits). If no Type B instruction follows the imm instruction, the locked value gets unlocked and becomes useless.

**Latency**

- 1 cycle

**Notes**

The imm instruction and the Type B instruction following it are atomic; consequently, no interrupts are allowed between them.

The assembler provided by Xilinx automatically detects the need for imm instructions. When a 32-bit IMM value is specified in a Type B instruction, the assembler converts the IMM value to a 16-bit one to assemble the instruction and inserts an imm instruction before it in the executable file.
**lbu**

**Load Byte Unsigned**

\[
lbu \quad rD, rA, rB
\]

\[
lbur \quad rD, rA, rB
\]

### Description

Loads a byte (8 bits) from the memory location that results from adding the contents of registers \( rA \) and \( rB \). The data is placed in the least significant byte of register \( rD \) and the other three bytes in \( rD \) are cleared.

If the R bit is set, a byte reversed memory location is used, loading data with the opposite endianness of the endianness defined by \( \text{C_ENDIANNESS} \) and the E bit (if virtual protected mode is enabled).

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if access is prevented by a no-access-allowed zone protection. This only applies to accesses with user mode and virtual protected mode enabled.

### Pseudocode

\[
\text{Addr} \leftarrow (rA) + (rB)
\]

if \( \text{TLB Miss(Addr)} \) and \( \text{MSR[VM]} = 1 \) then

\[
\begin{align*}
\text{ESR[EC]} & \leftarrow 10010; \text{ESR[S]} \leftarrow 0 \\
\text{MSR[UMS]} & \leftarrow \text{MSR[UM]}; \text{MSR[VMS]} \leftarrow \text{MSR[VM]}; \text{MSR[UM]} \leftarrow 0; \text{MSR[VM]} \leftarrow 0
\end{align*}
\]

else if \( \text{Access Protected(Addr)} \) and \( \text{MSR[UM]} = 1 \) and \( \text{MSR[VM]} = 1 \) then

\[
\begin{align*}
\text{ESR[EC]} & \leftarrow 10000; \text{ESR[S]} \leftarrow 0; \text{ESR[DIZ]} \leftarrow 1 \\
\text{MSR[UMS]} & \leftarrow \text{MSR[UM]}; \text{MSR[VMS]} \leftarrow \text{MSR[VM]}; \text{MSR[UM]} \leftarrow 0; \text{MSR[VM]} \leftarrow 0
\end{align*}
\]

else

\[
\begin{align*}
(rD)[24:31] & \leftarrow \text{Mem(Addr)} \\
(rD)[0:23] & \leftarrow 0
\end{align*}
\]

### Registers Altered

- \( rD \), unless an exception is generated, in which case the register is unchanged
- \( \text{MSR[UM]}, \text{MSR[VM]}, \text{MSR[UMS]}, \text{MSR[VMS]} \), if an exception is generated
- \( \text{ESR[EC]}, \text{ESR[S]} \), if an exception is generated
- \( \text{ESR[DIZ]} \), if a data storage exception is generated

### Latency

- 1 cycle with \( \text{C\_AREA\_OPTIMIZED}=0 \)
- 2 cycles with \( \text{C\_AREA\_OPTIMIZED}=1 \)

### Note

The byte reversed instruction is only valid if MicroBlaze is configured to use reorder instructions (\( \text{C\_USE\_REORDER\_INSTR} = 1 \)).
Chapter 5: MicroBlaze Instruction Set Architecture

**lbui**

**Load Byte Unsigned Immediate**

**lbui** rD, rA, IMM

```
<table>
<thead>
<tr>
<th></th>
<th></th>
<th></th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
</tr>
</tbody>
</table>
```

**Description**

Loads a byte (8 bits) from the memory location that results from adding the contents of register rA with the value in IMM, sign-extended to 32 bits. The data is placed in the least significant byte of register rD and the other three bytes in rD are cleared.

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if access is prevented by a no-access-allowed zone protection. This only applies to accesses with user mode and virtual protected mode enabled.

**Pseudocode**

```
Addr ← (rA) + sext(IMM)
if TLB_Miss(Addr) and MSR[VM] = 1 then
    ESR[EC] ← 10010; ESR[S] ← 0
    MSR[UMS] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else if Access_Protected(Addr) and MSR[UM] = 1 and MSR[VM] = 1 then
    ESR[EC] ← 10000; ESR[S] ← 0; ESR[DIZ] ← 1
    MSR[UMS] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else
    (rD)[24:31] ← Mem(Addr)
    (rD)[0:23] ← 0
```

**Registers Altered**

- rD, unless an exception is generated, in which case the register is unchanged
- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if an exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated

**Latency**

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

**Note**

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
lhu

Load Halfword Unsigned

\[
\text{lhu} \quad rD, rA, rB \\
\text{lhur} \quad rD, rA, rB
\]

<table>
<thead>
<tr>
<th>1</th>
<th>1</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>1</th>
<th>6</th>
<th>11</th>
<th>16</th>
<th>21</th>
<th>31</th>
</tr>
</thead>
<tbody>
<tr>
<td>rD</td>
<td>rA</td>
<td>rB</td>
<td>0</td>
<td>R</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
</tbody>
</table>

Description

Loads a halfword (16 bits) from the halfword aligned memory location that results from adding the contents of registers rA and rB. The data is placed in the least significant halfword of register rD and the most significant halfword in rD is cleared.

If the R bit is set, a halfword reversed memory location is used and the two bytes in the halfword are reversed, loading data with the opposite endianness of the endianness defined by C_ENDIANNESS and the E bit (if virtual protected mode is enabled).

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if access is prevented by a no-access-allowed zone protection. This only applies to accesses with user mode and virtual protected mode enabled.

An unaligned data access exception occurs if the least significant bit in the address is not zero.

Pseudocode

\[
\text{Addr} \leftarrow (rA) + (rB) \\
\text{if TLB Miss(Addr) and MSR[VM] = 1 then} \\
\quad \text{ESR[EC]} \leftarrow 10010; \text{ESR[S]} \leftarrow 0 \\
\quad \text{MSR[UMS]} \leftarrow \text{MSR[UM]; MSR[VMS]} \leftarrow \text{MSR[VM]; MSR[UM]} \leftarrow 0; \text{MSR[VM]} \leftarrow 0 \\
\text{else if Access Protected(Addr) and MSR[UM] = 1 and MSR[VM] = 1 then} \\
\quad \text{ESR[EC]} \leftarrow 10000; \text{ESR[S]} \leftarrow 0; \text{ESR[DIZ]} \leftarrow 1 \\
\quad \text{MSR[UMS]} \leftarrow \text{MSR[UM]; MSR[VMS]} \leftarrow \text{MSR[VM]; MSR[UM]} \leftarrow 0; \text{MSR[VM]} \leftarrow 0 \\
\text{else if Addr[31] \neq 0 then} \\
\quad \text{ESR[EC]} \leftarrow 00001; \text{ESR[W]} \leftarrow 0; \text{ESR[S]} \leftarrow 0; \text{ESR[Rx]} \leftarrow rD \\
\text{else} \\
\quad (rD)[16:31] \leftarrow \text{Mem(Addr); (rD)[0:15]} \leftarrow 0
\]

Registers Altered

- rD, unless an exception is generated, in which case the register is unchanged
- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if an exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated
- ESR[W], ESR[Rx], if an unaligned data access exception is generated

Latency

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

Note

The halfword reversed instruction is only valid if MicroBlaze is configured to use reorder instructions (C_USE_REORDER_INSTR = 1).
**lhui**

**Load Halfword Unsigned Immediate**

lhui     rD, rA, IMM

<table>
<thead>
<tr>
<th>1 1 1 0 0 1</th>
<th>rD</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
</tr>
</tbody>
</table>

**Description**

Loads a halfword (16 bits) from the halfword aligned memory location that results from adding the contents of register rA and the value in IMM, sign-extended to 32 bits. The data is placed in the least significant halfword of register rD and the most significant halfword in rD is cleared.

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB. A data storage exception occurs if access is prevented by a no-access-allowed zone protection. This only applies to accesses with user mode and virtual protected mode enabled. An unaligned data access exception occurs if the least significant bit in the address is not zero.

**Pseudocode**

```
Addr ← (rA) + sext(IMM)
if TLB_Miss(Addr) and MSR[VM] = 1 then
    ESR[EC] ← 10010; ESR[S] ← 0
    MSR[UMS] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else if Access.Protected(Addr) and MSR[UM] = 1 and MSR[VM] = 1 then
    ESR[EC] ← 10000; ESR[S] ← 0; ESR[DIZ] ← 1
    MSR[UMS] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else if Addr[31] ≠ 0 then
    ESR[EC] ← 00001; ESR[W] ← 0; ESR[S] ← 0; ESR[Rx] ← rD
else
    (rD)[16:31] ← Mem(Addr)
    (rD)[0:15] ← 0
```

**Registers Altered**

- rD, unless an exception is generated, in which case the register is unchanged
- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if a TLB miss exception or a data storage exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated
- ESR[W], ESR[Rx], if an unaligned data access exception is generated

**Latency**

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

**Note**

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
Instructions

lw

Load Word

lw rD, rA, rB
lwr rD, rA, rB

Description

Loads a word (32 bits) from the word aligned memory location that results from adding the contents of registers rA and rB. The data is placed in register rD.

If the R bit is set, the bytes in the loaded word are reversed, loading data with the opposite endianness of the endianness defined by C_ENDIANNESS and the E bit (if virtual protected mode is enabled).

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if access is prevented by a no-access-allowed zone protection. This only applies to accesses with user mode and virtual protected mode enabled.

An unaligned data access exception occurs if the two least significant bits in the address are not zero.

Pseudocode

Addr ← (rA) + (rB)
if TLB_Miss(Addr) and MSR[VM] = 1 then
    ESR[EC] ← 10010; ESR[S] ← 0
    MSR[UMS] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else if Access_Protected(Addr) and MSR[UM] = 1 and MSR[VM] = 1 then
    ESR[EC] ← 10000; ESR[S] ← 0; ESR[DIZ] ← 1
    MSR[UMS] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else if Addr[30:31] ≠ 0 then
    ESR[EC] ← 00001; ESR[W] ← 1; ESR[S] ← 0; ESR[Rx] ← rD
else
    (rD) ← Mem(Addr)

Registers Altered

- rD, unless an exception is generated, in which case the register is unchanged
- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if a TLB miss exception or a data storage exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated
- ESR[W], ESR[Rx], if an unaligned data access exception is generated

Latency

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

Note

The word reversed instruction is only valid if MicroBlaze is configured to use reorder instructions (C_USE_REORDER_INSTR = 1).
Load Word Immediate

**lwi**

```
 lwi   rD, rA, IMM
```

### Description

Loads a word (32 bits) from the word aligned memory location that results from adding the contents of register rA and the value IMM, sign-extended to 32 bits. The data is placed in register rD. A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB. A data storage exception occurs if access is prevented by a no-access-allowed zone protection. This only applies to accesses with user mode and virtual protected mode enabled. An unaligned data access exception occurs if the two least significant bits in the address are not zero.

### Pseudocode

```
Addr ← (rA) + sext(IMM)
if TLB_Miss(Addr) and MSR[VM] = 1 then
    ESR[EC] ← 10010; ESR[S] ← 0
    MSR[UM] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else if Access_Protected(Addr) and MSR[UM] = 1 and MSR[VM] = 1 then
    ESR[EC] ← 10000; ESR[S] ← 0; ESR[DIZ] ← 1
    MSR[UM] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else if Addr[30:31] ≠ 0 then
    ESR[EC] ← 00001; ESR[W] ← 1; ESR[S] ← 0; ESR[Rx] ← rD
else
    (rD) ← Mem(Addr)
```

### Registers Altered

- rD, unless an exception is generated, in which case the register is unchanged
- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if a TLB miss exception or a data storage exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated
- ESR[W], ESR[Rx], if an unaligned data access exception is generated

### Latency

- 1 cycle with `C_AREA_OPTIMIZED=0`
- 2 cycles with `C_AREA_OPTIMIZED=1`

### Note

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
Load Word Exclusive

\[ \text{lwx} \quad rD, rA, rB \]

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
</tr>
<tr>
<td>1</td>
<td>21</td>
<td>31</td>
<td>0</td>
</tr>
</tbody>
</table>

Description

Loads a word (32 bits) from the word aligned memory location that results from adding the contents of registers rA and rB. The data is placed in register rD, and the reservation bit is set. If an AXI4 interconnect with exclusive access enabled is used, and the interconnect response is not EXOKAY, the carry flag (MSR[C]) is set; otherwise the carry flag is cleared.

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if access is prevented by a no-access-allowed zone protection. This only applies to accesses with user mode and virtual protected mode enabled.

An unaligned data access exception will not occur, even if the two least significant bits in the address are not zero.

A data bus exception can occur when an AXI4 interconnect with exclusive access enabled is used, and the interconnect response is not EXOKAY, which means that an exclusive access cannot be handled.

Enabling AXI exclusive access ensures that the operation is protected from other bus masters, but requires that the addressed slave supports exclusive access. When exclusive access is not enabled, only the internal reservation bit is used. Exclusive access is enabled using the two parameters `C_M_AXI_DP_EXCLUSIVE_ACCESS` and `C_M_AXI_DC_EXCLUSIVE_ACCESS` for the peripheral and cache interconnect, respectively.

Pseudocode

\[
\text{Addr} \leftarrow (rA) + (rB)
\]

if TLB_Miss(Addr) and MSR[VM] = 1 then
  \[ \text{ESR[EC]} \leftarrow 10010; \text{ESR[S]} \leftarrow 0 \]
  MSR[UMS] \leftarrow MSR[UM]; MSR[VMS] \leftarrow MSR[VM]; MSR[UM] \leftarrow 0; MSR[VM] \leftarrow 0
else if Access_Protected(Addr) and MSR[UM] = 1 and MSR[VM] = 1 then
  \[ \text{ESR[EC]} \leftarrow 10000; \text{ESR[S]} \leftarrow 0; \text{ESR[DIZ]} \leftarrow 1 \]
  MSR[UMS] \leftarrow MSR[UM]; MSR[VMS] \leftarrow MSR[VM]; MSR[UM] \leftarrow 0; MSR[VM] \leftarrow 0
else if AXI_Exclusive_Used(Addr) && AXI_Response /= EXOKAY then
  \[ \text{ESR[EC]} \leftarrow 00100; \text{ESR[ECC]} \leftarrow 0; \]
  MSR[UMS] \leftarrow MSR[UM]; MSR[VMS] \leftarrow MSR[VM]; MSR[UM] \leftarrow 0; MSR[VM] \leftarrow 0
else
  (rD) \leftarrow \text{Mem(Addr)}; Reservation \leftarrow 1;
  if AXI_Exclusive_Used(Addr) && AXI_Response /= EXOKAY then
    MSR[C] \leftarrow 1
  else
    MSR[C] \leftarrow 0
Chapter 5: MicroBlaze Instruction Set Architecture

Registers Altered

- rD and MSR[C], unless an exception is generated, in which case they are unchanged
- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if a TLB miss exception or a data storage exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated

Latency

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

Note

This instruction is used together with SWX to implement exclusive access, such as semaphores and spinlocks.

The carry flag (MSR[C]) may not be set immediately (dependent on pipeline stall behavior). The LWX instruction should not be immediately followed by an SRC instruction, to ensure the correct value of the carry flag is obtained.
mbar  
Memory Barrier

<table>
<thead>
<tr>
<th>mbar</th>
<th>IMM</th>
<th>Memory Barrier</th>
</tr>
</thead>
<tbody>
<tr>
<td>1 0 1 1 1 0</td>
<td>0 0 0 1 0</td>
<td>0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0</td>
</tr>
<tr>
<td>0 6 11 16</td>
<td>31</td>
<td></td>
</tr>
</tbody>
</table>

**Description**

This instruction ensures that outstanding memory accesses on memory interfaces are completed before any subsequent instructions are executed. This is necessary to guarantee that self-modifying code is handled correctly, and that a DMA transfer can be safely started.

With self-modifying code, it is necessary to first use an MBAR instruction to wait for data accesses, which can be done by setting IMM to 1, and then use another MBAR instruction to clear the Branch Target Cache and empty the instruction prefetch buffer, which can be done by setting IMM to 2.

To ensure that data to be read by a DMA unit has been written to memory, it is only necessary to wait for data accesses, which can be done by setting IMM to 1.

When MicroBlaze is configured to use an MMU (C_USE_MMU >= 1) this instruction is privileged when the most significant bit in IMM is set to 1. This means that if the instruction is attempted in User Mode (MSR[UM] = 1) a Privileged Instruction exception occurs.

When the most significant bit in IMM is set to 1 and no exception occurs, MicroBlaze enters sleep mode after all outstanding accesses have been completed, and sets the Sleep output signal to indicate this. The pipeline is halted, and MicroBlaze will not continue execution until a bit in the Wakeup input signal is asserted.

**Pseudocode**

```plaintext
if (IMM & 1) = 0 then
    wait for instruction side memory accesses
if (IMM & 2) = 0 then
    wait for data side memory accesses
PC ← PC + 4
if (IMM & 16) = 16 then
    enter sleep mode
```

**Registers Altered**

- PC
- ESR[EC], in case a privileged instruction exception is generated

**Latency**

- $1 + N$ cycles, where $N$ is the number of cycles to wait for memory accesses to complete

**Notes**

This instruction must not be preceded by an imm instruction, and must not be placed in a delay slot.

With XCL, there is no way for this instruction to know when data writes are complete. Hence it is also necessary to read back the last written value in this case, to ensure that the access has completed.

The assembler pseudo-instruction sleep can be used instead of “mbar 16” to enter sleep mode.
mfs

Move From Special Purpose Register

mfs rD, rS

<table>
<thead>
<tr>
<th>1 0 0 1 0 1</th>
<th>0 0 0 0 0 0 1 0</th>
<th>0 0 0 0 0 0 0 0</th>
</tr>
</thead>
<tbody>
<tr>
<td>rD</td>
<td>rS</td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
</tr>
<tr>
<td>16</td>
<td>18</td>
<td>31</td>
</tr>
</tbody>
</table>

Description

Copies the contents of the special purpose register rS into register rD. The special purpose registers TLBLO and TLBHI are used to copy the contents of the Unified TLB entry indexed by TLBX.

Pseudocode

```
switch (rS):
  case 0x0000 : (rD) ← PC
  case 0x0001 : (rD) ← MSR
  case 0x0003 : (rD) ← EAR
  case 0x0005 : (rD) ← ESR
  case 0x0007 : (rD) ← FSR
  case 0x000B : (rD) ← BTR
  case 0x000D : (rD) ← EDR
  case 0x0800 : (rD) ← SLR
  case 0x0802 : (rD) ← SHR
  case 0x1000 : (rD) ← PID
  case 0x1001 : (rD) ← ZPR
  case 0x1002 : (rD) ← TLBX
  case 0x1003 : (rD) ← TLBLO
  case 0x1004 : (rD) ← TLBHI
  case 0x2000 : (rD) ← PVR[x] (where x = 0 to 11)
  default : (rD) ← Undefined
```

Registers Altered

- rD

Latency

- 1 cycle

Notes

To refer to special purpose registers in assembly language, use rpc for PC, rmsr for MSR, rear for EAR, resr for ESR, rfsr for FSR, rbr for BTR, redr for EDR, rslr for SLR, rshr for SHR, rpid for PID, rzpr for ZPR, rtlblo for TLBLO, rtlbhi for TLBHI, rtlbx for TLBX, and rpvrx - rpvrx for PVR0 - PVR11.

The value read from MSR may not include effects of the immediately preceding instruction (dependent on pipeline stall behavior). An instruction that does not affect MSR must precede the MFS instruction to guarantee correct MSR value.
The value read from FSR may not include effects of the immediately preceding instruction (dependent on pipeline stall behavior). An instruction that does not affect FSR must precede the MFS instruction to guarantee correct FSR value.

EAR, ESR and BTR are only valid as operands when at least one of the MicroBlaze C_*_EXCEPTION parameters are set to 1.

EDR is only valid as operand when the parameter C_FSL_EXCEPTION is set to 1 and the parameter C_FSL_LINKS is greater than 0.

FSR is only valid as an operand when the C_USE_FPU parameter is greater than 0.

SLR and SHR are only valid as an operand when the C_USE_STACK_PROTECTION parameter is set to 1.

PID, ZPR, TLBLO and TLBHI are only valid as operands when the parameter C_USE_MMU > 1 (User Mode) and the parameter C_MMU_TLB_ACCESS = 1 (Read) or 3 (Full).

TLBX is only valid as operand when the parameter C_USE_MMU > 1 (User Mode) and the parameter C_MMU_TLB_ACCESS > 0 (Minimal).

PVR0 is only valid as an operand when C_PVR is 1 (Basic) or 2 (Full), and PVR1 - PVR11 are only valid as operands when C_PVR is set to 2 (Full).
msrclr

Read MSR and clear bits in MSR

\[
\text{msrclr \hspace{0.5cm} rD, Imm}
\]

<table>
<thead>
<tr>
<th>1 0 0 1 0 1</th>
<th>rD</th>
<th>1 0 0 0 1 0</th>
<th>Imm15</th>
</tr>
</thead>
<tbody>
<tr>
<td>0 6 11 16 17 31</td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Description

Copies the contents of the special purpose register MSR into register rD. Bit positions in the IMM value that are 1 are cleared in the MSR. Bit positions that are 0 in the IMM value are left untouched.

When MicroBlaze is configured to use an MMU (C\_USE\_MMU >= 1) this instruction is privileged for all IMM values except those only affecting C. This means that if the instruction is attempted in User Mode (MSR[UM] = 1) in this case a Privileged Instruction exception occurs.

Pseudocode

\[
\text{if MSR[UM] = 1 and IMM \neq 0x4 then}\n\text{ESR[EC] \leftarrow 00111}\n\text{else}\n\text{(rD) \leftarrow (MSR)}\n\text{(MSR) \leftarrow (MSR) \land (\text{IMM})}\n\]

Registers Altered

- rD
- MSR
- ESR[EC], in case a privileged instruction exception is generated

Latency

- 1 cycle

Notes

MSRCLR will affect the Carry bit immediately while the remaining bits will take effect one cycle after the instruction has been executed. When clearing the IE bit, it is guaranteed that the processor will not react to any interrupt for the subsequent instructions.

The value read from MSR may not include effects of the immediately preceding instruction (dependent on pipeline stall behavior). An instruction that does not affect MSR must precede the MSRCLR instruction to guarantee correct MSR value.

The immediate values has to be less than \(2^{15}\) when C\_USE\_MMU >= 1 (User Mode), and less than \(2^{14}\) otherwise. Only bits 17 to 31 of the MSR can be cleared when C\_USE\_MMU >= 1 (User Mode), and bits 18 to 31 otherwise.

This instruction is only available when the parameter C\_USE\_MSR\_INSTR is set to 1.

When clearing MSR[VM] the instruction must always be followed by a synchronizing branch instruction, for example BRI 4.
msrset

Read MSR and set bits in MSR

Syntax

msrset rD, Imm

Description

Copies the contents of the special purpose register MSR into register rD. Bit positions in the IMM value that are 1 are set in the MSR. Bit positions that are 0 in the IMM value are left untouched.

When MicroBlaze is configured to use an MMU (C_USE_MMU >= 1) this instruction is privileged for all IMM values except those only affecting C. This means that if the instruction is attempted in User Mode (MSR[UM] = 1) in this case a Privileged Instruction exception occurs.

With low-latency interrupt mode (C_USE_INTERRUPT = 2), the Interrupt_Ack output port is set to 11 if the MSR{IE} bit is set by executing this instruction.

Pseudocode

if MSR[UM] = 1 and IMM ≠ 0x4 then
   ESR[EC] ← 00111
else
   (rD) ← (MSR)
   (MSR) ← (MSR) ∨ (IMM)
   if (IMM) & 2
      Interrupt_Ack ← 11

Registers Altered

- rD
- MSR
- ESR[EC], in case a privileged instruction exception is generated

Latency

- 1 cycle

Notes

MSRSET will affect the Carry bit immediately while the remaining bits will take effect one cycle after the instruction has been executed. When setting the EIP or BIP bit, it is guaranteed that the processor will not react to any interrupt or normal hardware break for the subsequent instructions.

The value read from MSR may not include effects of the immediately preceding instruction (dependent on pipeline stall behavior). An instruction that does not affect MSR must precede the MSRSET instruction to guarantee correct MSR value.

The immediate values has to be less than 2^{15} when C_USE_MMU >= 1 (User Mode), and less than 2^{14} otherwise. Only bits 17 to 31 of the MSR can be set when C_USE_MMU >= 1 (User Mode), and bits 18 to 31 otherwise.

This instruction is only available when the parameter C_USE_MSR_INSTR is set to 1.

When setting MSR[VM] the instruction must always be followed by a synchronizing branch instruction, for example BRI 4.
## Move To Special Purpose Register

### Description

Copies the contents of register rD into the special purpose register rS. The special purpose registers TLBLO and TLBHI are used to copy to the Unified TLB entry indexed by TLBX.

When MicroBlaze is configured to use an MMU (C\_USE\_MMU >= 1) this instruction is privileged. This means that if the instruction is attempted in User Mode (MSR[UM] = 1) a Privileged Instruction exception occurs.

With low-latency interrupt mode (C\_USE\_INTERRUPT = 2), the Interrupt_Ack output port is set to 11 if the MSR[IE] bit is set by executing this instruction.

### Pseudocode

```pseudocode
if MSR[UM] = 1 then
    ESR[EC] ← 00111
else
    switch (rS)
        case 0x0001 : MSR ← (rA)
        case 0x0007 : FSR ← (rA)
        case 0x0800 : SLR ← (rA)
        case 0x0802 : SHR ← (rA)
        case 0x1000 : PID ← (rA)
        case 0x1001 : ZPR ← (rA)
        case 0x1002 : TLBX ← (rA)
        case 0x1003 : TLBLO ← (rA)
        case 0x1004 : TLBHI ← (rA)
        case 0x1005 : TLBSX ← (rA)
        if (rS) = 0x0001 && (rA) & 2
            Interrupt_Ack ← 11
```

### Registers Altered

- rS
- ESR[EC], in case a privileged instruction exception is generated

### Latency

- 1 cycle

### Notes

When writing MSR using MTS, all bits take effect one cycle after the instruction has been executed. An MTS instruction writing MSR should never be followed back-to-back by an instruction that uses the MSR content. When clearing the IE bit, it is guaranteed that the processor will not react to any interrupt for the subsequent instructions. When setting the EIP or BIP bit, it is guaranteed that the processor will not react to any interrupt or normal hardware break for the subsequent instructions.
To refer to special purpose registers in assembly language, use rmsr for MSR, rfsr for FSR, rsr for SLR, rsr for SHR, rpid for PID, rzpr for ZPR, rtlblo for TLBLO, rtlbhi for TLBHI, rtlbx for TLBX, and rtlbsx for TLBSX.

The PC, ESR, EAR, BTR, EDR and PVR0 - PVR11 cannot be written by the MTS instruction.

The FSR is only valid as a destination if the MicroBlaze parameter C_USE_FPU is greater than 0.

The SLR and SHR are only valid as a destination if the MicroBlaze parameter C_USE_STACK_PROTECTION is set to 1.

PID, ZPR and TLBSX are only valid as destinations when the parameter C_USE_MMU > 1 (User Mode) and the parameter C_MMU_TLB_ACCESS > 1 (Read). TLBLO, TLBHI and TLBX are only valid as destinations when the parameter C_USE_MMU > 1 (User Mode).

When changing MSR[VM] or PID the instruction must always be followed by a synchronizing branch instruction, for example BRI 4.
mul     Multiply

mul     rD, rA, rB

|   0   | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|   0   | 6 | 1 | 1 | 2 | 3 | 0 | 2 | 6 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |

Description

Multiplies the contents of registers rA and rB and puts the result in register rD. This is a 32-bit by 32-bit multiplication that will produce a 64-bit result. The least significant word of this value is placed in rD. The most significant word is discarded.

Pseudocode

\[(rD) \leftarrow \text{LSW}( (rA) \times (rB) )\]

Registers Altered

- rD

Latency

- 1 cycle with C_AREA_OPTIMIZED=0
- 3 cycles with C_AREA_OPTIMIZED=1

Note

This instruction is only valid if the target architecture has multiplier primitives, and if present, the MicroBlaze parameter C_USE_HW_MUL is greater than 0.
mulh

Multiply High

\[
mulh \quad rD, rA, rB
\]

<table>
<thead>
<tr>
<th>0</th>
<th>1</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>1</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td>3</td>
<td>1</td>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Description

Multiplies the contents of registers rA and rB and puts the result in register rD. This is a 32-bit by 32-bit signed multiplication that will produce a 64-bit result. The most significant word of this value is placed in rD. The least significant word is discarded.

Pseudocode

\[
(rD) \leftarrow \text{MSW}( (rA) \times (rB) ), \text{ signed}
\]

Registers Altered

- rD

Latency

- 1 cycle with C\_AREA\_OPTIMIZED=0
- 3 cycles with C\_AREA\_OPTIMIZED=1

Note

This instruction is only valid if the target architecture has multiplier primitives, and if present, the MicroBlaze parameter C\_USE\_HW\_MUL is set to 2 (Mul64).

When MULH is used, bit 30 and 31 in the MUL instruction must be zero to distinguish between the two instructions. In previous versions of MicroBlaze, these bits were defined as zero, but the actual values were not relevant.
**mulhu**

**Multiply High Unsigned**

\[
\text{mulhu} \quad \text{rD, rA, rB}
\]

<table>
<thead>
<tr>
<th>0</th>
<th>1</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>1</th>
<th>1</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td>3</td>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

**Description**

Multiplies the contents of registers rA and rB and puts the result in register rD. This is a 32-bit by 32-bit unsigned multiplication that will produce a 64-bit unsigned result. The most significant word of this value is placed in rD. The least significant word is discarded.

**Pseudocode**

\[
(rD) \leftarrow \text{MSW}( (rA) \times (rB) ), \text{ unsigned}
\]

**Registers Altered**

- rD

**Latency**

- 1 cycle with C\_AREA\_OPTIMIZED=0
- 3 cycles with C\_AREA\_OPTIMIZED=1

**Note**

This instruction is only valid if the target architecture has multiplier primitives, and if present, the MicroBlaze parameter C\_USE\_HW\_MUL is set to 2 (Mul64).

When MULHU is used, bit 30 and 31 in the MUL instruction must be zero to distinguish between the two instructions. In previous versions of MicroBlaze, these bits were defined as zero, but the actual values were not relevant.
**mulhsu**

**Multiply High Signed Unsigned**

\[
\text{mulhsu} \quad rD, rA, rB
\]

<table>
<thead>
<tr>
<th>0</th>
<th>1</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td>3</td>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td>3</td>
<td>1</td>
<td>1</td>
</tr>
</tbody>
</table>

**Description**

Multiplies the contents of registers rA and rB and puts the result in register rD. This is a 32-bit signed by 32-bit unsigned multiplication that will produce a 64-bit signed result. The most significant word of this value is placed in rD. The least significant word is discarded.

**Pseudocode**

\[
(rD) \leftarrow \text{MSW}((rA), \text{signed} \times (rB), \text{unsigned}), \text{signed}
\]

**Registers Altered**

- rD

**Latency**

- 1 cycle with C\_AREA\_OPTIMIZED=0
- 3 cycles with C\_AREA\_OPTIMIZED=1

**Note**

This instruction is only valid if the target architecture has multiplier primitives, and if present, the MicroBlaze parameter C\_USE\_HW\_MUL is set to 2 (Mul64).

When MULHSU is used, bit 30 and 31 in the MUL instruction must be zero to distinguish between the two instructions. In previous versions of MicroBlaze, these bits were defined as zero, but the actual values were not relevant.
muli

Multiply Immediate

muli rD, rA, IMM

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>1</td>
<td>3</td>
</tr>
<tr>
<td>2</td>
<td>6</td>
<td>6</td>
<td>1</td>
</tr>
</tbody>
</table>

Description

Multiplies the contents of registers rA and the value IMM, sign-extended to 32 bits; and puts the result in register rD. This is a 32-bit by 32-bit multiplication that will produce a 64-bit result. The least significant word of this value is placed in rD. The most significant word is discarded.

Pseudocode

(rD) ← LSW((rA) × sext(IMM))

Registers Altered

• rD

Latency

• 1 cycle with C_AREA_OPTIMIZED=0
• 3 cycles with C_AREA_OPTIMIZED=1

Notes

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.

This instruction is only valid if the target architecture has multiplier primitives, and if present, the MicroBlaze parameter C_USE_HW_MUL is greater than 0.
Instructions

or

or

Logical OR

or rD, rA, rB

Description

The contents of register rA are ORed with the contents of register rB; the result is placed into register rD.

Pseudocode

\[(rD) \leftarrow (rA) \lor (rB)\]

Registers Altered

- rD

Latency

- 1 cycle

Note

The assembler pseudo-instruction nop is implemented as “or r0, r0, r0”.

\[\begin{array}{cccccccc}
1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
\hline
rD & rA & rB & 0 & 0 & 0 & 0 & 0 \\
0 & 6 & 1 & 1 & 2 & 3 & 1 & 6 & 1 & 1 & 1
\end{array}\]
ori

Logical OR with Immediate

ori rD, rA, IMM

<table>
<thead>
<tr>
<th>1 0 1 0 0 0</th>
<th>rD</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td></td>
<td>3</td>
</tr>
</tbody>
</table>

Description

The contents of register rA are ORed with the extended IMM field, sign-extended to 32 bits; the result is placed into register rD.

Pseudocode

\[(rD) \leftarrow (rA) \lor \text{sext(IMM)}\]

Registers Altered

- rD

Latency

- 1 cycle

Note

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
pcmpbf

Pattern Compare Byte Find

pcmpbf rD, rA, rB bytewise comparison returning position of first match

<table>
<thead>
<tr>
<th>1 0 0 0 0 0</th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
<th>1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>6</td>
<td>1</td>
<td>3</td>
</tr>
</tbody>
</table>

Description

The contents of register rA is bytewise compared with the contents in register rB.

- rD is loaded with the position of the first matching byte pair, starting with MSB as position 1, and comparing until LSB as position 4
- If none of the byte pairs match, rD is set to 0

Pseudocode

```plaintext
if rB[0:7] = rA[0:7] then
    (rD) ← 1
else
        (rD) ← 2
    else
            (rD) ← 3
        else
                (rD) ← 4
            else
                (rD) ← 0
```

Registers Altered

- rD

Latency

- 1 cycle

Note

This instruction is only available when the parameter C_USE_PCPMP_INSTR is set to 1.
**pcmpeq**   

**Pattern Compare Equal**

pcmpeq rD, rA, rB  
equality comparison with a positive boolean result

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
<th></th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td>1</td>
<td>1</td>
<td>3</td>
</tr>
</tbody>
</table>

**Description**

The contents of register rA is compared with the contents in register rB.

- rD is loaded with 1 if they match, and 0 if not

**Pseudocode**

```
if (rB) = (rA) then
    (rD) \leftarrow 1
else
    (rD) \leftarrow 0
```

**Registers Altered**

- rD

**Latency**

- 1 cycle

**Note**

This instruction is only available when the parameter C_USE_PCMP_INSTR is set to 1.
pcmpne

Pattern Compare Not Equal

pcmpne rD, rA, rB

Equality comparison with a negative boolean result

<table>
<thead>
<tr>
<th>1</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>1</th>
<th>1</th>
</tr>
</thead>
<tbody>
<tr>
<td>rD</td>
<td>rA</td>
<td>rB</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>6</td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>2</td>
<td>1</td>
<td>1</td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Description

The contents of register rA is compared with the contents in register rB.

- rD is loaded with 0 if they match, and 1 if not

Pseudocode

if (rB) = (rA) then
  (rD) ← 0
else
  (rD) ← 1

Registers Altered

- rD

Latency

- 1 cycle

Note

This instruction is only available when the parameter C_USE_PCPM_INSTR is set to 1.
Chapter 5: MicroBlaze Instruction Set Architecture

put

Put to stream interface

<table>
<thead>
<tr>
<th>na</th>
<th>put</th>
<th>rA, FSLx</th>
<th>put data to link x</th>
</tr>
</thead>
<tbody>
<tr>
<td>n</td>
<td>=</td>
<td></td>
<td>non-blocking</td>
</tr>
<tr>
<td></td>
<td>a</td>
<td></td>
<td>atomic</td>
</tr>
<tr>
<td>tn</td>
<td>nput</td>
<td>FSLx</td>
<td>put data to link x test-only</td>
</tr>
<tr>
<td>n</td>
<td>=</td>
<td></td>
<td>non-blocking</td>
</tr>
<tr>
<td></td>
<td>a</td>
<td></td>
<td>atomic</td>
</tr>
<tr>
<td>n</td>
<td>caput</td>
<td>rA, FSLx</td>
<td>put control to link x</td>
</tr>
<tr>
<td>n</td>
<td>=</td>
<td></td>
<td>non-blocking</td>
</tr>
<tr>
<td></td>
<td>a</td>
<td></td>
<td>atomic</td>
</tr>
<tr>
<td>tn</td>
<td>caput</td>
<td>FSLx</td>
<td>put control to link x test-only</td>
</tr>
<tr>
<td>n</td>
<td>=</td>
<td></td>
<td>non-blocking</td>
</tr>
<tr>
<td></td>
<td>a</td>
<td></td>
<td>atomic</td>
</tr>
</tbody>
</table>

| 01101100000 rA | 1 n c t a 0000000 FSLx |
| 0 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | rA | 1 | n | c | t | a | 0 | 0 | 0 | 0 | 0 | 0 | FSLx |
| 0 | 6 | 11 | 16 | 28 | 31 |

Description

MicroBlaze will write the value from register rA to the link x interface. If the available number of links set by C_FSL_LINKS is less than or equal to FSLx, link 0 is used.

The put instruction has 16 variants.

The blocking versions (when ‘n’ is ‘0’) will stall MicroBlaze until there is space available in the interface. The non-blocking versions will not stall MicroBlaze and will set carry to ‘0’ if space was available and to ‘1’ if no space was available.

All data put instructions (when ‘c’ is ‘0’) will set the control bit to the interface to ‘0’ and all control put instructions (when ‘c’ is ‘1’) will set the control bit to ‘1’.

The test versions (when ‘t’ bit is ‘1’) will be handled as the normal case, except that the write signal to the link is not asserted (thus no source register is required).

Atomic versions (when ‘a’ bit is ‘1’) are not interruptible. This means that a sequence of atomic instructions can be grouped together without an interrupt breaking the program flow. However, note that exceptions may still occur.

When MicroBlaze is configured to use an MMU (C_USE_MMU >= 1) and not explicitly allowed by setting C_MMU_PRIVILEGED_INSTR to 1 these instructions are privileged. This means that if these instructions are attempted in User Mode (MSR[UM] = 1) a Privileged Instruction exception occurs.
Pseudocode

\[
\begin{align*}
\text{if } \text{MSR}[\text{UM}] &= 1 \text{ then} \\
\quad \text{ESR}[\text{EC}] &\leftarrow 00111 \\
\text{else} \\
\quad x &\leftarrow \text{FSLx} \\
\quad \text{if } x \geq C_{\text{FSL\_LINKS}} \text{ then} \\
\quad \quad x &\leftarrow 0 \\
\quad (FSLx_{\text{M\_DATA}} \mid Mx_{\text{AXIS\_TDATA}}) &\leftarrow (rA) \\
\quad \text{if } (n = 1) \text{ then} \\
\quad \quad \text{MSR}[\text{Carry}] &\leftarrow \\
\quad \quad (FSLx_{\text{M\_FULL}} \mid Mx_{\text{AXIS\_TVALID}} \land Mx_{\text{AXIS\_TREADY}}) \\
\quad (FSLx_{\text{M\_CONTROL}} \mid Mx_{\text{AXIS\_TLAST}}) &\leftarrow C
\end{align*}
\]

Registers Altered

- MSR[Carry]
- ESR[EC], in case a privileged instruction exception is generated

Latency

- 1 cycle with \( C_{\text{AREA\_OPTIMIZED}} = 0 \)
- 2 cycles with \( C_{\text{AREA\_OPTIMIZED}} = 1 \)

The blocking versions of this instruction will stall the pipeline of MicroBlaze until the instruction can be completed. Interrupts are served when the parameter \( C_{\text{USE\_EXTENDED\_FSL\_INSTR}} \) is set to 1, and the instruction is not atomic.

Note

To refer to an FSLx interface in assembly language, use \text{rfslo}, \text{rfsl1}, \ldots \text{rfsl15}.

The blocking versions of this instruction should not be placed in a delay slot when the parameter \( C_{\text{USE\_EXTENDED\_FSL\_INSTR}} \) is set to 1, since this prevents interrupts from being served.

These instructions are only available when the MicroBlaze parameter \( C_{\text{FSL\_LINKS}} \) is greater than 0.

The extended instructions (test and atomic versions) are only available when the MicroBlaze parameter \( C_{\text{USE\_EXTENDED\_FSL\_INSTR}} \) is set to 1.

It is not recommended to allow these instructions in user mode, unless absolutely necessary for performance reasons, since that removes all hardware protection preventing incorrect use of a link.
putd

Put to stream interface dynamic

\[ \text{naputd} \quad rA, rB \quad \text{put data to link } rB[28:31] \]
\[ n = \text{non-blocking} \]
\[ a = \text{atomic} \]

\[ \text{tnaputd} \quad rB \quad \text{put data to link } rB[28:31] \text{ test-only} \]
\[ n = \text{non-blocking} \]
\[ a = \text{atomic} \]

\[ \text{ncaputd} \quad rA, rB \quad \text{put control to link } rB[28:31] \]
\[ n = \text{non-blocking} \]
\[ a = \text{atomic} \]

\[ \text{tncaputd} \quad rB \quad \text{put control to link } rB[28:31] \text{ test-only} \]
\[ n = \text{non-blocking} \]
\[ a = \text{atomic} \]

\begin{tabular}{cccccccccccc}
0 & 1 & 0 & 0 & 1 & 1 & 0 & 0 & 0 & 0 & rA & rB & 1 & n & c & t & a & 0 & 0 & 0 & 0 & 0
\end{tabular}

0 & 6 & 11 & 16 & 21 & 31

Description

MicroBlaze will write the value from register \( rA \) to the link interface defined by the four least significant bits in \( rB \). If the available number of links set by \( \text{C\_FSL\_LINKS} \) is less than or equal to the four least significant bits in \( rB \), link 0 is used.

The putd instruction has 16 variants.

The blocking versions (when ‘n’ is ‘0’) will stall MicroBlaze until there is space available in the interface. The non-blocking versions will not stall MicroBlaze and will set carry to ‘0’ if space was available and to ‘1’ if no space was available.

All data putd instructions (when ‘c’ is ‘0’) will set the control bit to the interface to ‘0’ and all control putd instructions (when ‘c’ is ‘1’) will set the control bit to ‘1’.

The test versions (when ‘t’ bit is ‘1’) will be handled as the normal case, except that the write signal to the link is not asserted (thus no source register is required).

Atomic versions (when ‘a’ bit is ‘1’) are not interruptible. This means that a sequence of atomic instructions can be grouped together without an interrupt breaking the program flow. However, note that exceptions may still occur.

When MicroBlaze is configured to use an MMU (\( \text{C\_USE\_MMU} \geq 1 \)) and not explicitly allowed by setting \( \text{C\_MMU\_PRIVILEGED\_INSTR} \) to 1 these instructions are privileged. This means that if these instructions are attempted in User Mode (\( \text{MSR[UM]} = 1 \)) a Privileged Instruction exception occurs.
Pseudocode

\[
\begin{align*}
  &\text{if } \text{MSR[UM]} = 1 \text{ then} \\
  &\quad \text{ESR[EC]} \leftarrow 00111 \\
  &\text{else} \\
  &\quad x \leftarrow rB[28:31] \\
  &\quad \text{if } x \geq C_{\text{FSL\_LINKS}} \text{ then} \\
  &\quad\quad x \leftarrow 0 \\
  &\quad (FSL_x\_M\_DATA \mid M_x\_AXIS\_TDATA) \leftarrow (rA) \\
  &\quad \text{if } (n = 1) \text{ then} \\
  &\quad\quad \text{MSR[Carry]} \leftarrow \\
  &\quad\quad (FSL_x\_M\_FULL \mid M_x\_AXIS\_TVALID \land M_x\_AXIS\_TREADY) \\
  &\quad\quad (FSL_x\_M\_CONTROL \mid M_x\_AXIS\_TLAST) \leftarrow C
\end{align*}
\]

Registers Altered

- MSR[Carry]
- ESR[EC], in case a privileged instruction exception is generated

Latency

- 1 cycle with \textit{C\_AREA\_OPTIMIZED}=0
- 2 cycles with \textit{C\_AREA\_OPTIMIZED}=1

The blocking versions of this instruction will stall the pipeline of MicroBlaze until the instruction can be completed. Interrupts are served unless the instruction is atomic, which ensures that the instruction cannot be interrupted.

Note

The blocking versions of this instruction should not be placed in a delay slot, since this prevents interrupts from being served.

These instructions are only available when the MicroBlaze parameter \textit{C\_FSL\_LINKS} is greater than 0 and the parameter \textit{C\_USE\_EXTENDED\_FSL\_INSTR} is set to 1.

It is not recommended to allow these instructions in user mode, unless absolutely necessary for performance reasons, since that removes all hardware protection preventing incorrect use of a link.
rsub

Arithmetic Reverse Subtract

- rsub  rD, rA, rB  Subtract
- rsubc rD, rA, rB  Subtract with Carry
- rsubk rD, rA, rB  Subtract and Keep Carry
- rsubkc rD, rA, rB  Subtract with Carry and Keep Carry

<table>
<thead>
<tr>
<th>0</th>
<th>0</th>
<th>0</th>
<th>K</th>
<th>C</th>
<th>1</th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
<td>3</td>
<td>0</td>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Description

The contents of register rA is subtracted from the contents of register rB and the result is placed into register rD. Bit 3 of the instruction (labeled as K in the figure) is set to one for the mnemonic rsubk. Bit 4 of the instruction (labeled as C in the figure) is set to one for the mnemonic rsubc. Both bits are set to one for the mnemonic rsubkc.

When an rsub instruction has bit 3 set (rsubk, rsubkc), the carry flag will Keep its previous value regardless of the outcome of the execution of the instruction. If bit 3 is cleared (rsub, rsubc), then the carry flag will be affected by the execution of the instruction.

When bit 4 of the instruction is set to one (rsubc, rsubkc), the content of the carry flag (MSR[C]) affects the execution of the instruction. When bit 4 is cleared (rsub, rsubk), the content of the carry flag does not affect the execution of the instruction (providing a normal subtraction).

Pseudocode

```plaintext
if C = 0 then
    (rD) ← (rB) + (rA) + 1
else
    (rD) ← (rB) + (rA) + MSR[C]
if K = 0 then
    MSR[C] ← CarryOut
```

Registers Altered

- rD
- MSR[C]

Latency

- 1 cycle

Notes

In subtractions, Carry = (Borrow). When the Carry is set by a subtraction, it means that there is no Borrow, and when the Carry is cleared, it means that there is a Borrow.
rsubi

Arithmetic Reverse Subtract Immediate

**rsubi**

- rD, rA, IMM  Subtract Immediate
- rD, rA, IMM  Subtract Immediate with Carry
- rD, rA, IMM  Subtract Immediate and Keep Carry
- rD, rA, IMM  Subtract Immediate with Carry and Keep Carry

### Description

The contents of register rA is subtracted from the value of IMM, sign-extended to 32 bits, and the result is placed into register rD. Bit 3 of the instruction (labeled as K in the figure) is set to one for the mnemonic rsubik. Bit 4 of the instruction (labeled as C in the figure) is set to one for the mnemonic rsubic. Both bits are set to one for the mnemonic rsubikc.

When an rsubi instruction has bit 3 set (rsubik, rsubikc), the carry flag will Keep its previous value regardless of the outcome of the execution of the instruction. If bit 3 is cleared (rsubi, rsubic), then the carry flag will be affected by the execution of the instruction. When bit 4 of the instruction is set to one (rsubic, rsubikc), the content of the carry flag (MSR[C]) affects the execution of the instruction. When bit 4 is cleared (rsubi, rsubik), the content of the carry flag does not affect the execution of the instruction (providing a normal subtraction).

### Pseudocode

```plaintext
if C = 0 then
    (rD) ← sext(IMM) + (rA) + 1
else
    (rD) ← sext(IMM) + (rA) + MSR[C]
if K = 0 then
    MSR[C] ← CarryOut
```

### Registers Altered

- rD
- MSR[C]

### Latency

- 1 cycle

### Notes

In subtractions, Carry = (Borrow). When the Carry is set by a subtraction, it means that there is no Borrow, and when the Carry is cleared, it means that there is a Borrow. By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
Return from Break

rtbd rA, IMM

Description

Return from break will branch to the location specified by the contents of rA plus the IMM field, sign-extended to 32 bits. It will also enable breaks after execution by clearing the BIP flag in the MSR.

This instruction always has a delay slot. The instruction following the RTBD is always executed before the branch target. That delay slot instruction has breaks disabled.

When MicroBlaze is configured to use an MMU (C_USE_MMU >= 1) this instruction is privileged. This means that if the instruction is attempted in User Mode (MSR[UM] = 1) a Privileged Instruction exception occurs.

Pseudocode

if MSR[UM] = 1 then
    ESR[EC] ← 00111
else
    PC ← (rA) + sext(IMM)
    allow following instruction to complete execution
    MSR[BIP] ← 0
    MSR[UM] ← MSR[UMS]
    MSR[VM] ← MSR[VMS]

Registers Altered

- PC
- MSR[BIP], MSR[UM], MSR[VM]
- ESR[EC], in case a privileged instruction exception is generated

Latency

- 2 cycles

Note

Convention is to use general purpose register r16 as rA.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
Return from Interrupt

rtid

rtid rA, IMM

<p>| | | | | | | |</p>
<table>
<thead>
<tr>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>rA</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>IMM</td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
<td>31</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Description

Return from interrupt will branch to the location specified by the contents of rA plus the IMM field, sign-extended to 32 bits. It will also enable interrupts after execution.

This instruction always has a delay slot. The instruction following the RTID is always executed before the branch target. That delay slot instruction has interrupts disabled.

When MicroBlaze is configured to use an MMU (C_USE_MMU >= 1) this instruction is privileged. This means that if the instruction is attempted in User Mode (MSR[UM] = 1) a Privileged Instruction exception occurs.

With low-latency interrupt mode (C_USE_INTERRUPT = 2), the Interrupt_Ack output port is set to 10 when this instruction is executed, and subsequently to 11 when the MSR[IE] bit is set.

Pseudocode

if MSR[UM] = 1 then
    ESR[EC] ← 00111
else
    PC ← (rA) + sext(IMM)
    Interrupt_Ack ← 10
    allow following instruction to complete execution
    MSR[IE] ← 1
    MSR[UM] ← MSR[UMS]
    MSR[VM] ← MSR[VMS]
    Interrupt_Ack ← 11

Registers Altered

- PC
- MSR[IE], MSR[UM], MSR[VM]
- ESR[EC], in case a privileged instruction exception is generated

Latency

- 2 cycles

Note

Convention is to use general purpose register r14 as rA.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
Return from Exception

\texttt{rted} \hspace{3em} \texttt{rA, IMM}

### Description

Return from exception will branch to the location specified by the contents of \texttt{rA} plus the IMM field, sign-extended to 32 bits. The instruction will also enable exceptions after execution.

This instruction always has a delay slot. The instruction following the RTED is always executed before the branch target.

When MicroBlaze is configured to use an MMU (\texttt{C\_USE\_MMU} \geq 1) this instruction is privileged. This means that if the instruction is attempted in User Mode (\texttt{MSR[UM]} = 1) a Privileged Instruction exception occurs.

### Pseudocode

\begin{verbatim}
if MSR[UM] = 1 then
    ESR[EC] \leftarrow 00111
else
    PC \leftarrow (rA) + \text{sext}(IMM)
    allow following instruction to complete execution
    MSR[EE] \leftarrow 1
    MSR[EIP] \leftarrow 0
    MSR[UM] \leftarrow MSR[UMS]
    MSR[VM] \leftarrow MSR[VMS]
    ESR \leftarrow 0
\end{verbatim}

### Registers Altered

- \texttt{PC}
- \texttt{MSR[EE]}, \texttt{MSR[EIP]}, \texttt{MSR[UM]}, \texttt{MSR[VM]}
- \texttt{ESR}

### Latency

- 2 cycles

### Note

Convention is to use general purpose register \texttt{r17} as \texttt{rA}. This instruction requires that one or more of the MicroBlaze parameters \texttt{C\_*\_EXCEPTION} are set to 1 or that \texttt{C\_USE\_MMU} > 0.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.

The instruction should normally not be used when \texttt{MSR[EE]} is set, since if the instruction in the delay slot would cause an exception, the exception handler would be entered with exceptions enabled.

\textbf{Note:} Code returning from an exception must first check if \texttt{MSR[DS]} is set, and in that case return to the address in BTR.
rtsd

Return from Subroutine

rtsd rA, IMM

Description

Return from subroutine will branch to the location specified by the contents of rA plus the IMM field, sign-extended to 32 bits.

This instruction always has a delay slot. The instruction following the RTSD is always executed before the branch target.

Pseudocode

PC ← (rA) + sext(IMM)
allow following instruction to complete execution

Registers Altered

- PC

Latency

- 1 cycle (if successful branch prediction occurs)
- 2 cycles (with Branch Target Cache disabled)
- 3 cycles (if branch prediction mispredict occurs)

Note

Convention is to use general purpose register r15 as rA.

A delay slot must not be used by the following: imm, branch, or break instructions. Interrupts and external hardware breaks are deferred until after the delay slot branch has been completed.
sb

**Store Byte**

\[
\begin{align*}
\text{sb} & \quad rD, rA, rB \\
\text{sbr} & \quad rD, rA, rB \\
\end{align*}
\]

### Description

Stores the contents of the least significant byte of register rD, into the memory location that results from adding the contents of registers rA and rB.

If the R bit is set, a byte reversed memory location is used, storing data with the opposite endianness of the endianness defined by `C_ENDIANNESS` and the E bit (if virtual protected mode is enabled).

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if virtual protected mode is enabled, and access is prevented by no-access-allowed or read-only zone protection. No-access-allowed can only occur in user mode.

### Pseudocode

```
Addr ← (rA) + (rB)
if TLB_Miss(Addr) and MSR[VM] = 1 then
    ESR[EC] ← 10010; ESR[S] ← 1
    MSR[UMS] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else if Access.Protected(Addr) and MSR[VM] = 1 then
    ESR[EC] ← 10000; ESR[S] ← 1; ESR[DIZ] ← No-access-allowed
    MSR[UMS] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else
    Mem(Addr) ← (rD)[24:31]
```

### Registers Altered

- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if an exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated

### Latency

- 1 cycle with `C_AREA_OPTIMIZED=0`
- 2 cycles with `C_AREA_OPTIMIZED=1`

### Note

The byte reversed instruction is only valid if MicroBlaze is configured to use reorder instructions (`C_USE_REORDER_INSTR = 1`).
sbi

Store Byte Immediate

\texttt{sbi rD, rA, IMM}

<table>
<thead>
<tr>
<th>1</th>
<th>1</th>
<th>1</th>
<th>0</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>rD</td>
<td>rA</td>
<td>IMM</td>
<td></td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
<td>31</td>
</tr>
</tbody>
</table>

**Description**

Stores the contents of the least significant byte of register \( rD \), into the memory location that results from adding the contents of register \( rA \) and the value \( IMM \), sign-extended to 32 bits.

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if virtual protected mode is enabled, and access is prevented by no-access-allowed or read-only zone protection. No-access-allowed can only occur in user mode.

**Pseudocode**

\[
\text{Addr} \leftarrow (rA) + \text{sext}(IMM) \\
\text{if TLB}_\text{Miss}(\text{Addr}) \text{ and } \text{MSR}[VM] = 1 \text{ then} \\
\quad \text{ESR}[EC] \leftarrow 10010; \text{ESR}[S] \leftarrow 1 \\
\quad \text{MSR}[UMS] \leftarrow \text{MSR}[UM]; \text{MSR}[VMS] \leftarrow \text{MSR}[VM]; \text{MSR}[UM] \leftarrow 0; \text{MSR}[VM] \leftarrow 0 \\
\text{else if Access}_\text{Protected}(\text{Addr}) \text{ and } \text{MSR}[VM] = 1 \text{ then} \\
\quad \text{ESR}[EC] \leftarrow 10000; \text{ESR}[S] \leftarrow 1; \text{ESR}[DIZ] \leftarrow \text{No-access-allowed} \\
\quad \text{MSR}[UMS] \leftarrow \text{MSR}[UM]; \text{MSR}[VMS] \leftarrow \text{MSR}[VM]; \text{MSR}[UM] \leftarrow 0; \text{MSR}[VM] \leftarrow 0 \\
\text{else} \\
\quad \text{Mem}(\text{Addr}) \leftarrow (rD)[24:31]
\]

**Registers Altered**

- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if an exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated

**Latency**

- 1 cycle with \( \text{C\_AREA\_OPTIMIZED}=0 \)
- 2 cycles with \( \text{C\_AREA\_OPTIMIZED}=1 \)

**Note**

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
### sext16

#### Sign Extend Halfword

**sext16**  \( r_D, r_A \)

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
</tr>
</thead>
<tbody>
<tr>
<td>1 0 0 1 0 0</td>
<td>0 0 0 0 0 0 1 1 0 0 0 0 1</td>
<td></td>
</tr>
</tbody>
</table>

#### Description

This instruction sign-extends a halfword (16 bits) into a word (32 bits). Bit 16 in \( r_A \) will be copied into bits 0-15 of \( r_D \). Bits 16-31 in \( r_A \) will be copied into bits 16-31 of \( r_D \).

#### Pseudocode

\[
(r_D)[0:15] \leftarrow (r_A)[16] \\
(r_D)[16:31] \leftarrow (r_A)[16:31]
\]

#### Registers Altered

- \( r_D \)

#### Latency

- 1 cycle
**sext8**  
Sign Extend Byte

**sext8**  
rD, rA

| 1 | 0 | 0 | 1 | 0 | 0 | rD | rA | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| 0 | 6 | 1 | 1 | 1 | 6 | 3 | 1 | 1 |

**Description**

This instruction sign-extends a byte (8 bits) into a word (32 bits). Bit 24 in rA will be copied into bits 0-23 of rD. Bits 24-31 in rA will be copied into bits 24-31 of rD.

**Pseudocode**

\[
(rD)[0:23] \leftarrow (rA)[24] \\
(rD)[24:31] \leftarrow (rA)[24:31]
\]

**Registers Altered**

- rD

**Latency**

- 1 cycle
sh

Store Halfword

\[
\text{sh} \quad rD, rA, rB
\]

\[
\text{shr} \quad rD, rA, rB
\]

<table>
<thead>
<tr>
<th>1</th>
<th>1</th>
<th>0</th>
<th>1</th>
<th>0</th>
<th>1</th>
</tr>
</thead>
<tbody>
<tr>
<td>rD</td>
<td>rA</td>
<td>rB</td>
<td>0</td>
<td>R</td>
<td>0</td>
</tr>
</tbody>
</table>

Description

Stores the contents of the least significant halfword of register rD, into the halfword aligned memory location that results from adding the contents of registers rA and rB.

If the R bit is set, a halfword reversed memory location is used and the two bytes in the halfword are reversed, storing data with the opposite endianness of the endianness defined by \( \text{C_ENDIANNESS} \) and the E bit (if virtual protected mode is enabled).

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if virtual protected mode is enabled, and access is prevented by no-access-allowed or read-only zone protection. No-access-allowed can only occur in user mode.

An unaligned data access exception occurs if the least significant bit in the address is not zero.

Pseudocode

\[
\text{Addr} \leftarrow (rA) + (rB)
\]

if TLB_Miss(Addr) and MSR[VM] = 1 then
  ESR[EC] \leftarrow 10010; ESR[S] \leftarrow 1
  MSR[UM] \leftarrow MSR[UM]; MSR[VMS] \leftarrow MSR[VM]; MSR[UM] \leftarrow 0; MSR[VM] \leftarrow 0
else if Access Protected(Addr) and MSR[VM] = 1 then
  ESR[EC] \leftarrow 10000; ESR[S] \leftarrow 1; ESR[DIZ] \leftarrow \text{No-access-allowed}
  MSR[UM] \leftarrow MSR[UM]; MSR[VMS] \leftarrow MSR[VM]; MSR[UM] \leftarrow 0; MSR[VM] \leftarrow 0
else if Addr[31] \neq 0 then
  ESR[EC] \leftarrow 00001; ESR[W] \leftarrow 0; ESR[S] \leftarrow 1; ESR[Rx] \leftarrow rD
else
  Mem(Addr) \leftarrow (rD)[16:31]

Registers Altered

- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if a TLB miss exception or a data storage exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated
- ESR[W], ESR[Rx], if an unaligned data access exception is generated

Latency

- 1 cycle with \( \text{C\_AREA\_OPTIMIZED=0} \)
- 2 cycles with \( \text{C\_AREA\_OPTIMIZED=1} \)

Note

The halfword reversed instruction is only valid if MicroBlaze is configured to use reorder instructions (\( \text{C\_USE\_REORDER\_INSTR = 1} \)).
**shi**

### Store Halfword Immediate

**si**

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
<th>IMM</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
</tr>
</tbody>
</table>

#### Description

Stores the contents of the least significant halfword of register rD, into the halfword aligned memory location that results from adding the contents of register rA and the value IMM, sign-extended to 32 bits.

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB. A data storage exception occurs if virtual protected mode is enabled, and access is prevented by no-access-allowed or read-only zone protection. No-access-allowed can only occur in user mode. An unaligned data access exception occurs if the least significant bit in the address is not zero.

#### Pseudocode

Addr \(\leftarrow (rA) + \text{sext}(IMM)\)

if TLB_Miss(Addr) and MSR[VM] = 1 then

ESR[EC] \(\leftarrow 10010;\) ESR[S] \(\leftarrow 1\)
MSR[UMS] \(\leftarrow MSR[UM];\) MSR[VMS] \(\leftarrow MSR[VM];\) MSR[UM] \(\leftarrow 0;\) MSR[VM] \(\leftarrow 0\)
else if Access_Protected(Addr) and MSR[VM] = 1 then

ESR[EC] \(\leftarrow 10000;\) ESR[S] \(\leftarrow 1;\) ESR[DIZ] \(\leftarrow \text{No-access-allowed}\)
MSR[UMS] \(\leftarrow MSR[UM];\) MSR[VMS] \(\leftarrow MSR[VM];\) MSR[UM] \(\leftarrow 0;\) MSR[VM] \(\leftarrow 0\)
else if Addr[31] \(\neq 0\) then

ESR[EC] \(\leftarrow 00001;\) ESR[W] \(\leftarrow 0;\) ESR[S] \(\leftarrow 1;\) ESR[Rx] \(\leftarrow rD\)
else

Mem(Addr) \(\leftarrow (rD)[16:31]\)

#### Registers Altered

- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if a TLB miss exception or a data storage exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated
- ESR[W], ESR[Rx], if an unaligned data access exception is generated

#### Latency

- 1 cycle with \(\text{C\_AREA\_OPTIMIZED=0}\)
- 2 cycles with \(\text{C\_AREA\_OPTIMIZED=1}\)

#### Note

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
**sra**

**Shift Right Arithmetic**

**sra**  rD, rA

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>3</td>
</tr>
</tbody>
</table>

**Description**

Shifts arithmetically the contents of register rA, one bit to the right, and places the result in rD. The most significant bit of rA (that is, the sign bit) placed in the most significant bit of rD. The least significant bit coming out of the shift chain is placed in the Carry flag.

**Pseudocode**

\[
(rD)[0] \leftarrow (rA)[0] \\
(rD)[1:31] \leftarrow (rA)[0:30] \\
MSR[C] \leftarrow (rA)[31]
\]

**Registers Altered**

- rD
- MSR[C]

**Latency**

- 1 cycle
**Shift Right with Carry**

**src**

```
src         rD, rA
```

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>6</td>
</tr>
<tr>
<td>6</td>
<td>3</td>
<td>1</td>
</tr>
</tbody>
</table>

**Description**

Shifts the contents of register rA, one bit to the right, and places the result in rD. The Carry flag is shifted in the shift chain and placed in the most significant bit of rD. The least significant bit coming out of the shift chain is placed in the Carry flag.

**Pseudocode**

```
(rD)[0] ← MSR[C]
(rD)[1:31] ← (rA)[0:30]
MSR[C] ← (rA)[31]
```

**Registers Altered**

- rD
- MSR[C]

**Latency**

- 1 cycle
srl

Shift Right Logical

srl rD, rA

<table>
<thead>
<tr>
<th>1 0 0 1 0 0</th>
<th>rD</th>
<th>0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>6</td>
</tr>
<tr>
<td></td>
<td></td>
<td>3</td>
</tr>
</tbody>
</table>

Description

Shifts logically the contents of register rA, one bit to the right, and places the result in rD. A zero is shifted in the shift chain and placed in the most significant bit of rD. The least significant bit coming out of the shift chain is placed in the Carry flag.

Pseudocode

(rD)[0] ← 0
(rD)[1:31] ← (rA)[0:30]
MSR[C] ← (rA)[31]

Registers Altered

- rD
- MSR[C]

Latency

- 1 cycle
**SW**

**Store Word**

\[ \text{sw} \quad rD, rA, rB \]

\[ \text{swr} \quad rD, rA, rB \]

<table>
<thead>
<tr>
<th>1 1 0 1 1 0</th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
<th>0 R 0 0 0 0 0</th>
<th>0 0 0 0</th>
</tr>
</thead>
<tbody>
<tr>
<td>6</td>
<td>11</td>
<td>16</td>
<td>21</td>
<td>31</td>
<td></td>
</tr>
</tbody>
</table>

**Description**

Stores the contents of register \( rD \), into the word aligned memory location that results from adding the contents of registers \( rA \) and \( rB \).

If the R bit is set, the bytes in the stored word are reversed, storing data with the opposite endianness of the endianness defined by \( \text{C_ENDIANNESS} \) and the E bit (if virtual protected mode is enabled).

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if virtual protected mode is enabled, and access is prevented by no-access-allowed or read-only zone protection. No-access-allowed can only occur in user mode.

An unaligned data access exception occurs if the two least significant bits in the address are not zero.

**Pseudocode**

\[
\text{Addr} \leftarrow (rA) + (rB) \\
\text{if TLB\_Miss(Addr) and MSR[VM] = 1 then} \\
\quad \text{ESR[EC]} \leftarrow 10010; \text{ESR[S]} \leftarrow 1 \\
\quad \text{MSR[UMS]} \leftarrow \text{MSR[UM]}; \text{MSR[VMS]} \leftarrow \text{MSR[VM]}; \text{MSR[UM]} \leftarrow 0; \text{MSR[VM]} \leftarrow 0 \\
\text{else if Access\_Protected(Addr) and MSR[VM] = 1 then} \\
\quad \text{ESR[EC]} \leftarrow 10000; \text{ESR[S]} \leftarrow 1; \text{ESR[DIZ]} \leftarrow \text{No-access-allowed} \\
\quad \text{MSR[UMS]} \leftarrow \text{MSR[UM]}; \text{MSR[VMS]} \leftarrow \text{MSR[VM]}; \text{MSR[UM]} \leftarrow 0; \text{MSR[VM]} \leftarrow 0 \\
\text{else if Addr[30:31] \neq 0 then} \\
\quad \text{ESR[EC]} \leftarrow 00001; \text{ESR[W]} \leftarrow 1; \text{ESR[S]} \leftarrow 1; \text{ESR[Rx]} \leftarrow rD \\
\text{else} \\
\quad \text{Mem(Addr)} \leftarrow (rD)[0:31]
\]

**Registers Altered**

- \( \text{MSR[UM]}, \text{MSR[VM]}, \text{MSR[UMS]}, \text{MSR[VMS]} \), if a TLB miss exception or a data storage exception is generated
- \( \text{ESR[EC]}, \text{ESR[S]} \), if an exception is generated
- \( \text{ESR[DIZ]} \), if a data storage exception is generated
- \( \text{ESR[W]}, \text{ESR[Rx]} \), if an unaligned data access exception is generated

**Latency**

- 1 cycle with \( \text{C\_AREA\_OPTIMIZED}=0 \)
- 2 cycles with \( \text{C\_AREA\_OPTIMIZED}=1 \)

**Note**

The word reversed instruction is only valid if MicroBlaze is configured to use reorder instructions (\( \text{C\_USE\_REORDER\_INSTR} = 1 \)).
### swapb

**Swap Bytes**

**swapb** \[ rD, rA \]

<table>
<thead>
<tr>
<th>0 0 0 1 0 0</th>
<th></th>
<th>0 0 0 0 0 0 0 0 0 1 1 1 0 0 0 0 0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>3</td>
</tr>
</tbody>
</table>

#### Description

Swaps the contents of register \( rA \) treated as four bytes, and places the result in \( rD \). This effectively converts the byte sequence in the register between endianness formats, either from little-endian to big-endian or vice versa.

#### Pseudocode

\[
\begin{align*}
(rD)[24:31] & \leftarrow (rA)[0:7] \\
(rD)[16:23] & \leftarrow (rA)[8:15] \\
(rD)[8:15]  & \leftarrow (rA)[16:23] \\
(rD)[0:7]   & \leftarrow (rA)[24:31]
\end{align*}
\]

#### Registers Altered

- \( rD \)

#### Latency

- 1 cycle

#### Note

This instruction is only valid if MicroBlaze is configured to use reorder instructions \( (C\_USE\_REORDER\_INSTR = 1) \).
### Instructions

**swaph**

**Swap Halfwords**

**Syntax**

```
swaph rD, rA
```

**Pseudocode**

```
(rD)[0:15] ← (rA)[16:31]
(rD)[16:31] ← (rA)[0:15]
```

**Description**

Swaps the contents of register rA treated as two halfwords, and places the result in rD. This effectively converts the two halfwords in the register between endianness formats, either from little-endian to big-endian or vice versa.

**Registers Altered**

- rD

**Latency**

- 1 cycle

**Note**

This instruction is only valid if MicroBlaze is configured to use reorder instructions (C_USE_REORDER_INSTR = 1).
Description

Stores the contents of register rD, into the word aligned memory location that results from adding the contents of registers rA and the value IMM, sign-extended to 32 bits.

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if virtual protected mode is enabled, and access is prevented by no-access-allowed or read-only zone protection. No-access-allowed can only occur in user mode.

An unaligned data access exception occurs if the two least significant bits in the address are not zero.

Pseudocode

```
Addr ← (rA) + sext(IMM)
if TLB_Miss(Addr) and MSR[VM] = 1 then
    ESR[EC]← 10010; ESR[S]← 1
    MSR[UMS] ← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else if Access_Protected(Addr) and MSR[VM] = 1 then
    ESR[EC]← 10000; ESR[S]← 1; ESR[DIZ] ← No-access-allowed
    MSR[UMS]← MSR[UM]; MSR[VMS] ← MSR[VM]; MSR[UM] ← 0; MSR[VM] ← 0
else if Addr[30:31] ≠ 0 then
    ESR[EC] ← 00001; ESR[W] ← 1; ESR[S] ← 1; ESR[Rx] ← rD
else
    Mem(Addr) ← (rD)[0:31]
```

Register Altered

- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if a TLB miss exception or a data storage exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated
- ESR[W], ESR[Rx], if an unaligned data access exception is generated

Latency

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

Note

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
**SWX**

**Store Word Exclusive**

\[ \text{swx} \ rD, rA, rB \]

<table>
<thead>
<tr>
<th>1</th>
<th>1</th>
<th>0</th>
<th>1</th>
<th>1</th>
<th>0</th>
<th>1</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>11</td>
<td>16</td>
<td>21</td>
<td>31</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

**Description**

Conditionally stores the contents of register \( rD \), into the word aligned memory location that results from adding the contents of registers \( rA \) and \( rB \). If an AXI4 interconnect with exclusive access enabled is used, the store occurs if the interconnect response is EXOKAY, and the reservation bit is set; otherwise the store occurs when the reservation bit is set. The carry flag (MSR\([C]\)) is set if the store does not occur, otherwise it is cleared. The reservation bit is cleared.

A data TLB miss exception occurs if virtual protected mode is enabled, and a valid translation entry corresponding to the address is not found in the TLB.

A data storage exception occurs if virtual protected mode is enabled, and access is prevented by no-access-allowed or read-only zone protection. No-access-allowed can only occur in user mode.

An unaligned data access exception will not occur even if the two least significant bits in the address are not zero.

Enabling AXI exclusive access ensures that the operation is protected from other bus masters, but requires that the addressed slave supports exclusive access. When exclusive access is not enabled, only the internal reservation bit is used. Exclusive access is enabled using the two parameters \text{C\_M\_AXI\_DP\_EXCLUSIVE\_ACCESS} and \text{C\_M\_AXI\_DC\_EXCLUSIVE\_ACCESS} for the peripheral and cache interconnect, respectively.

**Pseudocode**

\[
\text{Addr} \leftarrow (rA) + (rB) \\
\text{if Reservation} = 0 \text{ then} \\
\quad \text{MSR}[C] \leftarrow 1 \\
\text{else} \\
\quad \text{if TLB\_Miss(Addr) and MSR}[VM] = 1 \text{ then} \\
\quad \\
\quad \text{ESR}[EC] \leftarrow 10011; \text{ESR}[S] \leftarrow 1 \\
\quad \text{MSR}[UMS] \leftarrow \text{MSR}[UM]; \text{MSR}[VMS] \leftarrow \text{MSR}[VM]; \text{MSR}[UM] \leftarrow 0; \text{MSR}[VM] \leftarrow 0 \\
\quad \text{else if Access\_Protected(Addr) and MSR}[VM] = 1 \text{ then} \\
\quad \\
\quad \text{ESR}[EC] \leftarrow 10000; \text{ESR}[S] \leftarrow 1; \text{ESR}[DIZ] \leftarrow \text{No-access-allowed} \\
\quad \text{MSR}[UMS] \leftarrow \text{MSR}[UM]; \text{MSR}[VMS] \leftarrow \text{MSR}[VM]; \text{MSR}[UM] \leftarrow 0; \text{MSR}[VM] \leftarrow 0 \\
\quad \text{else} \\
\quad \quad \text{Reservation} \leftarrow 0 \\
\quad \text{if AXI\_Exclusive\_Used(Addr) \&\& AXI\_Response} /\neq \text{EXOKAY then} \\
\quad \quad \text{MSR}[C] \leftarrow 1 \\
\quad \text{else} \\
\quad \quad \text{Mem(Addr)} \leftarrow (rD)[0:31] \\
\quad \quad \text{MSR}[C] \leftarrow 0
\]
Chapter 5: MicroBlaze Instruction Set Architecture

Registers Altered

- MSR[C], unless an exception is generated
- MSR[UM], MSR[VM], MSR[UMS], MSR[VMS], if a TLB miss exception or a data storage exception is generated
- ESR[EC], ESR[S], if an exception is generated
- ESR[DIZ], if a data storage exception is generated

Latency

- 1 cycle with C_AREA_OPTIMIZED=0
- 2 cycles with C_AREA_OPTIMIZED=1

Note

This instruction is used together with LWX to implement exclusive access, such as semaphores and spinlocks.

The carry flag (MSR[C]) may not be set immediately (dependent on pipeline stall behavior). The SWX instruction should not be immediately followed by an SRC instruction, to ensure the correct value of the carry flag is obtained.
**wdc**

Write to Data Cache

```
wdc       rA,rB
wdc.flush rA,rB
wdc.clear rA,rB
```

### Description

Write into the data cache tag to invalidate or flush a cache line. The mnemonic `wdc.flush` is used to set the F bit, and `wdc.clear` is used to set the T bit.

When `C_DCACHE_USE_WRITEBACK` is set to 1, the instruction will flush the cache line and invalidate it if the F bit is set, otherwise it will only invalidate the cache line and discard any data that has not been written to memory. If the T bit is set, only a cache line with a matching address is invalidated. Register `rA` added with `rB` is the address of the affected cache line.

When `C_DCACHE_USE_WRITEBACK` is cleared to 0, the instruction will always invalidate the cache line. Register `rA` contains the address of the affected cache line, and the register `rB` value is not used.

When MicroBlaze is configured to use an MMU (`C_USE_MMU >= 1`) the instruction is privileged. This means that if the instruction is attempted in User Mode (MSR[UM] = 1) a Privileged Instruction exception occurs.

### Pseudocode

```
if MSR[UM] = 1 then
    ESR[EC] ← 00111
else
    if C_DCACHE_USE_WRITEBACK = 1 then
        address ← (Ra) + (Rb)
    else
        address ← (Ra)
    if C_DCACHE_LINE_LEN = 4 then
        cacheline_mask ← (1 << log2(C_DCACHE_BYTE_SIZE) - 4) - 1
        cacheline ← (DCache Line)[(address >> 4) ∧ cacheline_mask]
        cacheline_addr ← address & 0xffffffff
    if C_DCACHE_LINE_LEN = 8 then
        cacheline_mask ← (1 << log2(C_DCACHE_BYTE_SIZE) - 5) - 1
        cacheline ← (DCache Line)[(address >> 5) ∧ cacheline_mask]
        cacheline_addr ← address & 0xffffffff
    if F = 1 and cacheline.Dirty then
        for i = 0 .. C_DCACHE_LINE_LEN - 1 loop
            if cacheline.Valid[i] then
                Mem(cacheline_addr + i * 4) ← cacheline.Data[i]
            if T = 0 then
                cacheline.Tag ← 0
            else if cacheline.Address = cacheline_addr then
                cacheline.Tag ← 0
```
Chapter 5: MicroBlaze Instruction Set Architecture

Registers Altered

- ESR[EC], in case a privileged instruction exception is generated

Latency

- 2 cycles for wdc.clear
- 2 cycles for wdc with C_AREA_OPTIMIZED=1
- 3 cycles for wdc with C_AREA_OPTIMIZED=0
- $2 + N$ cycles for wdc.flush, where $N$ is the number of clock cycles required to flush the cache line to memory when necessary

Note

The wdc, wdc.flush and wdc.clear instructions are independent of data cache enable (MSR[DCE]), and can be used either with the data cache enabled or disabled.

The wdc.clear instruction is intended to invalidate a specific area in memory, for example a buffer to be written by a Direct Memory Access device. Using this instruction ensures that other cache lines are not inadvertently invalidated, erroneously discarding data that has not yet been written to memory.

The address of the affected cache line is always the physical address, independent of the parameter C_USE_MMU and whether the MMU is in virtual mode or real mode.

When using wdc.flush in a loop to flush the entire cache, the loop can be optimized by using Ra as the cache base address and Rb as the loop counter:

```
addik r5,r0,C_DCACHE_BASEADDR
addik r6,r0,C_DCACHE_BYTE_SIZE-C_DCACHE_LINE_LEN*4
loop: wdc.flush r5,r6
bgtid r6,loop
addik r6,r6,-C_DCACHE_LINE_LEN*4
```

When using wdc.clear in a loop to invalidate a memory area in the cache, the loop can be optimized by using Ra as the memory area base address and Rb as the loop counter:

```
addik r5,r0,memory_area_base_address
addik r6,r0,memory_area_byte_size-C_DCACHE_LINE_LEN*4
loop: wdc.clear r5,r6
bgtid r6,loop
addik r6,r6,-C_DCACHE_LINE_LEN*4
```
Instructions

**wic**  
*Write to Instruction Cache*

**wic**

<table>
<thead>
<tr>
<th></th>
<th></th>
<th>rA</th>
<th></th>
<th></th>
<th>rB</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>3</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>3</td>
<td>1</td>
</tr>
</tbody>
</table>

**Description**

Write into the instruction cache tag to invalidate a cache line. The register rB value is not used. Register rA contains the address of the affected cache line.

When MicroBlaze is configured to use an MMU (C_USE_MMU \( \geq 1 \)) this instruction is privileged. This means that if the instruction is attempted in User Mode (MSR[UM] = 1) a Privileged Instruction exception occurs.

**Pseudocode**

```plaintext
if MSR[UM] = 1 then
  ESR[EC] \leftarrow 00111
else
  if C_ICACHE_LINE_LEN = 4 then
    cacheline_mask \leftarrow (1 << \log_2(CACHE_BYTE_SIZE) - 4) - 1
    (ICache Line)[((Ra) >> 4) \& cacheline_mask].Tag \leftarrow 0
  else
    if C_ICACHE_LINE_LEN = 8 then
      cacheline_mask \leftarrow (1 << \log_2(CACHE_BYTE_SIZE) - 5) - 1
      (ICache Line)[((Ra) >> 5) \& cacheline_mask].Tag \leftarrow 0
```

**Registers Altered**

- ESR[EC], in case a privileged instruction exception is generated

**Latency**

- 2 cycles

**Note**

The WIC instruction is independent of instruction cache enable (MSR[ICE]), and can be used either with the instruction cache enabled or disabled.

The address of the affected cache line is the virtual address when the parameter C_USE_MMU = 3 (VIRTUAL) and the MMU is in virtual mode, otherwise it is the physical address.
**xor**

**Logical Exclusive OR**

```
xor rD, rA, rB
```

<table>
<thead>
<tr>
<th></th>
<th>rD</th>
<th>rA</th>
<th>rB</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>2</td>
</tr>
<tr>
<td>1</td>
<td>6</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
</tbody>
</table>

**Description**

The contents of register rA are XORed with the contents of register rB; the result is placed into register rD.

**Pseudocode**

```
(rD) ← (rA) ⊕ (rB)
```

**Registers Altered**

- rD

**Latency**

- 1 cycle
The IMM field is extended to 32 bits by concatenating 16 0-bits on the left. The contents of register rA are XOR’ed with the extended IMM field; the result is placed into register rD.

Pseudocode

\[(rD) \leftarrow (rA) \oplus \text{sext}(\text{IMM})\]

Registers Altered

- rD

Latency

- 1 cycle

Note

By default, Type B Instructions will take the 16-bit IMM field value and sign extend it to 32 bits to use as the immediate operand. This behavior can be overridden by preceding the Type B instruction with an imm instruction. See the instruction “imm,” page 198 for details on using 32-bit immediate values.
Appendix A

Additional Resources

EDK Documentation

The following documents are available in your EDK installation. You can also access the entire documentation set online at http://www.xilinx.com/ise/embedded/edk_docs.htm.

Relevant individual documents are listed below.

- EDK Concepts, Tools, and Techniques (UG683)
  Note: The accompanying design files are in edk_ctt.zip.
- Embedded System Tools Reference Manual (UG111)
- Platform Specification Format Reference Manual (UG642)
- XPS Help
- SDK Help
- PowerPC 405 Processor Reference Guide (UG011)

Additional Resources

The following lists some of the resources you can access directly using the provided URLs.

- The entire set of GNU manuals: http://www.gnu.org/manual
- Xilinx Data Sheets: http://www.xilinx.com/support/documentation/data_sheets.htm
- Xilinx Documentation: http://www.xilinx.com/support/documentation
- Xilinx Support: http://www.xilinx.com/support