doc/contributing/fpga/ref_manual_fpga.md - 3p/lowrisc/opentitan - Git at Google

 # FPGA Reference Manual

 This manual provides additional usage details about the FPGA.
 Specifically, it provides instructions on SW development flows and testing procedures.
 Refer to the [FPGA Setup](../../guides/getting_started/src/setup_fpga.md) guide for more information on initial setup.

 ## FPGA SW Development Flow

 The FPGA is meant for both boot ROM and general software development.
 The flow for each is different, as the boot ROM is meant to be fairly static while general software can change very frequently.
 However, for both flows, Vivado must be installed, with instructions described [here](../../guides/getting_started/src/install_vivado/README.md).

 ### Boot ROM development

 The FPGA bitstream is built after compiling whatever code is sitting in `sw/device/lib/testing/test_rom`.
 This binary is used to initialize internal FPGA memory and is part of the bitstream directly.

 To update this content without rebuilding the FPGA, a flow is required to splice a new boot ROM binary into the bitstream.
 First, you must splice the new content into an existing bitstream.
 Then, you can flash the new bitstream onto the FPGA with `opentitantool`.

 There are two ways to splice content into a bitstream.

 1. Define a Bazel target (or use an existing one).
 For example, see the `//hw/bitstream:rom` target defined in [hw/bitstream/BUILD](https://github.com/lowRISC/opentitan/blob/master/hw/bitstream/BUILD).

 2. Use the [`splice_rom.sh`](https://github.com/lowRISC/opentitan/blob/master/util/fpga/splice_rom.sh) script.

     There are two prerequisites in order for this flow to work:

     * The boot ROM during the build process must be correctly inferred by the tool.
       * See [prim_rom](https://github.com/lowRISC/opentitan/blob/master/hw/ip/prim_generic/rtl/prim_generic_rom.sv) and [vivado_hook_opt_design_post.tcl](https://github.com/lowRISC/opentitan/blob/master/hw/top_earlgrey/util/vivado_hook_opt_design_post.tcl).
     * The MMI file outlining the physical boot ROM placement and mapping to FPGA block RAM primitives needs to be generated by the tool.
       * See [vivado_hook_write_bitstream_pre.tcl](https://github.com/lowRISC/opentitan/blob/master/hw/top_earlgrey/util/vivado_hook_write_bitstream_pre.tcl).

     With these steps in place, a script can be invoked to take a new binary and push its contents into an existing bitfile.
     For details, please see the [`splice_rom.sh`](https://github.com/lowRISC/opentitan/blob/master/util/fpga/splice_rom.sh) script.

     See example below:

     ```console
     $ cd $REPO_TOP
     $ ./util/fpga/splice_rom.sh
     $ bazel run //sw/host/opentitantool fpga load-bitstream build/lowrisc_systems_chip_earlgrey_cw310_0.1/synth-vivado/lowrisc_systems_chip_earlgrey_cw310_0.1.bit
     ```

     The script assumes that there is an existing bitfile `build/lowrisc_systems_chip_earlgrey_cw310_0.1/synth-vivado/lowrisc_systems_chip_earlgrey_cw310_0.1.bit` (this is created after following the steps in [FPGA Setup](../../guides/getting_started/src/setup_fpga.md)).

     The script assumes that there is an existing boot ROM image under `build-bin/sw/device/lib/testing/test_rom` and then creates a new bitfile of the same name at the same location.
     The original input bitfile is moved to `build/lowrisc_systems_chip_earlgrey_cw310_0.1/synth-vivado/lowrisc_systems_chip_earlgrey_cw310_0.1.bit.orig`.

     `opentitantool` can then be used to directly flash the updated bitstream to the FPGA.

 ### General Software Development

 After building, the FPGA bitstream contains only the boot ROM.
 Using this boot ROM, the FPGA is able to load additional software to the emulated flash, such as software in the `sw/device/benchmark`, `sw/device/examples` and `sw/device/tests` directories.
 To load additional software, `opentitantool` is required.

 Also the binary you wish to load needs to be built first.
 For the purpose of this demonstration, we will use `sw/device/examples/hello_world`, but it applies to any software image that is able to fit in the emulated flash space.
 The example below builds the `hello_world` image and loads it onto the FPGA.

 ```console
 $ cd ${REPO_TOP}
 $ bazel run //sw/host/opentitantool fpga set-pll # This needs to be done only once.
 $ bazel build //sw/device/examples/hello_world:hello_world_fpga_cw310_bin
 $ bazel run //sw/host/opentitantool bootstrap $(ci/scripts/target-location.sh //sw/device/examples/hello_world:hello_world_fpga_cw310_bin)
 ```

 Uart output:
 ```
 I00000 test_rom.c:81] Version: earlgrey_silver_release_v5-5886-gde4cb1bb9, Build Date: 2022-06-13 09:17:56
 I00001 test_rom.c:87] TestROM:6b2ca9a1
 I00000 test_rom.c:81] Version: earlgrey_silver_release_v5-5886-gde4cb1bb9, Build Date: 2022-06-13 09:17:56
 I00001 test_rom.c:87] TestROM:6b2ca9a1
 I00002 test_rom.c:118] Test ROM complete, jumping to flash!
 I00000 hello_world.c:66] Hello World!
 I00001 hello_world.c:67] Built at: Jun 13 2022, 14:16:59
 I00002 demos.c:18] Watch the LEDs!
 I00003 hello_world.c:74] Try out the switches on the board
 I00004 hello_world.c:75] or type anything into the console window.
 I00005 hello_world.c:76] The LEDs show the ASCII code of the last character.
 ```

 For more details on the exact operation of the loading flow and how the boot ROM processes incoming data, please refer to the [boot ROM readme](https://github.com/lowRISC/opentitan/tree/master/sw/device/lib/testing/test_rom).

 ### Accelerating `git bisect` with the bitstream cache

 To set the stage, let's say you've discovered a test regression.
 The test used to pass on `GOOD_COMMIT`, but now it fails at `BAD_COMMIT`.
 Your goal is to find the *first bad commit*.

 In general, a linear search from `GOOD_COMMIT` to `BAD_COMMIT` is one of the slowest ways to find the first bad commit.
 We can save time by testing fewer commits with `git bisect`, which effectively applies binary search to the range of commits.
 We can save even more time by leveraging the bitstream cache with **[`//util/fpga:bitstream_bisect`](https://github.com/lowRISC/opentitan/tree/master/util/fpga/bitstream_bisect.py)**.

 The `:bitstream_bisect` tool is faster than regular `git bisect` because it restricts itself to cached bitstreams until it can make no more progress.
 Building a bitstream is many times slower than running a test (hours compared to minutes), and `git bisect` has no idea that some commits will be faster to classify than others due to the bitstream cache.

 For example, suppose that `//sw/device/tests:uart_smoketest` has regressed sometime in the last 30 commits.
 The following command could easily save hours compared to a naive `git bisect`:

 ```sh
 # This will use the fast command to classify commits with cached bitstreams. If
 # the results are ambiguous, it will narrow them down with the slow command.
 ./bazelisk.sh run //util/fpga:bitstream_bisect -- \
     --good HEAD~30 \
     --bad HEAD \
     --fast-command "./bazelisk.sh test //sw/device/tests:uart_smoketest_fpga_cw310_rom" \
     --slow-command "./bazelisk.sh test --define bitstream=vivado //sw/device/tests:uart_smoketest_fpga_cw310_rom"
 ```

 One caveat is that neither `git bisect` nor `:bitstream_bisect` will help if the FPGA somehow retains state between tests.
 That is, if the test command bricks the FPGA causing future tests to fail, bisection will return entirely bogus results.
 We plan to add a "canary" feature to `:bitstream_bisect` that will abort the bisect when FPGA flakiness is detected (issue [#16788](https://github.com/lowRISC/opentitan/issues/16788)).
 For now, if you suspect this kind of FPGA flakiness, the best strategy may be a linear progression from `GOOD_COMMIT` to `BAD_COMMIT`.

 Note that the slow command doesn't necessarily have to build a bitstream.
 If you don't have a Vivado license and the test regression is reproducible in Verilator, it could make sense to fall back to the Verilated test.
 Building on the example above, you could replace the slow command with `"./bazelisk.sh test //sw/device/tests:uart_smoketest_sim_verilator"` and the `:bitstream_bisect` tool would never build any bitstreams.

 For more information, consult the `:bitstream_bisect` tool directly!

 ```sh
 ./bazelisk.sh run //util/fpga:bitstream_bisect -- --help
 ```

 ## Implementation of Bitstream Caching and Splicing

 This section gives an overview of where bitstreams are generated, how they are uploaded to the GCP cache, and how Bazel reaches into the cache.

 OpenTitan runs CI tasks on Azure Pipelines that build FPGA bitstreams.
 A full bitstream build can take hours, so we cache the output artifacts in a GCS bucket.
 These cached bitstreams can be downloaded and used as-is, or we can splice in freshly-compiled components, including the ROM and the OTP image.

 ### Building bitstreams on CI and uploading artifacts to GCS

 The `chip_earlgrey_cw310` CI job builds the `//hw/bitstream/vivado:standard` target which will build a bitstream with the test ROM and RMA OTP image.
 This target will also produce bitstreams with the ROM spliced in and the DEV OTP image spliced in.
 The following files are produced as a result:

 * `fpga_cw310_rom.bit` (ROM, RMA OTP image)
 * `fpga_cw310_rom_otp_dev.bit` (ROM, DEV OTP image)
 * `lowrisc_systems_chip_earlgrey_cw310_0.1.bit` (test ROM, RMA OTP image)
 * `otp.mmi`
 * `rom.mmi`

 If CI is working on the `master` branch, it puts selected build artifacts into a tarball, which it then uploads to the GCS bucket. The latest tarball is available here: https://storage.googleapis.com/opentitan-bitstreams/master/bitstream-latest.tar.gz

 ### Exposing GCS-cached artifacts to Bazel

 The `@bitstreams//` workspace contains autogenerated Bazel targets for the GCS-cached artifacts.
 This magic happens in `rules/scripts/bitstreams_workspace.py`.
 Under the hood, it fetches the latest tarball from the GCS bucket and constructs a BUILD file that defines one target per artifact.

 One meta-level up, we have targets in `//hw/bitstream` that decide whether to use cached artifacts or to build them from scratch.
 By default, these targets use cached artifacts by pulling in their corresponding `@bitstreams//` targets.

 ### Related work

 * TODO Define the new naming scheme.
   https://github.com/lowRISC/opentitan/issues/13807
	# FPGA Reference Manual

	This manual provides additional usage details about the FPGA.
	Specifically, it provides instructions on SW development flows and testing procedures.
	Refer to the [FPGA Setup](../../guides/getting_started/src/setup_fpga.md) guide for more information on initial setup.

	## FPGA SW Development Flow

	The FPGA is meant for both boot ROM and general software development.
	The flow for each is different, as the boot ROM is meant to be fairly static while general software can change very frequently.
	However, for both flows, Vivado must be installed, with instructions described [here](../../guides/getting_started/src/install_vivado/README.md).

	### Boot ROM development

	The FPGA bitstream is built after compiling whatever code is sitting in `sw/device/lib/testing/test_rom`.
	This binary is used to initialize internal FPGA memory and is part of the bitstream directly.

	To update this content without rebuilding the FPGA, a flow is required to splice a new boot ROM binary into the bitstream.
	First, you must splice the new content into an existing bitstream.
	Then, you can flash the new bitstream onto the FPGA with `opentitantool`.

	There are two ways to splice content into a bitstream.

	1. Define a Bazel target (or use an existing one).
	For example, see the `//hw/bitstream:rom` target defined in [hw/bitstream/BUILD](https://github.com/lowRISC/opentitan/blob/master/hw/bitstream/BUILD).

	2. Use the [`splice_rom.sh`](https://github.com/lowRISC/opentitan/blob/master/util/fpga/splice_rom.sh) script.

	There are two prerequisites in order for this flow to work:

	* The boot ROM during the build process must be correctly inferred by the tool.
	* See [prim_rom](https://github.com/lowRISC/opentitan/blob/master/hw/ip/prim_generic/rtl/prim_generic_rom.sv) and [vivado_hook_opt_design_post.tcl](https://github.com/lowRISC/opentitan/blob/master/hw/top_earlgrey/util/vivado_hook_opt_design_post.tcl).
	* The MMI file outlining the physical boot ROM placement and mapping to FPGA block RAM primitives needs to be generated by the tool.
	* See [vivado_hook_write_bitstream_pre.tcl](https://github.com/lowRISC/opentitan/blob/master/hw/top_earlgrey/util/vivado_hook_write_bitstream_pre.tcl).

	With these steps in place, a script can be invoked to take a new binary and push its contents into an existing bitfile.
	For details, please see the [`splice_rom.sh`](https://github.com/lowRISC/opentitan/blob/master/util/fpga/splice_rom.sh) script.

	See example below:

	```console
	$ cd $REPO_TOP
	$ ./util/fpga/splice_rom.sh
	$ bazel run //sw/host/opentitantool fpga load-bitstream build/lowrisc_systems_chip_earlgrey_cw310_0.1/synth-vivado/lowrisc_systems_chip_earlgrey_cw310_0.1.bit
	```

	The script assumes that there is an existing bitfile `build/lowrisc_systems_chip_earlgrey_cw310_0.1/synth-vivado/lowrisc_systems_chip_earlgrey_cw310_0.1.bit` (this is created after following the steps in [FPGA Setup](../../guides/getting_started/src/setup_fpga.md)).

	The script assumes that there is an existing boot ROM image under `build-bin/sw/device/lib/testing/test_rom` and then creates a new bitfile of the same name at the same location.
	The original input bitfile is moved to `build/lowrisc_systems_chip_earlgrey_cw310_0.1/synth-vivado/lowrisc_systems_chip_earlgrey_cw310_0.1.bit.orig`.

	`opentitantool` can then be used to directly flash the updated bitstream to the FPGA.

	### General Software Development

	After building, the FPGA bitstream contains only the boot ROM.
	Using this boot ROM, the FPGA is able to load additional software to the emulated flash, such as software in the `sw/device/benchmark`, `sw/device/examples` and `sw/device/tests` directories.
	To load additional software, `opentitantool` is required.

	Also the binary you wish to load needs to be built first.
	For the purpose of this demonstration, we will use `sw/device/examples/hello_world`, but it applies to any software image that is able to fit in the emulated flash space.
	The example below builds the `hello_world` image and loads it onto the FPGA.

	```console
	$ cd ${REPO_TOP}
	$ bazel run //sw/host/opentitantool fpga set-pll # This needs to be done only once.
	$ bazel build //sw/device/examples/hello_world:hello_world_fpga_cw310_bin
	$ bazel run //sw/host/opentitantool bootstrap $(ci/scripts/target-location.sh //sw/device/examples/hello_world:hello_world_fpga_cw310_bin)
	```

	Uart output:
	```
	I00000 test_rom.c:81] Version: earlgrey_silver_release_v5-5886-gde4cb1bb9, Build Date: 2022-06-13 09:17:56
	I00001 test_rom.c:87] TestROM:6b2ca9a1
	I00000 test_rom.c:81] Version: earlgrey_silver_release_v5-5886-gde4cb1bb9, Build Date: 2022-06-13 09:17:56
	I00001 test_rom.c:87] TestROM:6b2ca9a1
	I00002 test_rom.c:118] Test ROM complete, jumping to flash!
	I00000 hello_world.c:66] Hello World!
	I00001 hello_world.c:67] Built at: Jun 13 2022, 14:16:59
	I00002 demos.c:18] Watch the LEDs!
	I00003 hello_world.c:74] Try out the switches on the board
	I00004 hello_world.c:75] or type anything into the console window.
	I00005 hello_world.c:76] The LEDs show the ASCII code of the last character.
	```

	For more details on the exact operation of the loading flow and how the boot ROM processes incoming data, please refer to the [boot ROM readme](https://github.com/lowRISC/opentitan/tree/master/sw/device/lib/testing/test_rom).

	### Accelerating `git bisect` with the bitstream cache

	To set the stage, let's say you've discovered a test regression.
	The test used to pass on `GOOD_COMMIT`, but now it fails at `BAD_COMMIT`.
	Your goal is to find the first bad commit.

	In general, a linear search from `GOOD_COMMIT` to `BAD_COMMIT` is one of the slowest ways to find the first bad commit.
	We can save time by testing fewer commits with `git bisect`, which effectively applies binary search to the range of commits.
	We can save even more time by leveraging the bitstream cache with [`//util/fpga:bitstream_bisect`](https://github.com/lowRISC/opentitan/tree/master/util/fpga/bitstream_bisect.py).

	The `:bitstream_bisect` tool is faster than regular `git bisect` because it restricts itself to cached bitstreams until it can make no more progress.
	Building a bitstream is many times slower than running a test (hours compared to minutes), and `git bisect` has no idea that some commits will be faster to classify than others due to the bitstream cache.

	For example, suppose that `//sw/device/tests:uart_smoketest` has regressed sometime in the last 30 commits.
	The following command could easily save hours compared to a naive `git bisect`:

	```sh
	# This will use the fast command to classify commits with cached bitstreams. If
	# the results are ambiguous, it will narrow them down with the slow command.
	./bazelisk.sh run //util/fpga:bitstream_bisect -- \
	--good HEAD~30 \
	--bad HEAD \
	--fast-command "./bazelisk.sh test //sw/device/tests:uart_smoketest_fpga_cw310_rom" \
	--slow-command "./bazelisk.sh test --define bitstream=vivado //sw/device/tests:uart_smoketest_fpga_cw310_rom"
	```

	One caveat is that neither `git bisect` nor `:bitstream_bisect` will help if the FPGA somehow retains state between tests.
	That is, if the test command bricks the FPGA causing future tests to fail, bisection will return entirely bogus results.
	We plan to add a "canary" feature to `:bitstream_bisect` that will abort the bisect when FPGA flakiness is detected (issue [#16788](https://github.com/lowRISC/opentitan/issues/16788)).
	For now, if you suspect this kind of FPGA flakiness, the best strategy may be a linear progression from `GOOD_COMMIT` to `BAD_COMMIT`.

	Note that the slow command doesn't necessarily have to build a bitstream.
	If you don't have a Vivado license and the test regression is reproducible in Verilator, it could make sense to fall back to the Verilated test.
	Building on the example above, you could replace the slow command with `"./bazelisk.sh test //sw/device/tests:uart_smoketest_sim_verilator"` and the `:bitstream_bisect` tool would never build any bitstreams.

	For more information, consult the `:bitstream_bisect` tool directly!

	```sh
	./bazelisk.sh run //util/fpga:bitstream_bisect -- --help
	```

	## Implementation of Bitstream Caching and Splicing

	This section gives an overview of where bitstreams are generated, how they are uploaded to the GCP cache, and how Bazel reaches into the cache.

	OpenTitan runs CI tasks on Azure Pipelines that build FPGA bitstreams.
	A full bitstream build can take hours, so we cache the output artifacts in a GCS bucket.
	These cached bitstreams can be downloaded and used as-is, or we can splice in freshly-compiled components, including the ROM and the OTP image.

	### Building bitstreams on CI and uploading artifacts to GCS

	The `chip_earlgrey_cw310` CI job builds the `//hw/bitstream/vivado:standard` target which will build a bitstream with the test ROM and RMA OTP image.
	This target will also produce bitstreams with the ROM spliced in and the DEV OTP image spliced in.
	The following files are produced as a result:

	* `fpga_cw310_rom.bit` (ROM, RMA OTP image)
	* `fpga_cw310_rom_otp_dev.bit` (ROM, DEV OTP image)
	* `lowrisc_systems_chip_earlgrey_cw310_0.1.bit` (test ROM, RMA OTP image)
	* `otp.mmi`
	* `rom.mmi`

	If CI is working on the `master` branch, it puts selected build artifacts into a tarball, which it then uploads to the GCS bucket. The latest tarball is available here: https://storage.googleapis.com/opentitan-bitstreams/master/bitstream-latest.tar.gz

	### Exposing GCS-cached artifacts to Bazel

	The `@bitstreams//` workspace contains autogenerated Bazel targets for the GCS-cached artifacts.
	This magic happens in `rules/scripts/bitstreams_workspace.py`.
	Under the hood, it fetches the latest tarball from the GCS bucket and constructs a BUILD file that defines one target per artifact.

	One meta-level up, we have targets in `//hw/bitstream` that decide whether to use cached artifacts or to build them from scratch.
	By default, these targets use cached artifacts by pulling in their corresponding `@bitstreams//` targets.

	### Related work

	* TODO Define the new naming scheme.
	https://github.com/lowRISC/opentitan/issues/13807