I'm still not going to do anything further with the Lorenz tests - but I wasn't happy about not having fulfilled my goal of having some kind of illegal opcode test suite that actually runs on the Beeb. So I thought I'd start on writing one, and it ended up actually taking a lot less time than I thought:
https://github.com/tom-seddon/6502-tests
The goal here is to be useful for testing BBC emulators, so it tries not to duplicate what's already in the Dormann test suite, which is something you can run on a Beeb already and covers the documented aspects. My tests comprehensively test the BCD behaviour, and the illegal opcodes that are known to behave consistently on the C64, and it runs to completion on my BBC B and on b2. I'm now officially a bit more confident than I was before that b2's 6502 emulator reflects the BBC B's behaviour, when all I had was a (admittedly suggestive) successful Lorenz test suite run and no obvious issues.
(Also includes some tests for the BCD ADC/SBC behaviour on CMOS chips, but while putting the readme together I discovered that the Dormann suite can actually apparently test that too! - something I'd somehow previously missed. That part of these tests is actually redundant

)
Plans for this in the longer term:
* more automation-friendly behaviour, so I can run it as part of the CI process for b2. At the moment, the only real way to tell if things succeeded or failed is to have an actual person look at it running using their eyes
* add test for SBX, which is reportedly consistent on C64, but has a time-consuming 24 bits of input
* add tests for the inconsistent C64 opcodes. Some of the inconsistent C64 opcodes are inconsistent on my Beeb too - but is that true for all of them?
* maybe add some portability stuff. The code is Beeb-specific at the moment, but in principle all it needs is 4 KB RAM and ideally some way of producing output
--Tom