yggdrasil/ring - ring - alnyan's gitea

Author	SHA1	Message	Date
Tamas Petz	a0b49d63fd	aarch64: support BTI and pointer authentication in assembly This change adds optional support for - Armv8.3-A Pointer Authentication (PAuth) and - Armv8.5-A Branch Target Identification (BTI) features to the perl scripts. Both features can be enabled with additional compiler flags. Unless any of these are enabled explicitly there is no code change at all. The extensions are briefly described below. Please read the appropriate chapters of the Arm Architecture Reference Manual for the complete specification. Scope ----- This change only affects generated assembly code. Armv8.3-A Pointer Authentication -------------------------------- Pointer Authentication extension supports the authentication of the contents of registers before they are used for indirect branching or load. PAuth provides a probabilistic method to detect corruption of register values. PAuth signing instructions generate a Pointer Authentication Code (PAC) based on the value of a register, a seed and a key. The generated PAC is inserted into the original value in the register. A PAuth authentication instruction recomputes the PAC, and if it matches the PAC in the register, restores its original value. In case of a mismatch, an architecturally unmapped address is generated instead. With PAuth, mitigation against ROP (Return-oriented Programming) attacks can be implemented. This is achieved by signing the contents of the link-register (LR) before it is pushed to stack. Once LR is popped, it is authenticated. This way a stack corruption which overwrites the LR on the stack is detectable. The PAuth extension adds several new instructions, some of which are not recognized by older hardware. To support a single codebase for both pre Armv8.3-A targets and newer ones, only NOP-space instructions are added by this patch. These instructions are treated as NOPs on hardware which does not support Armv8.3-A. Furthermore, this patch only considers cases where LR is saved to the stack and then restored before branching to its content. There are cases in the code where LR is pushed to stack but it is not used later. We do not address these cases as they are not affected by PAuth. There are two keys available to sign an instruction address: A and B. PACIASP and PACIBSP only differ in the used keys: A and B, respectively. The keys are typically managed by the operating system. To enable generating code for PAuth compile with -mbranch-protection=<mode>: - standard or pac-ret: add PACIASP and AUTIASP, also enables BTI (read below) - pac-ret+b-key: add PACIBSP and AUTIBSP Armv8.5-A Branch Target Identification -------------------------------------- Branch Target Identification features some new instructions which protect the execution of instructions on guarded pages which are not intended branch targets. If Armv8.5-A is supported by the hardware, execution of an instruction changes the value of PSTATE.BTYPE field. If an indirect branch lands on a guarded page the target instruction must be one of the BTI <jc> flavors, or in case of a direct call or jump it can be any other instruction. If the target instruction is not compatible with the value of PSTATE.BTYPE a Branch Target Exception is generated. In short, indirect jumps are compatible with BTI <j> and <jc> while indirect calls are compatible with BTI <c> and <jc>. Please refer to the specification for the details. Armv8.3-A PACIASP and PACIBSP are implicit branch target identification instructions which are equivalent with BTI c or BTI jc depending on system register configuration. BTI is used to mitigate JOP (Jump-oriented Programming) attacks by limiting the set of instructions which can be jumped to. BTI requires active linker support to mark the pages with BTI-enabled code as guarded. For ELF64 files BTI compatibility is recorded in the .note.gnu.property section. For a shared object or static binary it is required that all linked units support BTI. This means that even a single assembly file without the required note section turns-off BTI for the whole binary or shared object. The new BTI instructions are treated as NOPs on hardware which does not support Armv8.5-A or on pages which are not guarded. To insert this new and optional instruction compile with -mbranch-protection=standard (also enables PAuth) or +bti. When targeting a guarded page from a non-guarded page, weaker compatibility restrictions apply to maintain compatibility between legacy and new code. For detailed rules please refer to the Arm ARM. Compiler support ---------------- Compiler support requires understanding '-mbranch-protection=<mode>' and emitting the appropriate feature macros (__ARM_FEATURE_BTI_DEFAULT and __ARM_FEATURE_PAC_DEFAULT). The current state is the following: ------------------------------------------------------- \| Compiler \| -mbranch-protection \| Feature macros \| +----------+---------------------+--------------------+ \| clang \| 9.0.0 \| 11.0.0 \| +----------+---------------------+--------------------+ \| gcc \| 9 \| expected in 10.1+ \| ------------------------------------------------------- Available Platforms ------------------ Arm Fast Model and QEMU support both extensions. https://developer.arm.com/tools-and-software/simulation-models/fast-models https://www.qemu.org/ Implementation Notes -------------------- This change adds BTI landing pads even to assembly functions which are likely to be directly called only. In these cases, landing pads might be superfluous depending on what code the linker generates. Code size and performance impact for these cases would be negligble. Interaction with C code ----------------------- Pointer Authentication is a per-frame protection while Branch Target Identification can be turned on and off only for all code pages of a whole shared object or static binary. Because of these properties if C/C++ code is compiled without any of the above features but assembly files support any of them unconditionally there is no incompatibility between the two. Useful Links ------------ To fully understand the details of both PAuth and BTI it is advised to read the related chapters of the Arm Architecture Reference Manual (Arm ARM): https://developer.arm.com/documentation/ddi0487/latest/ Additional materials: "Providing protection for complex software" https://developer.arm.com/architectures/learn-the-architecture/providing-protection-for-complex-software Arm Compiler Reference Guide Version 6.14: -mbranch-protection https://developer.arm.com/documentation/101754/0614/armclang-Reference/armclang-Command-line-Options/-mbranch-protection?lang=en Arm C Language Extensions (ACLE) https://developer.arm.com/docs/101028/latest Change-Id: I4335f92e2ccc8e209c7d68a0a79f1acdf3aeb791 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/42084 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: Adam Langley <agl@google.com>	2020-08-11 23:45:04 +00:00
Adam Langley	83b74c6a7a	Add details of 20190808 FIPS certification. Change-Id: I4d17e1e6f24b623ee39a844def8f265eb5e6c6cc Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/42144 Commit-Queue: Adam Langley <alangley@gmail.com> Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: David Benjamin <davidben@google.com>	2020-07-13 16:59:37 +00:00
Steven Valdez	d0637e901d	Remove TRUST_TOKEN_experiment_v0. Update-Note: This gets rid of TRUST_TOKEN_experiment_v0. Existing callers should be updated to call TRUST_TOKEN_experiment_v1. Change-Id: I8ec9b808cbd35546425690d1548db671ff033e14 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41524 Reviewed-by: David Benjamin <davidben@google.com> Commit-Queue: Steven Valdez <svaldez@google.com>	2020-07-07 16:26:59 +00:00
Adam Langley	939d426f6b	Maybe build for AArch64 Windows. Microsoft lists[1] this define to indicate AArch64, support for which is requested on https://github.com/grpc/grpc/issues/23310. More might well be needed, especially if the assembly code is to work, but maybe this'll work for gRPC. [1] https://docs.microsoft.com/en-us/cpp/preprocessor/predefined-macros?view=vs-2019 Change-Id: Id66d1c8ab7ab161f73c993dd4901e2252198bda8 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41945 Reviewed-by: Adam Langley <alangley@gmail.com> Reviewed-by: David Benjamin <davidben@google.com> Commit-Queue: Adam Langley <agl@google.com>	2020-07-02 20:06:21 +00:00
Adam Langley	e2abade424	sha1-x86_64: fix CFI. This issue only arises when SHAEXT is enabled, which it isn't (yet). Will upstream too. Change-Id: I92de51789d58ba1784b88eb872b1f9eca8eb78d8 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41944 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: David Benjamin <davidben@google.com>	2020-06-30 16:48:54 +00:00
Brian Smith	5d74463301	Use \|crypto_word_t\| and \|size_t\| more consistently in ECC scalar recoding. Use \|crypto_word_t\| as the type for secret values in scalar recoding. Use \|size_t\| as the type of array indexes in scalar recoding. Use explicit casts where a larger type is (losslessly) truncated to a smaller type. With this change, \|uint64_t\| is no longer used in the p256.c when building in 32-bit mode, \|unsigned\| is not used in any of the affected modules, and \|uint8_t\| and \|char\| are no longer used for secret values in the ECC recoding. When given the choice of doing non-array-indexing arithmetic (e.g. shifts) on \|size_t\| values or \|crypto_word_t\| values, prefer doing it on \|crypto_word_t\| values. More generally, try to use \|size_t\| only for sizes and array indexes. This is part of a bigger project to minimize the use of types other than \|crypto_word_t\| for secret values. This is also part of a larger project make the ECC code more consistent. Avoid changing the loop indexing in the P-256 scalar multiplication from \|int\| to \|size_t\|. The P-224 code does use \|size_t\| but it is less clear than the P-256 code where \|i - 1\| results in a negative/underflowed value when \|i\| is zero. Change-Id: I78cb404455c2340a4f8c9688d36c0d425bfcc50b Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41685 Reviewed-by: David Benjamin <davidben@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-06-30 15:04:06 +00:00
Ilya Tokar	7361ee42cf	Enable shaext path for sha1. This makes sha1 ~2x faster on Amd Rome: BM_SHA1Hash/2 10.9MB/s ± 1% 14.0MB/s ± 1% +28.77% (p=0.000 n=10+10) BM_SHA1Hash/4 21.9MB/s ± 1% 28.2MB/s ± 1% +28.56% (p=0.000 n=10+10) BM_SHA1Hash/8 43.9MB/s ± 1% 56.3MB/s ± 2% +28.36% (p=0.000 n=10+10) BM_SHA1Hash/16 88.1MB/s ± 1% 114.8MB/s ± 2% +30.40% (p=0.000 n=9+10) BM_SHA1Hash/32 178MB/s ± 1% 229MB/s ± 2% +28.64% (p=0.000 n=10+10) BM_SHA1Hash/64 240MB/s ± 1% 363MB/s ± 2% +51.57% (p=0.000 n=10+10) BM_SHA1Hash/512 629MB/s ± 1% 1129MB/s ± 2% +79.54% (p=0.000 n=9+10) BM_SHA1Hash/4k 794MB/s ± 0% 1538MB/s ± 1% +93.76% (p=0.000 n=8+10) BM_SHA1Hash/32k 820MB/s ± 1% 1610MB/s ± 2% +96.44% (p=0.000 n=10+10) BM_SHA1Hash/256k 822MB/s ± 1% 1624MB/s ± 1% +97.48% (p=0.000 n=10+10) BM_SHA1Hash/1M 822MB/s ± 1% 1625MB/s ± 1% +97.63% (p=0.000 n=10+10) BM_SHA1Hash/2M 824MB/s ± 1% 1626MB/s ± 1% +97.32% (p=0.000 n=10+10) BM_SHA1Hash/4M 826MB/s ± 1% 1631MB/s ± 0% +97.56% (p=0.000 n=10+8) BM_SHA1Hash/8M 824MB/s ± 1% 1625MB/s ± 1% +97.14% (p=0.000 n=10+10) BM_SHA1Hash/16M 823MB/s ± 1% 1625MB/s ± 1% +97.40% (p=0.000 n=10+10) Change-Id: Ic75eb717a71b35d0ca775c309e08396b2ab77641 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41884 Reviewed-by: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-06-30 14:58:34 +00:00
Adam Langley	7f90eda55e	Add “Z Computation” KAT. FIPS updates will make this useful / mandatory in the future. Change-Id: I9921e4f3fc8a8315dc85dc366f331b456572d49e Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41644 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: David Benjamin <davidben@google.com>	2020-06-12 19:01:37 +00:00
David Benjamin	2309f645e5	Use ctr32 optimizations for AES_ctr128_encrypt. There are a decent number of uses of this function directly. I've attached this to bug 338. Arguably it makes it worse, though it does help with aligning on ctr32, if that works out. Bug: 338 Change-Id: I3dfc1305d359ec0c88d4f298fe1928bef7ec9877 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41426 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2020-05-26 19:04:16 +00:00
David Benjamin	9701e84eff	Remove RAND_set_urandom_fd. Also update the documentation for RAND_enable_fork_unsafe_buffering. The fd parameter is no longer used. Update-Note: RAND_set_urandom_fd no longer exists. This was only called by Chromium, which now uses CRYPTO_pre_sandbox_init. Change-Id: I1659c1cc84a6f1edc01f6105fc07e80856e457fc Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41424 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-05-20 18:55:59 +00:00
David Benjamin	3e4dfbb2f8	Add CRYPTO_pre_sandbox_init. The intent is to replace the logic in [0] and allows Chromium to set up the MADV_WIPEONFORK page without increasing sandbox syscall surface. From there we can remove RAND_set_urandom_fd and trim a bit of complexity from the PRNG logic. [0] https://source.chromium.org/chromium/chromium/src/+/master:content/app/content_main_runner_impl.cc;l=333-341;drc=975850fa57e140ec696114477e9416a19f06d29f Change-Id: I9b679e15da551a10302389556c6c77d192be662a Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41326 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2020-05-14 18:17:12 +00:00
Nick Harper	be28dd623f	Add missing header to ec/wnaf.c a810d82 added calls to OPENSSL_malloc in this file, but openssl/mem.h was missing. Change-Id: I77e19e61e92b1e73702cb3eb93b9c6e22aca9596 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41344 Reviewed-by: David Benjamin <davidben@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-05-14 00:09:11 +00:00
David Benjamin	b7acfff8e7	Fix OPENSSL_TSAN typo. We weren't actually reducing MAX_BLINDINGS_PER_RSA under TSan. Change-Id: Ib33dc1a1c0312bd3309a64f2600ec4d6e2ec9ddb Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41324 Commit-Queue: Steven Valdez <svaldez@google.com> Reviewed-by: Steven Valdez <svaldez@google.com>	2020-05-13 17:00:31 +00:00
David Benjamin	49e95dc0f3	Fix p256-x86_64-table.h indentation. This makes clang-format a no-op. (I would not recommend running clang-format on that file. It takes a while.) Change-Id: Ie4bd93340be8fe586e774ce16b5aed1ab626bb0c Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41165 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Steven Valdez <svaldez@google.com>	2020-05-12 19:40:02 +00:00
Ilya Tokar	1274d1d977	Enable avx2 implementation of sha1. Before: Did 19655000 SHA-1 (16 bytes) operations in 2000053us (157.2 MB/sec) Did 5112000 SHA-1 (256 bytes) operations in 2000126us (654.3 MB/sec) Did 1280000 SHA-1 (1350 bytes) operations in 2000559us (863.8 MB/sec) Did 229000 SHA-1 (8192 bytes) operations in 2004784us (935.7 MB/sec) Did 116000 SHA-1 (16384 bytes) operations in 2008224us (946.4 MB/sec) After: Did 20506000 SHA-1 (16 bytes) operations in 2000062us (164.0 MB/sec) [+4.3%] Did 5659000 SHA-1 (256 bytes) operations in 2000100us (724.3 MB/sec) [+10.7%] Did 1454000 SHA-1 (1350 bytes) operations in 2000739us (981.1 MB/sec) [+13.6%] Did 255000 SHA-1 (8192 bytes) operations in 2000733us (1044.1 MB/sec) [+11.6%] Did 129000 SHA-1 (16384 bytes) operations in 2009483us (1051.8 MB/sec) [+11.1%] Change-Id: Idaf75e92b2da4d83b597771aca1123f2884d7687 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41284 Reviewed-by: David Benjamin <davidben@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-05-12 19:32:02 +00:00
David Benjamin	d4d501c159	Trim Z coordinates from the OPENSSL_SMALL P-256 tables. The small P-256 tables currently store 2 * 16 points in Jacobian coordinates, but all the Z values are one, except for the two entries which store infinity. Instead, store only affine coordinates and omit the infinity entries. (We can recover infinity with constant-time selects.) This trims 1152 bytes from the table. Comparing the sizes of the bssl tool (stripped, otherwise debug symbols undo the size gain), the binary is 4096 bytes smaller, but I suspect this is just an artifact of something rounding to page sizes. Along the way, I've scripted the table generation and merged it into a much generalized version of make_p256-x86_64-table.go. As a bonus, by cutting the size of the table we scan, it's faster! (ECDSA verify was already not scanning a table, so that one staying the same is expected.) Before: Did 22302 ECDH P-256 operations in 4009795us (5561.9 ops/sec) Did 62000 ECDSA P-256 signing operations in 4020075us (15422.6 ops/sec) Did 26544 ECDSA P-256 verify operations in 4056526us (6543.5 ops/sec) After: Did 23001 ECDH P-256 operations in 4016544us (5726.6 ops/sec) [+3.0%] Did 66000 ECDSA P-256 signing operations in 4043083us (16324.2 ops/sec) [+5.8%] Did 26544 ECDSA P-256 verify operations in 4051220us (6552.1 ops/sec) [+0.1%] Change-Id: Ie88898fd56e57505e3a325c50bbaf0dc5d42eeb3 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41164 Reviewed-by: Steven Valdez <svaldez@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-05-12 19:24:22 +00:00
David Benjamin	a810d82575	Use public multi-scalar mults in Trust Tokens where applicable. The input points are all public, so we can use a faster multi-scalar multiplication. This generalizes ec_point_mul_scalar_public to ec_point_mul_scalar_public_batch. To support the batched DLEQ construction, this function takes an arbirarily-length array of points and allocates some temporaries if necessary. First, to confirm that this doesn't affect the basic ECDSA verify case: Before: Did 6324 ECDSA P-384 verify operations in 3069342us (2060.4 ops/sec) After: Did 6324 ECDSA P-384 verify operations in 3063355us (2064.4 ops/sec) [+0.2%] Results for Trust Tokens issue (Exp1) and finish_issuance (both): Before: Did 147 TrustToken-Exp0-Batch1 finish_issuance operations in 2059145us (71.4 ops/sec) Did 14 TrustToken-Exp0-Batch10 finish_issuance operations in 2085888us (6.7 ops/sec) Did 357 TrustToken-Exp1-Batch1 issue operations in 2068238us (172.6 ops/sec) Did 286 TrustToken-Exp1-Batch1 finish_issuance operations in 2090932us (136.8 ops/sec) Did 63 TrustToken-Exp1-Batch10 issue operations in 2068201us (30.5 ops/sec) Did 56 TrustToken-Exp1-Batch10 finish_issuance operations in 2064796us (27.1 ops/sec) After: Did 168 TrustToken-Exp0-Batch1 finish_issuance operations in 2058891us (81.6 ops/sec) [+14.3%] Did 16 TrustToken-Exp0-Batch10 finish_issuance operations in 2075742us (7.7 ops/sec) [+14.8%] Did 378 TrustToken-Exp1-Batch1 issue operations in 2067956us (182.8 ops/sec) [+5.9%] Did 336 TrustToken-Exp1-Batch1 finish_issuance operations in 2097757us (160.2 ops/sec) [+17.1%] Did 105 TrustToken-Exp1-Batch10 issue operations in 2069934us (50.7 ops/sec) [+66.5%] Did 88 TrustToken-Exp1-Batch10 finish_issuance operations in 2014621us (43.7 ops/sec) [+61.1%] (This CL doesn't affect other operations.) Change-Id: Ie643b06f44990ab52bf892a007732fde61cdffe5 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41285 Reviewed-by: Steven Valdez <svaldez@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-05-12 17:38:55 +00:00
David Benjamin	d4a97fa65f	Use a 5-bit comb for some Trust Tokens multiplications. Several of the Trust Tokens multiplications use repeated points (G, H, and the public keys). We can precompute a 5-bit comb for those points and perform only 1/5th as many doubles in the multiplication. Before: Did 483 TrustToken-Exp0-Batch1 generate_key operations in 2017082us (239.5 ops/sec) Did 1449 TrustToken-Exp0-Batch1 begin_issuance operations in 2086097us (694.6 ops/sec) Did 176 TrustToken-Exp0-Batch1 issue operations in 2089640us (84.2 ops/sec) Did 147 TrustToken-Exp0-Batch1 finish_issuance operations in 2027924us (72.5 ops/sec) Did 12284000 TrustToken-Exp0-Batch1 begin_redemption operations in 2000151us (6141536.3 ops/sec) Did 483 TrustToken-Exp0-Batch1 redeem operations in 2063241us (234.1 ops/sec) Did 35000 TrustToken-Exp0-Batch1 finish_redemption operations in 2050694us (17067.4 ops/sec) Did 483 TrustToken-Exp0-Batch10 generate_key operations in 2003222us (241.1 ops/sec) Did 138 TrustToken-Exp0-Batch10 begin_issuance operations in 2000845us (69.0 ops/sec) Did 16 TrustToken-Exp0-Batch10 issue operations in 2010264us (8.0 ops/sec) Did 14 TrustToken-Exp0-Batch10 finish_issuance operations in 2036137us (6.9 ops/sec) Did 12106000 TrustToken-Exp0-Batch10 begin_redemption operations in 2000126us (6052618.7 ops/sec) Did 483 TrustToken-Exp0-Batch10 redeem operations in 2062366us (234.2 ops/sec) Did 35000 TrustToken-Exp0-Batch10 finish_redemption operations in 2023617us (17295.8 ops/sec) Did 1254 TrustToken-Exp1-Batch1 generate_key operations in 2086776us (600.9 ops/sec) Did 3612 TrustToken-Exp1-Batch1 begin_issuance operations in 2052090us (1760.2 ops/sec) Did 420 TrustToken-Exp1-Batch1 issue operations in 2002421us (209.7 ops/sec) Did 378 TrustToken-Exp1-Batch1 finish_issuance operations in 2078074us (181.9 ops/sec) Did 12843000 TrustToken-Exp1-Batch1 begin_redemption operations in 2000068us (6421281.7 ops/sec) Did 1210 TrustToken-Exp1-Batch1 redeem operations in 2083419us (580.8 ops/sec) Did 35000 TrustToken-Exp1-Batch1 finish_redemption operations in 2023704us (17295.0 ops/sec) Did 1239 TrustToken-Exp1-Batch10 generate_key operations in 2060962us (601.2 ops/sec) Did 357 TrustToken-Exp1-Batch10 begin_issuance operations in 2031131us (175.8 ops/sec) Did 42 TrustToken-Exp1-Batch10 issue operations in 2045185us (20.5 ops/sec) Did 36 TrustToken-Exp1-Batch10 finish_issuance operations in 2028604us (17.7 ops/sec) Did 12435000 TrustToken-Exp1-Batch10 begin_redemption operations in 2000084us (6217238.9 ops/sec) Did 1176 TrustToken-Exp1-Batch10 redeem operations in 2023934us (581.0 ops/sec) Did 35000 TrustToken-Exp1-Batch10 finish_redemption operations in 2002899us (17474.7 ops/sec) After: Did 875 TrustToken-Exp0-Batch1 generate_key operations in 2028222us (431.4 ops/sec) [+80.2%] Did 1449 TrustToken-Exp0-Batch1 begin_issuance operations in 2097298us (690.9 ops/sec) [-0.5%] Did 207 TrustToken-Exp0-Batch1 issue operations in 2083578us (99.3 ops/sec) [+18.0%] Did 147 TrustToken-Exp0-Batch1 finish_issuance operations in 2018783us (72.8 ops/sec) [+0.5%] Did 12020250 TrustToken-Exp0-Batch1 begin_redemption operations in 2000036us (6010016.8 ops/sec) [-2.1%] Did 525 TrustToken-Exp0-Batch1 redeem operations in 2077137us (252.8 ops/sec) [+8.0%] Did 35000 TrustToken-Exp0-Batch1 finish_redemption operations in 2006257us (17445.4 ops/sec) [+2.2%] Did 903 TrustToken-Exp0-Batch10 generate_key operations in 2091846us (431.7 ops/sec) [+79.0%] Did 138 TrustToken-Exp0-Batch10 begin_issuance operations in 2006432us (68.8 ops/sec) [-0.3%] Did 19 TrustToken-Exp0-Batch10 issue operations in 2000665us (9.5 ops/sec) [+19.3%] Did 14 TrustToken-Exp0-Batch10 finish_issuance operations in 2045846us (6.8 ops/sec) [-0.5%] Did 12124000 TrustToken-Exp0-Batch10 begin_redemption operations in 2000055us (6061833.3 ops/sec) [+0.2%] Did 525 TrustToken-Exp0-Batch10 redeem operations in 2076637us (252.8 ops/sec) [+7.9%] Did 35000 TrustToken-Exp0-Batch10 finish_redemption operations in 2000072us (17499.4 ops/sec) [+1.2%] Did 2142 TrustToken-Exp1-Batch1 generate_key operations in 2031447us (1054.4 ops/sec) [+75.5%] Did 3633 TrustToken-Exp1-Batch1 begin_issuance operations in 2073265us (1752.3 ops/sec) [-0.4%] Did 504 TrustToken-Exp1-Batch1 issue operations in 2043677us (246.6 ops/sec) [+17.6%] Did 378 TrustToken-Exp1-Batch1 finish_issuance operations in 2086624us (181.2 ops/sec) [-0.4%] Did 12548250 TrustToken-Exp1-Batch1 begin_redemption operations in 2000020us (6274062.3 ops/sec) [-2.3%] Did 1281 TrustToken-Exp1-Batch1 redeem operations in 2067790us (619.5 ops/sec) [+6.7%] Did 35000 TrustToken-Exp1-Batch1 finish_redemption operations in 2012117us (17394.6 ops/sec) [+0.6%] Did 2184 TrustToken-Exp1-Batch10 generate_key operations in 2069977us (1055.1 ops/sec) [+75.5%] Did 357 TrustToken-Exp1-Batch10 begin_issuance operations in 2041930us (174.8 ops/sec) [-0.5%] Did 50 TrustToken-Exp1-Batch10 issue operations in 2063927us (24.2 ops/sec) [+18.0%] Did 36 TrustToken-Exp1-Batch10 finish_issuance operations in 2038115us (17.7 ops/sec) [-0.5%] Did 12693000 TrustToken-Exp1-Batch10 begin_redemption operations in 2000070us (6346277.9 ops/sec) [+2.1%] Did 1281 TrustToken-Exp1-Batch10 redeem operations in 2066940us (619.8 ops/sec) [+6.7%] Did 35000 TrustToken-Exp1-Batch10 finish_redemption operations in 2020506us (17322.4 ops/sec) [-0.9%] Change-Id: Id26600c07401d6567275155aa389839ac0e87013 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41124 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Steven Valdez <svaldez@google.com>	2020-05-07 18:16:27 +00:00
David Benjamin	5f43b12d52	Use a (mostly) constant-time multi-scalar mult for Trust Tokens. With multi-scalar multiplication, we're stuck pondering the doubling case. But it's fine for trust tokens, because the points are independent and the scalars are uniformly generated and not under attacker control. That means the probability of hitting a double is negligible. (It's equivalent to accidentally finding the discrete log of two independent points.) Before: Did 306 TrustToken-Exp0-Batch1 generate_key operations in 2000725us (152.9 ops/sec) Did 1428 TrustToken-Exp0-Batch1 begin_issuance operations in 2080325us (686.4 ops/sec) Did 105 TrustToken-Exp0-Batch1 issue operations in 2070658us (50.7 ops/sec) Did 88 TrustToken-Exp0-Batch1 finish_issuance operations in 2023864us (43.5 ops/sec) Did 12283000 TrustToken-Exp0-Batch1 begin_redemption operations in 2000063us (6141306.5 ops/sec) Did 315 TrustToken-Exp0-Batch1 redeem operations in 2084451us (151.1 ops/sec) Did 35000 TrustToken-Exp0-Batch1 finish_redemption operations in 2024388us (17289.2 ops/sec) Did 315 TrustToken-Exp0-Batch10 generate_key operations in 2045481us (154.0 ops/sec) Did 138 TrustToken-Exp0-Batch10 begin_issuance operations in 2022158us (68.2 ops/sec) Did 10 TrustToken-Exp0-Batch10 issue operations in 2148640us (4.7 ops/sec) Did 8 TrustToken-Exp0-Batch10 finish_issuance operations in 2047452us (3.9 ops/sec) Did 12167000 TrustToken-Exp0-Batch10 begin_redemption operations in 2000118us (6083141.1 ops/sec) Did 315 TrustToken-Exp0-Batch10 redeem operations in 2084853us (151.1 ops/sec) Did 35000 TrustToken-Exp0-Batch10 finish_redemption operations in 2014997us (17369.8 ops/sec) Did 777 TrustToken-Exp1-Batch1 generate_key operations in 2034967us (381.8 ops/sec) Did 3612 TrustToken-Exp1-Batch1 begin_issuance operations in 2052618us (1759.7 ops/sec) Did 264 TrustToken-Exp1-Batch1 issue operations in 2084327us (126.7 ops/sec) Did 220 TrustToken-Exp1-Batch1 finish_issuance operations in 2024603us (108.7 ops/sec) Did 12691000 TrustToken-Exp1-Batch1 begin_redemption operations in 2000111us (6345147.8 ops/sec) Did 777 TrustToken-Exp1-Batch1 redeem operations in 2070867us (375.2 ops/sec) Did 35000 TrustToken-Exp1-Batch1 finish_redemption operations in 2019118us (17334.3 ops/sec) Did 798 TrustToken-Exp1-Batch10 generate_key operations in 2090816us (381.7 ops/sec) Did 357 TrustToken-Exp1-Batch10 begin_issuance operations in 2032751us (175.6 ops/sec) Did 25 TrustToken-Exp1-Batch10 issue operations in 2046353us (12.2 ops/sec) Did 21 TrustToken-Exp1-Batch10 finish_issuance operations in 2015579us (10.4 ops/sec) Did 12695000 TrustToken-Exp1-Batch10 begin_redemption operations in 2000126us (6347100.1 ops/sec) Did 740 TrustToken-Exp1-Batch10 redeem operations in 2032413us (364.1 ops/sec) Did 35000 TrustToken-Exp1-Batch10 finish_redemption operations in 2011564us (17399.4 ops/sec) After: Did 483 TrustToken-Exp0-Batch1 generate_key operations in 2003131us (241.1 ops/sec) [+57.7%] Did 1449 TrustToken-Exp0-Batch1 begin_issuance operations in 2089317us (693.5 ops/sec) [+1.0%] Did 176 TrustToken-Exp0-Batch1 issue operations in 2094210us (84.0 ops/sec) [+65.7%] Did 147 TrustToken-Exp0-Batch1 finish_issuance operations in 2006750us (73.3 ops/sec) [+68.5%] Did 12217000 TrustToken-Exp0-Batch1 begin_redemption operations in 2000094us (6108212.9 ops/sec) [-0.5%] Did 483 TrustToken-Exp0-Batch1 redeem operations in 2058132us (234.7 ops/sec) [+55.3%] Did 35000 TrustToken-Exp0-Batch1 finish_redemption operations in 2026970us (17267.2 ops/sec) [-0.1%] Did 504 TrustToken-Exp0-Batch10 generate_key operations in 2086204us (241.6 ops/sec) [+56.9%] Did 144 TrustToken-Exp0-Batch10 begin_issuance operations in 2084670us (69.1 ops/sec) [+1.2%] Did 16 TrustToken-Exp0-Batch10 issue operations in 2008793us (8.0 ops/sec) [+71.1%] Did 14 TrustToken-Exp0-Batch10 finish_issuance operations in 2033577us (6.9 ops/sec) [+76.2%] Did 12026000 TrustToken-Exp0-Batch10 begin_redemption operations in 2000018us (6012945.9 ops/sec) [-1.2%] Did 483 TrustToken-Exp0-Batch10 redeem operations in 2056418us (234.9 ops/sec) [+55.5%] Did 35000 TrustToken-Exp0-Batch10 finish_redemption operations in 2046766us (17100.1 ops/sec) [-1.6%] Did 1239 TrustToken-Exp1-Batch1 generate_key operations in 2060737us (601.2 ops/sec) [+57.5%] Did 3675 TrustToken-Exp1-Batch1 begin_issuance operations in 2085293us (1762.3 ops/sec) [+0.1%] Did 420 TrustToken-Exp1-Batch1 issue operations in 2008121us (209.2 ops/sec) [+65.1%] Did 378 TrustToken-Exp1-Batch1 finish_issuance operations in 2077226us (182.0 ops/sec) [+67.5%] Did 12783000 TrustToken-Exp1-Batch1 begin_redemption operations in 2000134us (6391071.8 ops/sec) [+0.7%] Did 1197 TrustToken-Exp1-Batch1 redeem operations in 2056802us (582.0 ops/sec) [+55.1%] Did 35000 TrustToken-Exp1-Batch1 finish_redemption operations in 2030955us (17233.3 ops/sec) [-0.6%] Did 1260 TrustToken-Exp1-Batch10 generate_key operations in 2095507us (601.3 ops/sec) [+57.5%] Did 357 TrustToken-Exp1-Batch10 begin_issuance operations in 2029693us (175.9 ops/sec) [+0.2%] Did 42 TrustToken-Exp1-Batch10 issue operations in 2050856us (20.5 ops/sec) [+67.6%] Did 36 TrustToken-Exp1-Batch10 finish_issuance operations in 2027488us (17.8 ops/sec) [+70.4%] Did 12140000 TrustToken-Exp1-Batch10 begin_redemption operations in 2000070us (6069787.6 ops/sec) [-4.4%] Did 1210 TrustToken-Exp1-Batch10 redeem operations in 2079615us (581.8 ops/sec) [+59.8%] Did 34000 TrustToken-Exp1-Batch10 finish_redemption operations in 2052918us (16561.8 ops/sec) [-4.8%] Change-Id: Idd51d7e1d18f3b94edc4105e68fd50b5f44d87cd Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41104 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Steven Valdez <svaldez@google.com>	2020-05-07 17:10:36 +00:00
David Benjamin	ce1665b825	Batch inversions in Trust Tokens. The DLEQ and DLEQOR proofs require converting many Jacobian points to affine, some multiple times. The inversions involved can be batched. This buys us a +5-8% improvement in token issuance speed. issue and finish_issue should each only perform two inversions per token now. We could save an inversion per token by changing the dleq_generate and dleq_verify function signatures, but that would complicate the likely more valuable batched DLEQ(OR) optimization, so I've left those alone. Before: Did 300 TrustToken-Exp0-Batch1 generate_key operations in 2031798us (147.7 ops/sec) Did 1449 TrustToken-Exp0-Batch1 begin_issuance operations in 2093639us (692.1 ops/sec) Did 96 TrustToken-Exp0-Batch1 issue operations in 2044310us (47.0 ops/sec) Did 84 TrustToken-Exp0-Batch1 finish_issuance operations in 2072137us (40.5 ops/sec) Did 12170000 TrustToken-Exp0-Batch1 begin_redemption operations in 2000098us (6084701.8 ops/sec) Did 315 TrustToken-Exp0-Batch1 redeem operations in 2091938us (150.6 ops/sec) Did 35000 TrustToken-Exp0-Batch1 finish_redemption operations in 2004900us (17457.2 ops/sec) Did 308 TrustToken-Exp0-Batch10 generate_key operations in 2067860us (148.9 ops/sec) Did 138 TrustToken-Exp0-Batch10 begin_issuance operations in 2005706us (68.8 ops/sec) Did 9 TrustToken-Exp0-Batch10 issue operations in 2107753us (4.3 ops/sec) Did 8 TrustToken-Exp0-Batch10 finish_issuance operations in 2193489us (3.6 ops/sec) Did 12046750 TrustToken-Exp0-Batch10 begin_redemption operations in 2000025us (6023299.7 ops/sec) Did 315 TrustToken-Exp0-Batch10 redeem operations in 2091940us (150.6 ops/sec) Did 35000 TrustToken-Exp0-Batch10 finish_redemption operations in 2008851us (17422.9 ops/sec) Did 756 TrustToken-Exp1-Batch1 generate_key operations in 2051005us (368.6 ops/sec) Did 3633 TrustToken-Exp1-Batch1 begin_issuance operations in 2072577us (1752.9 ops/sec) Did 242 TrustToken-Exp1-Batch1 issue operations in 2052091us (117.9 ops/sec) Did 210 TrustToken-Exp1-Batch1 finish_issuance operations in 2058740us (102.0 ops/sec) Did 12477000 TrustToken-Exp1-Batch1 begin_redemption operations in 2000004us (6238487.5 ops/sec) Did 777 TrustToken-Exp1-Batch1 redeem operations in 2084953us (372.7 ops/sec) Did 35000 TrustToken-Exp1-Batch1 finish_redemption operations in 2028286us (17255.9 ops/sec) Did 756 TrustToken-Exp1-Batch10 generate_key operations in 2051178us (368.6 ops/sec) Did 357 TrustToken-Exp1-Batch10 begin_issuance operations in 2041875us (174.8 ops/sec) Did 23 TrustToken-Exp1-Batch10 issue operations in 2026494us (11.3 ops/sec) Did 20 TrustToken-Exp1-Batch10 finish_issuance operations in 2048478us (9.8 ops/sec) Did 12492000 TrustToken-Exp1-Batch10 begin_redemption operations in 2000053us (6245834.5 ops/sec) Did 777 TrustToken-Exp1-Batch10 redeem operations in 2084956us (372.7 ops/sec) Did 36000 TrustToken-Exp1-Batch10 finish_redemption operations in 2021991us (17804.2 ops/sec) After: Did 315 TrustToken-Exp0-Batch1 generate_key operations in 2046638us (153.9 ops/sec) [+4.2%] Did 1449 TrustToken-Exp0-Batch1 begin_issuance operations in 2087930us (694.0 ops/sec) [+0.3%] Did 105 TrustToken-Exp0-Batch1 issue operations in 2071104us (50.7 ops/sec) [+8.0%] Did 88 TrustToken-Exp0-Batch1 finish_issuance operations in 2023502us (43.5 ops/sec) [+7.3%] Did 11847000 TrustToken-Exp0-Batch1 begin_redemption operations in 2000041us (5923378.6 ops/sec) [-2.7%] Did 315 TrustToken-Exp0-Batch1 redeem operations in 2084116us (151.1 ops/sec) [+0.4%] Did 35000 TrustToken-Exp0-Batch1 finish_redemption operations in 2003732us (17467.4 ops/sec) [+0.1%] Did 315 TrustToken-Exp0-Batch10 generate_key operations in 2046863us (153.9 ops/sec) [+3.3%] Did 138 TrustToken-Exp0-Batch10 begin_issuance operations in 2000108us (69.0 ops/sec) [+0.3%] Did 10 TrustToken-Exp0-Batch10 issue operations in 2149283us (4.7 ops/sec) [+9.0%] Did 8 TrustToken-Exp0-Batch10 finish_issuance operations in 2046416us (3.9 ops/sec) [+7.2%] Did 12112000 TrustToken-Exp0-Batch10 begin_redemption operations in 2000077us (6055766.9 ops/sec) [+0.5%] Did 315 TrustToken-Exp0-Batch10 redeem operations in 2084427us (151.1 ops/sec) [+0.4%] Did 35000 TrustToken-Exp0-Batch10 finish_redemption operations in 2015111us (17368.8 ops/sec) [-0.3%] Did 777 TrustToken-Exp1-Batch1 generate_key operations in 2029777us (382.8 ops/sec) [+3.9%] Did 3654 TrustToken-Exp1-Batch1 begin_issuance operations in 2093484us (1745.4 ops/sec) [-0.4%] Did 252 TrustToken-Exp1-Batch1 issue operations in 2024557us (124.5 ops/sec) [+5.5%] Did 220 TrustToken-Exp1-Batch1 finish_issuance operations in 2034633us (108.1 ops/sec) [+6.0%] Did 12659000 TrustToken-Exp1-Batch1 begin_redemption operations in 2000112us (6329145.6 ops/sec) [+1.5%] Did 777 TrustToken-Exp1-Batch1 redeem operations in 2073783us (374.7 ops/sec) [+0.5%] Did 35000 TrustToken-Exp1-Batch1 finish_redemption operations in 2050371us (17070.1 ops/sec) [-1.1%] Did 768 TrustToken-Exp1-Batch10 generate_key operations in 2025482us (379.2 ops/sec) [+2.9%] Did 357 TrustToken-Exp1-Batch10 begin_issuance operations in 2034429us (175.5 ops/sec) [+0.4%] Did 25 TrustToken-Exp1-Batch10 issue operations in 2049293us (12.2 ops/sec) [+7.5%] Did 21 TrustToken-Exp1-Batch10 finish_issuance operations in 2022256us (10.4 ops/sec) [+6.4%] Did 12702000 TrustToken-Exp1-Batch10 begin_redemption operations in 2000015us (6350952.4 ops/sec) [+1.7%] Did 777 TrustToken-Exp1-Batch10 redeem operations in 2072048us (375.0 ops/sec) [+0.6%] Did 35000 TrustToken-Exp1-Batch10 finish_redemption operations in 2024580us (17287.5 ops/sec) [-2.9%] Change-Id: Ia1b09cd14aa8ce0935d18033fb4bd75666a258e9 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41086 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Steven Valdez <svaldez@google.com>	2020-05-07 16:45:06 +00:00
David Benjamin	802523aa5f	Introduce an EC_AFFINE abstraction. PMBTokens ends up converting the same point to affine coordinates repeatedly. Additionally, it converts many affine coordinates at once, which we can batch. Introduce an EC_AFFINE type to store affine points and move the inversion to the Jacobian -> affine conversion. This does mean we lose the (negligible) Montgomery reduction optimization in EC_GFp_mont. point_get_affine_coordinates no longer breaks the EC_FELEM abstraction around Montgomery form. Unfortunately, this complicates hardening of the callers not checking return values because EC_AFFINE cannot represent the point at infinity and, due to OpenSSL's API limitations, groups may not have generators available and the generator is not affine at the type level. (EC_AFFINE cannot represent the point at infinity.) Thus this CL: - Tidies up some duplicate code in setting up the generator and ensures it always has Z = 1. - ec_point_set_affine_coordinates hardens against unused results if the generator is configured. But this is ultimately an internal function. - Retains the hardening on the public APIs by adding calls to ec_set_to_safe_point in two places. This CL does not apply the optimization to Trust Tokens, only introduces the EC_AFFINE abstraction. It additionally continues to store EC_POINTs (used in ECDH and ECDSA) in Jacobian form. See https://crbug.com/boringssl/326#c4 for a discussion on why this is tricky. Those protocols are hopefully simple enough that they don't need complexity around inversions. Having an EC_AFFINE type will also be useful for computing custom tables for Trust Token public keys, which gives a nice speedup. Bug: 326 Change-Id: I11b010a33f36a15bac9939351df5205bd35cc665 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41084 Commit-Queue: Steven Valdez <svaldez@google.com> Reviewed-by: Steven Valdez <svaldez@google.com>	2020-05-05 16:55:18 +00:00
Adam Langley	cccfb9bcf2	Disable fork-detect tests under TSAN. TSAN can't cope with the pattern of forking and threading here and exits with an error. Change-Id: I3673004b62a45c5e910a4597f89764682a0b8d0a Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41224 Commit-Queue: Adam Langley <agl@google.com> Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: David Benjamin <davidben@google.com>	2020-05-04 15:19:06 +00:00
David Benjamin	aa764c46eb	Introduce TRUST_TOKENS_experiment_v1. This starts a new branch of the protocol using P-384 and draft-07 of hash-to-curve. Mark it unstable for now as we aim to add batching and other fixes. Did 154 TrustToken-Exp0-Batch1 generate_key operations in 1033312us (149.0 ops/sec) Did 748 TrustToken-Exp0-Batch1 begin_issuance operations in 1078707us (693.4 ops/sec) Did 48 TrustToken-Exp0-Batch1 issue operations in 1030056us (46.6 ops/sec) Did 44 TrustToken-Exp0-Batch1 finish_issuance operations in 1092816us (40.3 ops/sec) Did 6040750 TrustToken-Exp0-Batch1 begin_redemption operations in 1000009us (6040695.6 ops/sec) Did 154 TrustToken-Exp0-Batch1 redeem operations in 1023722us (150.4 ops/sec) Did 18000 TrustToken-Exp0-Batch1 finish_redemption operations in 1021087us (17628.3 ops/sec) Did 154 TrustToken-Exp0-Batch10 generate_key operations in 1032878us (149.1 ops/sec) Did 72 TrustToken-Exp0-Batch10 begin_issuance operations in 1050908us (68.5 ops/sec) Did 4 TrustToken-Exp0-Batch10 issue operations in 1051989us (3.8 ops/sec) Did 4 TrustToken-Exp0-Batch10 finish_issuance operations in 1214996us (3.3 ops/sec) Did 5987000 TrustToken-Exp0-Batch10 begin_redemption operations in 1000068us (5986592.9 ops/sec) Did 154 TrustToken-Exp0-Batch10 redeem operations in 1037898us (148.4 ops/sec) Did 18000 TrustToken-Exp0-Batch10 finish_redemption operations in 1024788us (17564.6 ops/sec) Did 396 TrustToken-Exp1-Batch1 generate_key operations in 1060955us (373.2 ops/sec) Did 1925 TrustToken-Exp1-Batch1 begin_issuance operations in 1093039us (1761.1 ops/sec) Did 121 TrustToken-Exp1-Batch1 issue operations in 1022292us (118.4 ops/sec) Did 110 TrustToken-Exp1-Batch1 finish_issuance operations in 1076011us (102.2 ops/sec) Did 6322000 TrustToken-Exp1-Batch1 begin_redemption operations in 1000144us (6321089.8 ops/sec) Did 407 TrustToken-Exp1-Batch1 redeem operations in 1087774us (374.2 ops/sec) Did 18000 TrustToken-Exp1-Batch1 finish_redemption operations in 1031736us (17446.3 ops/sec) Did 407 TrustToken-Exp1-Batch10 generate_key operations in 1090415us (373.3 ops/sec) Did 187 TrustToken-Exp1-Batch10 begin_issuance operations in 1065754us (175.5 ops/sec) Did 11 TrustToken-Exp1-Batch10 issue operations in 1005738us (10.9 ops/sec) Did 10 TrustToken-Exp1-Batch10 finish_issuance operations in 1066494us (9.4 ops/sec) Did 6302000 TrustToken-Exp1-Batch10 begin_redemption operations in 1000029us (6301817.2 ops/sec) Did 407 TrustToken-Exp1-Batch10 redeem operations in 1087692us (374.2 ops/sec) Did 18000 TrustToken-Exp1-Batch10 finish_redemption operations in 1014611us (17740.8 ops/sec) Change-Id: I7ea9a8ab8ad48acfbf50026e251cc6e1d5b8ba7f Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41069 Reviewed-by: Steven Valdez <svaldez@google.com> Commit-Queue: Steven Valdez <svaldez@google.com>	2020-05-04 14:20:47 +00:00
Adam Langley	ad5582985c	fork_detect: be robust to qemu. fork_detect needs to know whether \|MADV_WIPEONFORK\| is supported by the kernel or not. However, current versions of qemu ignore madvise calls and just return zero, making it seems like it's supported when it's actually not. Therefore, try an madvise with -1 to ensure that clearly bogus values actually produce and error before trusting the result of calling with \|MADV_WIPEONFORK\|. Change-Id: I7f72714d5794274acabd0bee0b0ac470e1933774 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41024 Reviewed-by: Adam Langley <agl@google.com> Reviewed-by: David Benjamin <davidben@google.com> Commit-Queue: Adam Langley <agl@google.com>	2020-05-01 18:35:42 +00:00
David Benjamin	dc06e320d8	Remove unused code from ghash-x86_64.pl. Thanks to Brian Smith for pointing these out in https://boringssl-review.googlesource.com/c/boringssl/+/38724. Change-Id: I715da0778346fcc45aab19855050e18fe95a9185 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41144 Reviewed-by: Steven Valdez <svaldez@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-05-01 16:01:21 +00:00
David Benjamin	eeb5bb3561	Switch the P-384 hash-to-curve to draft-07. Leave the P-521 one alone as it's part of the current trust token experiment. But suffix all the functions by their draft until everything stabilizes. Also remove the ref_for_testing function since we can cite the fixed test vectors from the upstream PR. Change-Id: Ied89d26848c8ec1f6e8414a2385d9f3e491d7fb2 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/41067 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Steven Valdez <svaldez@google.com>	2020-04-30 20:23:27 +00:00
Steven Valdez	6a7184066f	Add hash-to-curve code for P384. Change-Id: I34c3609641c23aed14f2324c6887250369ae8b5f Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40944 Commit-Queue: Steven Valdez <svaldez@google.com> Reviewed-by: David Benjamin <davidben@google.com>	2020-04-29 17:16:17 +00:00
David Benjamin	b36f52d187	Write down the expressions for all the NIST primes. I find myself needing to look them up frequently and this is easier than digging through https://www.secg.org/SEC2-Ver-1.0.pdf each time. Change-Id: I4aa7cdf2c8f114a0f2ba8b9bd7ee394d3201d731 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40984 Reviewed-by: Steven Valdez <svaldez@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-04-27 17:27:49 +00:00
Adam Langley	21aede9179	Move fork_detect files into rand/ Files in the top-level of fipsmodule/ are considered be FIPS-support files by generate_build_files.py. However, fork_detect is different. Rather than have more special cases in the support scripts, this change moves fork_detect into fipsmodule/rand. It's not a perfect fit, and maybe it could have been its own directory, but it's not bad and fixes the build for now. Change-Id: I875088dd458069190dade870e085865e1306f55a Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40964 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: Adam Langley <agl@google.com>	2020-04-26 21:38:36 +00:00
David Benjamin	b1086cdb12	Harden against fork via MADV_WIPEONFORK. Linux 4.14 and up support MADV_WIPEONFORK, which can be used to reliably and efficiently detect forks. Use it to harden the PRNG and RSA blinding caches. On the RSA side, we currently cache blinding values. (Alas, removing the cache gives a very noticeable perf hit. There is some low-hanging fruit to trim a few Montgomery reductions, but it didn't offset much last I toyed with it.) Now, on Linux 4.14 and up, this cache is fork-safe. Since not all platforms that support fork also support fork detection, this should only be used as a hardening measure. Now, when detection is present, BoringSSL will skip doing per-call entropy draws from the kernel. (This might regress protection against VM cloning when no fast RDRAND is available. However, we need to do something for AMD machines. Hypervisors that clone VMs are going to need to signal the kernel to wipe WIPEONFORK pages.) Upgrade-Note: BoringSSL now calls some more syscalls on Linux. If this offends sandboxes, let us know. We can loosen the sandbox or add a mechanism to prime the MADV_WIPEONFORK page before entering it. Change-Id: I6ba43951aeaa2b9b81f74f9e5a7a0ce2de0438a4 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/34745 Reviewed-by: Adam Langley <alangley@gmail.com> Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: Adam Langley <agl@google.com>	2020-04-26 18:31:56 +00:00
David Benjamin	14d192e930	Fix typo in comment. Change-Id: I67a012a54a2818ec12c1829a746e2d7f796616d6 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40924 Commit-Queue: Steven Valdez <svaldez@google.com> Reviewed-by: Steven Valdez <svaldez@google.com>	2020-04-24 16:01:58 +00:00
David Benjamin	21f694210c	Use faster addition chains for P-256 field inversion. Switch to the addition chains by Brian Smith in https://briansmith.org/ecc-inversion-addition-chains-01#p256_field_inversion The new addition chains are a bit faster when measured independently. They aren't, however, noticeable when measured with everything else in ECDH. Rather, the motivation is just to align fiat_p256, nistz256, and a possible future fiat_p384 import. Since it's free, I've included the (negligible) z^-2 optimization, but if we ever want a z^-1 abstraction, it doesn't actually matter. In the meantime, it replaces the (even more negligible) Montgomery conversion optimization which is a bit less odd on the EC_FELEM abstraction. (I'm pondering whether we want an EC_AFFINE abstraction given how the Trust Tokens DLEQ proofs work.) fiat_p256 (64-bit): Before: Did 539000 P-256 get x operations in 5007148us (107646.1 ops/sec) Did 532000 P-256 get x and y operations in 5008736us (106214.4 ops/sec) After: Did 607000 P-256 get x operations in 5005225us (121273.3 ops/sec) Did 594000 P-256 get x and y operations in 5001251us (118770.3 ops/sec) nistz256: Before: Did 1472000 P-256 get x operations in 5003286us (294206.6 ops/sec) Did 1445000 P-256 get x and y operations in 5002052us (288881.4 ops/sec) After: Did 1491000 P-256 get x operations in 5002524us (298049.5 ops/sec) Did 1452000 P-256 get x and y operations in 5003193us (290214.7 ops/sec) I haven't bothered checking in the benchmarks as those operations standalone are largely artificial. They're a consequence of using the same type for affine and Jacobian points. Change-Id: I71e0d50a8712198f9cb8f68d50894d14a6091635 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40867 Reviewed-by: David Benjamin <davidben@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-04-23 21:31:07 +00:00
David Benjamin	47b1e39042	Tidy up third_party/fiat. Originally, when we imported fiat-crypto field operations, the pipeline was in early stages and the generated code had to be manually integrated with the rest of the curve implementation, so we moved all our supporting code to third_party/fiat for simplicity. Over time more supporting code, like the table generators, landed there to be next to its callers. fiat-crypto now generates standalone files which we #include into the supporting code. This moves the supporting code back to the usual location. It also updates the README.md file to reflect the new pipeline. (Most of it was a documentation of the old pipeline, which was much more manual.) Change-Id: I64db7235feb6566f0d3cd4db3a7146050edaf25a Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40904 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2020-04-23 19:50:07 +00:00
David Benjamin	8bbc5e9a61	Add missing curve check to ec_hash_to_scalar_p521_xmd_sha512. The bounds on k make this a little tricky to test, so stick an assert(0) as that codepath should be impossible. Change-Id: I03958ed36bff4f0b420a446c6d49eca944f45da2 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40885 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Steven Valdez <svaldez@google.com>	2020-04-23 17:44:57 +00:00
David Benjamin	c878b651ce	Align remaining Intel copyright notice. https://boringssl-review.googlesource.com/c/boringssl/+/25588/ missed one of the files. Change-Id: Iba3aa37f5ecd69fd35054c44cbd5f8988a05f76e Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40868 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: Adam Langley <agl@google.com>	2020-04-23 15:59:37 +00:00
Steven Valdez	78987bb7bb	Implement DLEQ checks for Trust Token. Change-Id: I6f263b775aafad6616b31af59096c3b4229fe3e1 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40684 Commit-Queue: Steven Valdez <svaldez@google.com> Reviewed-by: David Benjamin <davidben@google.com>	2020-04-21 13:42:50 +00:00
David Benjamin	7853619553	Match parameter names between header and source. clang-tidy flagged this in a diff. Running a full clang-tidy would probably reveal others, but ah well. Change-Id: Ib004f7df4fd53b326686810c314869b35b35e547 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40749 Reviewed-by: Steven Valdez <svaldez@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-04-17 16:37:18 +00:00
Steven Valdez	538a124d70	Trust Token Implementation. Trust Token implementation based on PrivacyPass using the PMBToken construction. This implementation currently omits the DLEQ proofs. https://github.com/alxdavids/privacy-pass-ietf/blob/master/draft-davidson-pp-protocol.md https://eprint.iacr.org/2020/072.pdf Change-Id: If236cc8beaf33a80bdad2991c3163f9dd0cb7571 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/39244 Commit-Queue: Steven Valdez <svaldez@google.com> Reviewed-by: David Benjamin <davidben@google.com>	2020-04-17 01:41:02 +00:00
Adam Langley	f37eb8d6a8	Include mem.h for \|CRYPTO_memcmp\| Change-Id: I3009326a5fa0c28bd3d214254e9beda4ea779833 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40744 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: Adam Langley <agl@google.com>	2020-04-16 21:10:07 +00:00
Adam Langley	3c11bf53e9	Add SHA-512-256. (Not wired up into all the signature verifiers because we don't need or recommend that.) Change-Id: Ia212a1f0e1c389a31d303e00a6fafb0ec3db7c71 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40704 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: David Benjamin <davidben@google.com>	2020-04-15 21:23:37 +00:00
David Benjamin	9fc31378f0	Make ec_GFp_simple_cmp constant-time. We need a constant-time point equality for two reasons. First, although multiplication results are usually public, their Jacobian Z coordinates may be secret, or at least are not obviously public. Second, more complex protocols will sometimes manipulate secret points, notably PMBTokens. While here I've renamed the inner function to points_equal without the flipped return value, to be less confusing. Update-Note: This does mean that we pay a 6M+2S Jacobian comparison where comparing two publicly affine points should cost no field operations at all. Code which compares two EC public keys for equality will be slightly slower. I wouldn't expect this to matter (if you actually use the public keys, you'll pay much much more) If it does, we can restore this optimization by keeping better track of affine vs. Jacobian forms. See https://crbug.com/boringssl/326. Bug: 326, chromium:1014199 Change-Id: I67c9a56bc9b66f30c0b500a29e8bf90427d89061 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40665 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2020-04-13 21:33:44 +00:00
David Benjamin	f883b98cfd	Tidy up CRYPTO_sysrand variants. We can extend the inline function trick to avoid needing to add extra wrappers to the fuzzer PRNG. Change-Id: Ie007a4ccaf0e2d703a3710e4298a774af861d514 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40624 Commit-Queue: Adam Langley <agl@google.com> Reviewed-by: Adam Langley <agl@google.com>	2020-04-13 19:15:28 +00:00
David Benjamin	3d22c8260a	Do a better job testing EC_POINT_cmp. If EC_POINT_cmp always returns points are equal, our tests barely notice. Change-Id: I1a78d95af2fad3d97c2db013468d34159807ff21 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40664 Reviewed-by: Adam Langley <agl@google.com>	2020-04-13 19:02:38 +00:00
David Benjamin	2a8e294b70	Follow-up comments to hash_to_scalar. See https://boringssl-review.googlesource.com/c/boringssl/+/40646/3#message-ee607e82b0c62dd73a1b8a81f03acd9329cbbf02 Additionally, to be consistent with hash_to_field, we ought to use a big-endian value. It's also probably time to have some common functions for dealing with converting BN_ULONG[]s to and from big-endian bytes. Coding all those free-handed is a little tedious and error-prone. Change-Id: I6bdcd9362cee60e160e5a8eca25206b052206e1f Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40647 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2020-04-10 18:35:25 +00:00
David Benjamin	28987cf081	Add a hash_to_scalar variation of P-521's hash_to_field. DLEQ proofs for PMBTokens need a random oracle over scalars as well as field elements. (Interestingly, draft-irtf-cfrg-voprf-03 section 5.1 does not specify as strong of requirements, but then their reference implementation does rejection sampling, so it's unclear.) Reusing the hash_to_field operation so hash calls use the domain separation tag consistently with other hash-to-curve operations seems prudent, so implement a companion function until the actual construction solidifies. Change-Id: I92d807bfddcca26db690cce0a3da551143c25ff3 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40646 Reviewed-by: Steven Valdez <svaldez@google.com> Commit-Queue: Steven Valdez <svaldez@google.com>	2020-04-10 15:37:52 +00:00
David Benjamin	5902657734	Double-check secret EC point multiplications. Doing a Jacobian point-on-curve check has negligible cost compared to the point operations. We may as well perform one to add some defense against potential bugs and such. (We already double-check RSA operations.) Before: Did 40000 ECDH P-224 operations in 3014872us (13267.6 ops/sec) Did 55000 ECDH P-256 operations in 3026351us (18173.7 ops/sec) Did 3410 ECDH P-384 operations in 3053181us (1116.9 ops/sec) Did 1364 ECDH P-521 operations in 3079656us (442.9 ops/sec) Did 83000 ECDSA P-224 signing operations in 3013476us (27542.9 ops/sec) Did 41000 ECDSA P-224 verify operations in 3073530us (13339.7 ops/sec) Did 168000 ECDSA P-256 signing operations in 3008562us (55840.6 ops/sec) Did 60000 ECDSA P-256 verify operations in 3038517us (19746.5 ops/sec) Did 6169 ECDSA P-384 signing operations in 3066741us (2011.6 ops/sec) Did 6231 ECDSA P-384 verify operations in 3054468us (2040.0 ops/sec) Did 2418 ECDSA P-521 signing operations in 3058901us (790.5 ops/sec) Did 2418 ECDSA P-521 verify operations in 3048765us (793.1 ops/sec) After: Did 39600 ECDH P-224 operations in 3001966us (13191.4 ops/sec) Did 55000 ECDH P-256 operations in 3033098us (18133.3 ops/sec) Did 3441 ECDH P-384 operations in 3088436us (1114.2 ops/sec) Did 1364 ECDH P-521 operations in 3087711us (441.8 ops/sec) Did 83000 ECDSA P-224 signing operations in 3029486us (27397.4 ops/sec) Did 40000 ECDSA P-224 verify operations in 3005452us (13309.1 ops/sec) Did 168000 ECDSA P-256 signing operations in 3011387us (55788.2 ops/sec) Did 60000 ECDSA P-256 verify operations in 3030343us (19799.7 ops/sec) Did 6076 ECDSA P-384 signing operations in 3023469us (2009.6 ops/sec) Did 6231 ECDSA P-384 verify operations in 3056138us (2038.8 ops/sec) Did 2418 ECDSA P-521 signing operations in 3057375us (790.9 ops/sec) Did 2449 ECDSA P-521 verify operations in 3083418us (794.2 ops/sec) Change-Id: Icedc51e340c8f3a21f96a535395814575e0c89b2 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40592 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2020-04-09 18:11:29 +00:00
David Benjamin	d2c5b7da2f	Make ec_felem_equal constant-time. This doesn't fix any particular issue, but we may as well use the constant-time comparison to be more robust. Change-Id: I96dffce7fe153a7dd4eec226a6b42dcea240c3f1 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40591 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2020-04-09 15:45:09 +00:00
David Benjamin	f20772cc0a	Make ec_GFp_simple_is_on_curve constant-time. This function (by way of EC_POINT_is_on_curve) is used by callers in two places: - To check the affine result of decoding a point. (This is no longer necessary because we'll always do it internally, but folks still do it.) - To check the Jacobian result of a multiplication as fault protection. (Tink does this. We should probably do it in the library.) That function's implementations of affine and Jacobian checks are mostly constant-time, but branching between the two isn't. Since the difference is small (2S + 1M vs 2S + 3M) compared to what one would be doing with an affine point (point multiplication), this probably isn't worth worrying about. Conservatively do the Jacobian check so folks like Tink aren't accidentally introducing side channels. Change-Id: I3140167868e027004906293df547add43ae40552 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40590 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: Adam Langley <agl@google.com>	2020-04-09 15:03:48 +00:00
David Benjamin	a49c617197	Implement draft-irtf-cfrg-hash-to-curve-06. This implements hash-to-curve for P-521, which is needed by the PMBTokens construction in https://eprint.iacr.org/2020/072.pdf. It is only an internal function for now, operating on EC_RAW_POINT, so that PMBTokens can avoid allocating EC_POINTs everywhere. If we ever have a need to expose this outside, we can add an EC_POINT wrapper (hopefully by then the draft will be stable). Note this implements two versions of the function due to a spec issue in P521_XMD:SHA-512_SSWU_RO_. One of them only exists to test against the original test vectors. See https://github.com/cfrg/draft-irtf-cfrg-hash-to-curve/issues/237 Bug: chromium:1014199 Change-Id: I7207d1bcb8b20f7111c2ffb40e2794ad2d3d0000 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40589 Reviewed-by: Adam Langley <agl@google.com> Commit-Queue: David Benjamin <davidben@google.com>	2020-04-08 22:18:51 +00:00
Adam Langley	7a22a65098	Only draw from RDRAND for additional_data if it's fast. We seek to incorporate entropy into every \|RAND_bytes\| call to avoid problems with fork() and VM cloning. However, on some chips, RDRAND is significantly slower than a system call thus crushing the performance of \|RAND_bytes\|. This change disables use of RDRAND for this opportunistic draw for non-Intel chips. BoringSSL will then fall back to either the OS, or nothing (if fork-unsafe mode has been set). RDRAND is still used for seeding the PRNG whenever it is available. This now adds a new blocking case: RDRAND may be used for seeding, but the syscall to get additional_data was still blocking. Previously that didn't matter because, if a syscall was used to get additional_data, then a blocking one had already been used to seed. Thus the syscall for additional_data is now non-blocking. Also, we had both \|hwrand\| and \|rdrand\| names hanging around. We don't support entropy instructions other than RDRAND, so unify around \|rdrand\| naming. If we ever do add support for more we can properly abstract at that time. Change-Id: I91121b270a2ebc667543dad1324f37285daad893 Reviewed-on: https://boringssl-review.googlesource.com/c/boringssl/+/40565 Commit-Queue: David Benjamin <davidben@google.com> Reviewed-by: David Benjamin <davidben@google.com>	2020-04-08 18:29:21 +00:00

1 2 3 4 5 ...

663 Commits