/search.css" rel="stylesheet" type="text/css"/> /search.js">
 All Classes Namespaces Files Functions Variables Typedefs Enumerations Enumerator Friends Macros Groups Pages
1.11 - May ??, 2017
  • Changes to conventional/Engine.hpp:
    • The sequence of generated randoms is unchanged.
    • shrink the memory footprint by not keeping a copy of the ukey in the Engine and storing the 'elem' counter in v.back().
    • insertion and extraction operators have changed incompatibly. I.e., engines serialized with 1.10 or earlier cannot be deserialized with 1.11, and vice versa.
    • getkey returns a key_type and setkey takes a key_type
    • when ukey_type is not the same as key_type, provide a constructor and a seed method that takes a const key_type& argument.
    • the getseed method is eliminated
  • Move uniform.hpp, boxmuller.hpp and u01fixedpt.h into include/Random123. I.e., make them available in exactly the same way as threefry.h, et al. It is no longer necessary to copy them from the examples directory to user code.
  • Incorporate ua.hpp into uniform.hpp.
  • Provide insertion, extraction and comparison operators to the aesopenssl16x8_key_t and aesni1xm128_key_t.
  • Separate tests and benchmarks (which are absurdly complicated) from examples (which are meant to be short and simple) in the source tree. We still test and time the same things, but all that code is in tests/, while the code remaining in examples/ really tries to be exemplary.
  • More careful use cl_uint and elimination of uint from opencl code.
  • Better (but not reliable) support for OpenCL on MacOS.
  • Fix documentation for examples/uniform.hpp.

1.10 - Apr 12, 2016
  • R123EXAMPLE_ENVCONF_SEED environment variable provides seed to most pi_* examples
  • R123EXAMPLE_ENVCONF_CUDA_DEVICE and R123EXAMPLE_ENVCONF_OPENCL_DEVICE allow selection of non-default GPUs, e.g. R123EXAMPLE_ENVCONF_CUDA_DEVICE=980 pi_cuda or R123EXAMPLE_ENVCONF_OPENCL_DEVICE=680 pi_opencl
  • Updated hardcoded device core detection with newer GPUs (Kepler, Maxwell) and changed default cores from 1 to 384 to run more sensible number of iterations on unrecognized devices.
  • moved lots of arbitrary, funny, irrelevant constants into example_seeds.h so that they do not confuse the punters.
  • Tested on ARMv7 (32bit) and `ARM A53 (64bit) with Freescale/Linaro toolchain gcc 4.8. Needed workaround for -O3 compiler bug on ARMv7.
1.09 - Mar 6, 2016
  • add #define R123_USE_U01_DOUBLE to u10fixedpt.h
  • fix definition of const char *srcstr[] in util_opencl.h
  • improve nvccfeatures.h so that 64-bit philox can be used both on the host and on the cuda device.
  • add _Pragmas so R123_STATIC_ASSERT doesn't warn with clang-3.6.
  • boxmuller.hpp doesn't look for sincos when APPLE is true.
1.08 - Aug 20, 2013
  • Fix a bug in threefry2xW whereby rotation constants were chosen incorrectly after the 20th round. There is no reason to suspect that the incorrectly chosen rotation constants were "bad" or "deficient" so there is no reason to suspect the quality of the random numbers produced. Using threefry with R>20 would have been rare in any case. The fix simply makes the implementation correspond correctly to the description in the paper.
  • Add examples/uniform.hpp, examples/boxmuller.hpp and examples/ua.hpp, containing example code to generate uniformly and gaussian distributed floats and doubles. Also add unit tests (ut_uniform.cpp, ut_uniform_IEEEkat.cpp) and timing harnesses (time_boxmuller.cpp and time_boxmuller_cuda.cpp).
  • overhaul u01_closed_closed and u01_open_open so they produce equally spaced/probable values except for endpoints.
  • Demote u01.h from the include/Random123/ to examples/ufixed01.h, but examples/uniform.hpp is preferred.
  • Add kat_vectors for threefry2xW_32 and threefry4xW_72, i.e., the largest number of supported rounds in each case.
  • Mention ukey on the first page of docs.
  • Fix typos in documentation and comments and in gccfeatures.h.
  • Don't #include <x86intrin.h> unless (defined(x86_64)||defined(i386)).
  • Use clang rather than llvm in the predicate that decides whether to include clangfeatures.h.
  • Add support for Portland Group compilers.
  • Add support for gcc and IBM XL compilers on powerpc64, e.g. BlueGene/Q.
  • Port to PPC. (BG/L)
  • Refuse to build with CUDA before 4.1 to avoid namespace bug.
  • Update/fix core count hackery NVIDIA Tesla K20c

1.07 - Nov 7, 2012
  • Provide const static data members: _Min and _Max in Engine and MicroURNG, which work around a non-standard requirement imposed by the MacOS Xcode 4.5.2 <random> library.
  • Fine-grained test macros for specific features of C++11 rather than an all-or-none USE_CXX0X macro. Features tested: constexpr, unrestricted unions, explicit conversions, <random> and <type_traits>.
  • Declare max() and min() methods with the R123_CONSTEXPR attribute in Engine and MicroURNG.
  • Improved clang support
  • Works with Solaris Sun CC now (requires -library=stlport4)
  • NVIDIA GTX6x0 should report correct core count
  • ut_features prints the list of features that compile on the build platform
  • Compiles cleanly with -Wall -Wextra (implying -Wunused-parameter -Wsign-compare)

1.06 - Apr 5, 2012
  • Added a known answer test for the u01 functions (kat_u01).
  • Defend headers against possible max and min macros in "system" header files.

1.05 - Mar 20, 2012
  • MicroURNG and GSL_MICROURNG no longer give the programmer control over the number bits in the internal counter via a template or macro argument. Such flexibility is too easy to misuse, with the possible consequence of repeating random values. Now, all MicroURNGs have a period in excess of 2^32, and MicroURNG<CBRNG>(c1, k1) and MicroURNG<CBRNG>(c2, k2) "collide" if and only if c1==c2 and k1==k2.
  • Added <Random123/u01.h> header file with static functions for conversion of integers to uniformly distributed floating point values.
  • Make operator<<(ostream&, const r123array&) and operator>>(ostream&, r123array&) inline. This should work around ODR violations when the header files are included in more than one file in a project.
  • Recognize that icpc has an _mm_set_epi64x intrinsic as of version 12.1.
  • The gsl adapters always return 32-bits of random data, regardless of the width of counter type.
  • Works around lack of anonymous enums in Intel OpenCL 1.5
  • gencl.sh works on MacOS and systems with "classic" BSD indent.
  • Tests run on AMD Radeon 7970 (Tahiti).

1.04 - Dec 5, 2011
  • new kat_vectors - there are now three tests for each tested generator: gen(ctr=0, key=0), gen(ctr=fff, key=0xfff), and gen(ctr=digits-of-pi, key=more-digits-of-pi). There are fewer tests overall, but they provide better coverage, especially of non-zero keys.

1.03 - Nov 30, 2011
  • overhaul known answer tests (kat)
    • common source for serial C, C++, cuda and opencl replaces katc and katpp
    • add missing kat_vectors for threefry2x32
  • make keyinit functions device/kernel functions in CUDA/OpenCL
  • replace r123array::assemble methods with r123array::seed(SeedSeq) template
  • cleanup of signatures of Engine and MicroURNG methods:
    • Engine(ukey_type&) and MicroURNG(ctr_type, ukey_type) constructors.
    • Engine(SeedSeq&) takes a reference argument, and, when C++0x <type_traits> are available the SeedSeq templates don't participate in "surprising" overloads.

1.02 - Nov 21, 2011
  • BUG FIXED in threefry2xW_R(R<20, ...). Earlier implementations of threefry2x32_R and threfry2x64_R produced incorrect results when called with the number of rounds less than 20. The bug caused extra work to be done in every call, possibly resulting in more mixing than the correct implementation delivers. Both old (buggy, more mixing) and new (bug-fixed, less mixing) implementations have been rechecked and are fully "Crush-Resistant".
  • worked around problems with argument marshalling in version 2.4 of the AMDAPPSDK OpenCL implementation

1.01 - Nov 11, 2011
  • allow the user to define __STDC_CONSTANT_MACROS
  • allow r123arrayNxW::incr(larger_than_largest_value_type)

1.00 - Sep 26, 2011
Initial release