Professional Documents
Culture Documents
Whatsnew
Whatsnew
exe
-------------------------------------------
1) The "days between sending new end dates" preference now accepts values between
0.125 and
7.0 (was between 1 and 7). This lets you contact the server as frequently as
every three
hours. This can be useful in conjunction with the server feature that sends an
email to
you if the computer is more than one hour late contacting the server. To turn
on that
server feature, go to the CPUs web page, and click on the CPU, there you can
turn on a
couple of email features.
2) AVX2 support for trial factoring. About a 50% speedup over the previous
version. However,
all trial factoring should still be done on a GPU. A GPU is on the order of
100 times more
efficient at trial factoring than a CPU!!!
1) Changed the output to the worker windows during LL and PRP tests. The new
output includes
the estimated time to complete the test. There are two new options described
in undoc.txt:
ClassicOutput and OutputRoundoff.
3) Benchmarking on hyperthreaded machines now times only the most common cases.
Specifically,
hyperthreading is used only in the one cpu and all cpu cases.
4) Benchmarking trial factoring is now off by default. Prime95 should not be used
for trial
factoring. GPUs are about 100 times more efficient at that task.
6) There are many new options described in undoc.txt to customize the benchmarking
process.
1) Reduced memory usage. This may make some single-thread benchmarks slower, but
when
running several workers on machines where memory is a bottleneck there should
be a
small performance increase.
1) Supports Intel's new for fused multiply add instruction introduced with the
Haswell CPU.
This results in faster FFTs. Note that performance on many Haswell systems is
memory-bandwidth
limited. This means that when running workers on all cores performance gains
will be small.
2) Some minor optimizations may give a very small performance boost for AVX CPUs.
3) All new test torture test data for AVX CPUs. The new data runs more
interations, thus more time
is spent torturing the CPU rather than initializing the FFT routines.
4) Information added to result lines containing "has a factor". This information
may be used
by the server's manual web page to give proper TF / P-1 / ECM cpu credit at a
future date.
1) When an error occurs reading a save file it is renamed with a .bad extension.
On rare occasions the file can be read successfully at a later time.
1) Multi-threaded tests might be a little bit faster especially when using a lot
of threads.
Of course, single-threaded usage still gives the best throughput.
2) FFT crossover points were adjusted. Many higher, a few lower.
1) 32-bit FFTs optimized for AVX-capable computers. Intel Sandy Bridge computers
should
see a 25% speed increase.
New features in Version 26.6 of prime95.exe
-------------------------------------------
1) For rare cases where the program cannot figure out the number of cores and
hyperthreading,
the NumPhysicalCores option may help. See undoc.txt.
2) Faster FFT implementations are now selected for Core 2 CPUs with 1MB L2 cache
or less
(marketed under the Celeron and Pentium label).
3) New, slightly higher, trial factoring breakeven points.
1) A few crash bugs were fixed that affected only some CPU architectures and some
FFT lengths. Other minor bugs were fixed.
1) A bug that caused pfactor save file names to begin with the letter p
was fixed. It now uses m like all other P-1 efforts. Old save file
names are automatically upgraded.
2) Prime95 now recovers gracefully from more out-of-memory conditions
when doing ECM or P-1.
3) We now do P-1 factoring one bit level before the trial factoring limit.
The previous version started P-1 two bit levels before the trial
factoring limit.
1) A bug that caused the torture test to hang on 256K FFTs on SSE2 machines
with 128K of L2 cache was fixed.
1) Celeron D (256K L2 cache) and Willamette (also 256K L2 cache) now have
different implementations for several FFT sizes. This results in an
improvement of several percent for the Celeron D.
2) A bug that caused some machines to generate "Error 2252" when communicating
with the server was fixed.
3) SSE2 trial factoring code had a bug when factoring very large exponents.
1) For SSE2 machines the larger FFTs have been changed to more effectively
use a wide variety of L2 cache sizes. The previous version was optimized
for a 256KB L2 cache only. Depending on your CPU and FFT size, you could
see an improvement of several percent.
2) As a side "benefit" even larger FFT sizes are now supported. This allows
testing of exponents up to 596 million. Not recommended.
3) The factoring breakeven points have been recalculated using my 2 GHz P4.
This version of prime95 will do less trial factoring.
4) Since server database crashes cause spurious error 3 messages, prime95
will now ignore error 3 messages from the server for 72 hours. This should
workaround the problem whereby a result is reported but no CPU credit is
given and the reservation is not cleared.
5) Fixed crash bug trial factoring exponents above 286 million.
1) Faster FFTs for AMD64 using prefetchw in both 32-bit and 64-bit mode.
You can expect about a 15% speed improvement.
2) Prime95 now detects support for 3DNow! instructions. See undoc.txt
for overriding this detection with CpuSupports3DNow=n in local.ini.
3) Factoring entries in worktodo.ini now accept exponents up to 2 billion.
4) Improved checking for memory allocation errors during a torture test.
Better guessing of amount of memory to use in a blend test.
5) Added timeouts to PrimeNet communications in hopes of avoiding rare hangs
when contacting the PrimeNet server.
6) Fixed rare bug where P-1's GCD could miss a factor.
7) Added trial factoring to the benchmark.
8) Fixed bug in ECM when using zero-padded FFTs.
9) SSE2 macros optimized for an additional 1-3% improvement on P4 and AMD64
CPUs.
1) The blend torture test now uses less memory by default to prevent
thrashing. The in-place torture test is now the default more cases
to reduce complaints dealing with memory allocation issues (no pagefile,
thrashing, etc.)
2) A bug in continuing after finding a factor when using AdvancedFactor was
fixed.
1) Four changes have been made after GIMPS' first false positive report
in 7 1/2 years of operation.
a) The program now returns the number of errors that occurred when the
result is prime. A non-zero value will make us more suspect of the
reported prime.
b) The save files will not be deleted. The user can then email these to
me and we can rerun the last 30 minutes of the LL test. It is hard
to imagine a second false positive report in this mini LL run.
c) The shift counter is now checked every iteration. If this variable
AND the FFT data was corrupted, then and endless loop of LL iterations
generating zero is possible - resulting in a false prime report.
d) Every iteration the FFT data is checked to see if the data has been
zeroed.
e) The is-this-a-prime check now makes sure the FFT data is not NaN.
NaN stands for not a number and means the data is corrupt. The
previous version checked for zero and my C compiler returns TRUE for
the test NaN == 0.0!
2) I restored the old behavior sending relative URLs. Some users had trouble
with this feature introduced in 23.4. So, UseFullURL=0 is now the default.
3) Some prefetching improvements were made for the Athlon, Pentium 3, and
Celeron 2 processors. You can expect speed improvements between 3%
and 10% for most FFT sizes. Warning: the new code is slower for Durons
and Celeron's with small L2 caches for FFT sizes 1024K and above.
1) Further performance improvements in the SSE2 code. You can expect about
a 5% improvement in LL tests on a P4.
1) Big SSE2 FFTs now take the L2 cache size into account. P4 Celeron (128KB
L2 cache) is faster for FFTs between 512K and 2M. P4 Northwood (512KB
L2 cache) is faster for FFTs larger than 1M.
2) Benchmark no longer times 256K and 320K FFTs, but does time 2048K FFT.
3) Support for torture testing FFT sizes from 1280K to 4096K added.
4) A 900 MHz P-III is now required to get first time LL tests by default.
5) Slightly faster SSE2 FFTs for lengths of 5*2^N and 7*2^N (e.g. 640K, 896K).
1) A P-1 and ECM QA suite was implemented. A bug in ECM for exponents
below 172,700 and near the limit of an FFT size and using SSE2 code
was fixed.
1) A bug was fixed that caused some factors to be missed in stage 2 of P-1
when the available memory did not let the program allocate 12 temporary
variables. If testing a number in the 16 millions using an FFT size of
768K, then each temporary takes 768K * 8 bytes or 6MB. If your memory
setting was less than 72MB (6MB * 12 temporaries) then you were affected
by the bug. Actual the program allocates some fixed tables so anything
less than about 75MB triggered the bug.
1) A bug that caused factors to be missed in the last stage of P-1 and ECM
factoring was fixed. The bug was introduced in executables built after
Sept. 28, 2002.
1) Error rate for a clean run is now estimated at 1.8% raising your chances
of finding a Mersenne prime while double-checking.
2) You can now stop and continue testing from the system tray menu.
3) You can now pause prime95 when another program starts running. See the
PauseWhileRunning option in undoc.txt.
4) Fixed bug introduced in 22.8 where No Icon did not work if Start at Bootup
was also specified in Windows 98.
5) A bug in unscrambling the proxy server password in primenet.ini was fixed.
1) Soft FFT crossovers have been implemented. If you test an exponent that
is within 0.2% of the old hard FFT crossover point, then 1000 test
iterations are run to determine if the smaller or larger FFT is
appropriate for the exponent. Although not recommended, you can adjust
prime95's behavior to be more aggressive using the SoftCrossover and
SoftCrossoverAdjust features discussed in undoc.txt.
2) To better stress main memory, the torture test will now use up to the
amount of memory specified in the Options/CPU dialog box.
3) Iterations with roundoff checking are a little faster for non-SSE2 CPUs.
1) Given more data, the roundoff error checking is now done on every
iteration for exponents within 0.5% of the maximum that can be tested
by the current FFT length. If a roundoff error above 0.4 occurs,
then the iteration is now repeated without any change to the shift count.
It now takes a roundoff error greater than 0.6 to corrupt the results.
2) Many of the FFT ranges have changed. Version 21 was too aggressive
in choosing FFT sizes for the P4. The new handling of roundoff
errors above 0.4 lets us be more aggressive with the non-P4 code.
3) Result lines are now WYn rather than WXn.
4) A crash bug affecting P-1 and ECM using the 2^N+1 option for large N was
fixed.
5) A rare memory corruption and possible crash bug in the GCD code was
fixed.
6) The -t command line argument will run the torture test.
7) To reduce wild fluctuations in the RollingAverage, it will be updated
roughly twice per day.
1) Some bugs in error handling when communicating with the server have
been fixed.
2) Communicating with the server by RPC is no longer supported. The
HTTPNET.DLL and RPCNET.DLL have been deleted. Only "Basic" Proxy server
authentication is supported. Version 21 may have supported MS Proxy
Server 2.0's NTLM (NT Lan Manager) challenge/response authentication.
3) The program now uses a high resolution timer rather than the RDTSC
instruction to time events. This should help the program display accurate
timings on laptops with SpeedStep or desktops that can hibernate. You can
force the program to use the old RDTSC timing method with the RdtscTiming
option described in undoc.txt.
4) The program uses a updated algorithm to automatically detect CPU type and
speed. The Options/CPU dialog box no longer let's you set this
information. Instead, the Options/CPU dialog displays the detected
CPU type and speed. This new feature helps prevent incorrect settings
when users upgrade or try several overclocking speeds. If the new
algorithm fails, you can overide the settings as described in undoc.txt.
5) Benchmark now writes the program version, timing methodology, cpu type
and speed, L1 and L2 cache information to results.txt. It will refuse to
benchmark if rdtsctiming is 10.
1) Exiting the Torture Test now prints out how long it ran.
2) P4 error checking was relaxed slightly to reduce false alarms.
1) The program will now skip the P-1 factoring stage if another user has
already performed this step.
2) The Advanced/Quit GIMPS menu choice now lets you quit after current
work completes or quit immediately.
3) A bug was fixed in the error recovery code. After getting a "Disregard
last error" message, the user was treated to a new error on every
iteration. The end result was incorrect. The bug only affected the
error recovery of the new P4 FFT introduced in the beta version 21.2.
1) The program now uses the SSE2 instructions introduced on the Pentium 4 CPU.
This version is about 3 times faster than the previous version on a P4.
2) The program now uses the prefetch instructions on the Celeron 2, Pentium 3,
and all Athlon CPUs. This results in about a 20% performance boost on
these machines.
3) Process priority is now set to idle. Microsoft documentation says that
an idle thread priority overrides process priority. The program's
priority scheme has always worked well. However, several Win2K users
have reported that the program works better if the process priority is
also set to idle.
4) The program now delays calculations until 90 seconds after bootup. This
lets your machine boot up as fast as possible. This can be changed, see
undoc.txt.
5) The default crossover between double-check assignments and first-time
tests has been increased to a 400 MHz PII.
6) After 5 1/2 years, a help file now exists! It is HTML Help which may
have problems on older Windows machines. If so, try downloading
hhupd.exe from Microsoft's web site.
7) The program used to do factoring and P-1 testing on new exponents before
completing LL tests on older exponents. This was confusing to many.
The program now processes the worktodo.ini file in sequential order.
See undoc.txt on how to restore the old behavior.
8) Error messages such as ILLEGAL SUMOUT, SUMINP != SUMOUT, etc. are no
longer sent to the server since the final result contains a count of how
many errors occured during the LL test.
9) Some of the FFT crossover points have changed slightly.
10) Test/Status now outputs the day of the week each work item will complete.
11) Interim output lines have changed - hopefully, so they are more useful
especially to newbies. By default, output lines no longer contain the
clocks count. Lines now contain a timestamp. Benchmark timings are now
output in milliseconds. These defaults can be changed using options
in undoc.txt.
12) A new welcome screen for new users will encourage stress testers
to use the program without reserving exponents.
13) There is now a menu choice that lets you unreserve a specific exponent.
This is for knowledgable users only. You might do this if the server
assigned a small exponent and you'd rather be testing larger ones. Or
the prime95 you set up on another machine had a hard drive failure.
14) The Windows 9x Service menu choice has been replaced by the more general
Start at Bootup menu choice. This choice now sets registry entries to
autostart prime95 on any Windows machine. WARNING: Using this option
will delete any StartUp menu shortcuts so the registry entry and startup
menu do not both try to start prime95.
15) Prime95 will now ask for confirmation if you enter a CPU speed that
differs from the computed CPU speed by more than 4%.
16) The self-test menu choice was deleted - use the torture test instead.
17) A benchmark menu choice has been added.
18) The torture test code now includes more FFT lengths including smaller
ones that run completely in the L2 cache which may increase CPU
temperatures. Each FFT size is tested for 15 minutes by default.
There are now several options in the undoc.txt for fine tuning the
torture test's behavior.
1) A crash bug was fixed. If you did P-1 factoring followed by trial
factoring, then if you found a factor unallocated memory was accessed
usually resulting in a crash.
1) A rare error was fixed. When the memory settings changed during the
GCD step in P-1 or ECM factoring, then a spurious "ERROR: Factor
does not divide N!" error was raised.
1) Another fairly uncommon ECM bug was fixed. The bug caused "Factor does
not divide N!" errors.
2) A couple of minor bugs in computing the optimal P-1 bounds to use prior
to a Lucas-Lehmer test were fixed. The program now does a better job at
estimating the memory required in P-1 stage 2. Finally, although P-1
stage 2 working set size is unchanged, the program allocates less memory
in stage 2.
3) Prime95 no longer searches for a smaller factor when trial factoring
discovers a factor. The reasons are two-fold. 1) Version 19 had a
bug where stopping and restarting the program bypassed the search for
smaller factors. Thus, my database may already be missing smaller
factors. 2) As we factor larger exponents to a deeper depth it may
no longer be a quick job to determine if there are smaller factors.
Note, that version 20 will still look for smaller factors if you are
looking for factors below 2^60 with the FactorOverride option in undoc.txt.
4) The undocumented AMPM feature controls how times are formatted in the
Options/CPU dialog box.
1) If P-1 stage 1 completed and there was not enough memory to start
stage 2 immediately, then an incorrect save file was generated. This
bug was introduced in version 20.1. Upon restart of the P-1 factoring
job a crash or other unpredictable behavior was possible. This bug was
fixed and this version has special code to properly read these
incorrect save files.
2) P-1 will restart any time the memory settings change. This is done
so that the optimal P-1 bounds can be computed with the new memory
settings.
3) A bug in ECM testing was fixed.
1) Faster - in some cases as much as 10% faster!. The FFTs were recoded
for improved memory and TLB efficiency. Furthermore, optimizations
specific to the Pentium Pro and later processors were added.
2) New FFT lengths. The program can now test exponents as large as
79.3 million. Also, smaller FFT lengths are supported for use in
ECM and P-1 factoring.
3) More conservative FFT breakpoints. This could actually result in
some exponents being slower to test in this version. However, the
chance of a fatal rounding error has been reduced.
4) P-1 factoring has been added. Although it is not very practical for
large exponents because of a slow GCD routine, it can be used to
find new factors of exponents below a few million or so.
5) ECM can now run on large exponents. Once again, the slow GCD routine
and high memory requirements might make this impractical for large
exponents.
6) ECM and P-1 factoring now support save files. Very handy on lengthy runs.
The count of completed ECM curves is maintained in the save file instead
of the worktodo.ini file.
7) ECM and P-1 factoring lets you specify the amount of memory to use.
In some cases, more memory can improve execution speed slightly.
8) A bug in guessing the CPU speed on initial install has been fixed.
9) The preferences dialog now has an option to pause prime95 when a laptop
is running on its battery.
10) Error checking has been improved slightly.
11) Factoring is now "layered". That is, prime95 now factors to 2^50, then
2^51, 2^52, and so forth up to the appropriate limit. The factoring
output lines have been changed to show percent complete in the current
"layer".
12) A bug in running two or more self or torture tests in the same directory
has been fixed.
13) Trial factoring above 2^64 is now supported.
14) More trial factoring is now done to take into account the cost of
double-checking.
15) Title now contains percent complete when LL testing. By default, the
percent complete value is now displayed to 2 decimal places. You can
change this by setting PercentPrecision in prime.ini to a value
between 0 and 6.
16) Affinity and service name settings moved from prime.ini to local.ini file.
Prime95 will automatically move these settings for you.
17) An option to get only 10,000,000 digit numbers to run primality tests
on has been added to the Test/Primenet dialog box. See
http://www.mersenne.org/prize.htm for rules on claiming the EFF award
for finding a 10,000,000 digit prime.
18) The Advanced/Clear primes menu choice has been deleted.
19) The prime95 icon turns yellow when the program is idle. After an
error such as ILLEGAL SUMOUT, the icon will blink for 10 seconds.
20) The User Information dialog box allows you to request newsletters and
form a team user ID where the team members cannot alter the team name.
21) A bug in the reporting of error counts in the results.txt file has
been fixed.
22) The server can now broadcast important messages to the prime95 client.
Prime95 will blink the icon until prime95 is activated and then it will
display the message.
23) A computer ID will be generated for computers that do not have one. If
you want to assign your own computer ID, please try to make each
computer ID using the same user ID unique.
1) Added code so that server can distinguish between a v17 and v18 client.
2) Only v17 save files above 4194304 are deleted.
1) Quit GIMPS choice moved to the Advanced Menu (so novice users do
not confuse it with exit).
2) A bug in httpnet.dll was fixed. It now works for even more
proxy servers and firewalls.
1) More errors in detecting whether you are connected to the Internet have
been fixed.
2) Minor bugs fixed.
1) Command line arguments are now available to help you run the program
on your co-worker's machines!
1) Factoring speed has been doubled for Pentiums. 486 machines will
notice a 15% improvement when factoring.
2) The program will now automatically determine your CPU type and speed.
This information is used in choosing the optimal factoring algorithm.