Java performance: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 12:40, 19 January 2023 edit Blisswelness (talk \| contribs) 29 edits mNo edit summary Tags: Reverted Visual edit Newcomer task Newcomer task: update ← Previous edit		Latest revision as of 05:07, 14 August 2025 edit undo Bender the Bot (talk \| contribs) Bots 1,064,377 edits m HTTP to HTTPS for Blogspot Tag: AWB
(27 intermediate revisions by 13 users not shown)
Line 1: {{Short description\|Aspect of Java programming language}} {{Update\|reason=Is missing the many improvements in Java 8, 11, 17, 21, ... \|date=November 2023}} Java was created at Sun Microsystems, Inc., where James Gosling led a team of researchers in an effort to create a new language that would allow consumer electronic devices to communicate with each other. Work on the language began in 1991, and before long the team's focus changed to a new niche, the World Wide Web.{{Update\|reason=Is missing the many improvements in Java 8, 11, 17...\|date=April 2022}} {{Dablink\|This article is a general presentation of the [[Java (software platform)\|Java platform]] performance. For criticisms about Java performance, and more generally about the [[Java (programming language)\|Java language]], see [[Criticism of Java]].}} {{Use mdy dates\|date=October 2018}} In [[software development]], the programming language [[Java (programming language)\|Java]] was historically considered slower than the fastest [[~~Programming~~Third-generation ~~language~~programming ~~generations~~language\|~~3rd~~ third-generation]] [[Type system\|typed]] languages such as [[C (programming language)\|C]] and [[C++]].<ref>{{Cite web\|url=http://www.scribblethink.org/Computer/javaCbenchmark.html\|title=Java versus C++ benchmarks}}</ref> ~~The~~In ~~main~~contrast ~~reason~~to ~~being~~those ~~a different language design~~languages, ~~where~~Java ~~after compiling, Java~~compiles ~~programs~~by ~~run~~default onto a [[Java ~~virtual~~Virtual ~~machine~~Machine]] (JVM) ~~rather~~with ~~than~~operations ~~directly~~distinct onfrom those of the actual computer's ~~[[central~~hardware. ~~processor~~Early ~~unit\|processor]]~~JVM asimplementations were [[~~native~~Interpreter ~~code~~(computing)\|interpreters]],; asthey dosimulated Cthe ~~and~~virtual ~~C++~~operations ~~programs.~~one-by-one ~~Performance~~rather ~~was~~than atranslating ~~matter~~them ofinto ~~concern~~[[machine ~~because~~code]] ~~much~~for ~~business software has been written in Java after the language quickly became popular in the late 1990s and~~direct ~~early~~hardware ~~2000s~~execution. Since the late 1990s, the execution speed of Java programs improved significantly via introduction of [[just-in-time compilation]] (JIT) (in 1997 for [[Java version history\|Java 1.1]]),<ref name="~~symantec.com~~Symantec">{{Cite web \| url=http://www.symantec.com/about/news/release/article.jsp?prid=19970407_03 \| archive-url=https://web.archive.org/web/20100628171748/http://www.symantec.com/about/news/release/article.jsp?prid=19970407_03 \| url-status=dead \| archive-date=June 28, 2010 \| title=Symantec's Just-In-Time Java Compiler To Be Integrated Into Sun JDK 1.1 }}</ref><ref name=cnet1998/><ref>{{cite web \| url=http://grnlight.net/index.php/programming-articles/116-java-gets-four-times-faster-with-new-symantec-just-in-time-compiler \| archive-url=https://archive.today/20140527181040/http://grnlight.net/index.php/programming-articles/116-java-gets-four-times-faster-with-new-symantec-just-in-time-compiler \| title=Java gets four times faster with new Symantec just-in-time compiler}}</ref> the addition of language features supporting better code analysis, and optimizations in the JVM (such as [[HotSpot (virtual machine)\|HotSpot]] becoming the default for [[Sun Microsystems\|Sun]]'s JVM in 2000). Hardware execution of Java bytecode, such as that offered by ARM's [[Jazelle]], was also explored to offer significant performance improvements.▼ \| url-status=usurped \| archive-date=May 27, 2014 ▲\| title=Java gets four times faster with new Symantec just-in-time compiler}}</ref> the addition of language features supporting better code analysis, and optimizations in the JVM (such as [[HotSpot (virtual machine)\|HotSpot]] becoming the default for [[Sun Microsystems\|Sun]]'s JVM in 2000). Sophisticated [[garbage collection (computer science)\|garbage collection]] strategies were also an area of improvement. Hardware execution of Java bytecode, such as that offered by ARM's [[Jazelle]], was ~~also~~ explored ~~to offer significant~~but ~~performance~~not ~~improvements~~deployed. The [[Computer performance\|performance]] of a [[Java bytecode]] compiled Java program depends on how optimally its given tasks are managed by the host [[Java virtual machine]] (JVM), and how well the JVM exploits the features of the [[computer hardware]] and [[operating system]] (OS) in doing so. Thus, any Java [[Software performance testing\|performance test]] or comparison has to always report the version, vendor, OS and hardware architecture of the used JVM. In a similar manner, the performance of the equivalent natively compiled program will depend on the quality of its generated machine code, so the test or comparison also has to report the name, version and vendor of the used compiler, and its activated [[compiler optimization]] directives. ==Virtual machine optimization methods== Line 20 ⟶ 25: ===Just-in-time compiling=== {{Further\|Just-in-time compilation\|HotSpot (virtual machine)}} Early JVMs always interpreted [[Java bytecode]]s. This had a large performance penalty of between a factor 10 and 20 for Java versus C in average applications.<ref>{{cite web \| url=http://www.shudo.net/jit/perf/ \| title=Performance Comparison of Java/.NET Runtimes (Oct 2004) }}</ref> To combat this, a just-in-time (JIT) compiler was introduced into Java 1.1. Due to the high cost of compiling, an added system called [[HotSpot (virtual machine)\|HotSpot]] was introduced in Java 1.2 and was made the default in Java 1.3. Using this framework, the [[Java virtual machine]] continually analyses program performance for ''hot spots'' which are executed frequently or repeatedly. These are then targeted for [[Optimization (computer science)\|optimizing]], leading to high performance execution with a minimum of [[Overhead (computing)\|overhead]] for less performance-critical code.<ref> {{Cite web \| url=https://weblogs.java.net/blog/kohsuke/archive/2008/03/deep_dive_into.html Line 38 ⟶ 43: \| publisher=[[Intel Corporation]] \| access-date=June 22, 2007}}</ref> Some benchmarks show a 10-fold speed gain by this means.<ref>This [http://www.shudo.net/jit/perf/ article] shows that the performance gain between interpreted mode and Hotspot amounts to more than a factor of 10.</ref> However, due to time constraints, the compiler cannot fully optimize the program, and thus the resulting program is slower than native code alternatives.<ref>[http://www.itu.dk/~sestoft/papers/numericperformance.pdf Numeric performance in C, C# and Java ]</ref><ref>[http://www.cherrystonesoftware.com/doc/AlgorithmicPerformance.pdf Algorithmic Performance Comparison Between C, C++, Java and C# Programming Languages] {{webarchive\|url=https://web.archive.org/web/20100331155325/http://www.cherrystonesoftware.com/doc/AlgorithmicPerformance.pdf \|date=March 31, 2010 }}</ref> ===Adaptive optimizing=== Line 49 ⟶ 54: \| publisher=[[Sun Microsystems]] \| access-date=April 20, 2008}}</ref><ref>{{Cite web \| url=~~http~~https://headius.blogspot.com/2008/01/langnet-2008-day-1-thoughts.html \| title=Lang.NET 2008: Day 1 Thoughts \| quote=''Deoptimization is very exciting when dealing with performance concerns, since it means you can make much more aggressive optimizations...knowing you'll be able to fall back on a tried and true safe path later on'' Line 83 ⟶ 88: ====Escape analysis and lock coarsening==== {{Further\|Lock (computer science)\|Escape analysis}} Java is able to manage [[Thread (computer science)\|multithreading]] at the language level. Multithreading ~~is a method allowing~~allows programs to perform multiple processes concurrently, thus ~~producing~~improving ~~faster~~the performance for programs running on [[computer system]]s with multiple processors or cores. Also, a multithreaded application can remain responsive to input, even while performing long running tasks. However, programs that use multithreading need to take extra care of [[Object (computer science)\|objects]] shared between threads, locking access to shared [[Method (computer science)\|methods]] or [[block (programming)\|blocks]] when they are used by one of the threads. Locking a block or an object is a time-consuming operation due to the nature of the underlying [[operating system]]-level operation involved (see [[concurrency control]] and [[Lock (computer science)#Granularity\|lock granularity]]). Line 89 ⟶ 94: As the Java library does not know which methods will be used by more than one thread, the standard library always locks [[block (programming)\|blocks]] when needed in a multithreaded environment. Before Java 6, the virtual machine always [[Lock (computer science)\|locked]] objects and blocks when asked to by the program, even if there was no risk of an object being modified by two different threads at once. For example, in this case, a local {{code\|~~vector~~Vector}} was locked before each of the ''add'' operations to ensure that it would not be modified by other threads (~~vector~~{{code\|Vector}} is synchronized), but because it is strictly local to the method this is needless: <syntaxhighlight lang="java"> public String getNames() { final Vector<String> v = new Vector<>(); v.add("Me"); v.add("You"); Line 117 ⟶ 122: Before [[Java version history\|Java 6]], [[Register allocation\|allocation of registers]] was very primitive in the ''client'' virtual machine (they did not live across [[Block (programming)\|blocks]]), which was a problem in [[CPU design]]s which had fewer [[processor register]]s available, as in [[x86]]s. If there are no more registers available for an operation, the compiler must [[register spilling\|copy from register to memory]] (or memory to register), which takes time (registers are significantly faster to access). However, the ''server'' virtual machine used a [[Graph coloring\|color-graph]] allocator and did not have this problem. An optimization of register allocation was introduced in Sun's JDK 6;<ref>[http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6320351 Bug report: new register allocator, fixed in Mustang (JDK 6) b59]</ref> it was then possible to use the same registers across blocks (when applicable), reducing accesses to the memory. This led to a reported performance gain of about 60% in some benchmarks.<ref>[http://weblogs.java.net/blog/2005/11/10/mustangs-hotspot-client-gets-58-faster Mustang's HotSpot Client gets 58% faster!] {{Webarchive\|url=https://web.archive.org/web/20120305215143/http://weblogs.java.net/blog/2005/11/10/mustangs-hotspot-client-gets-58-faster \|date=March 5, 2012 }} in Osvaldo Pinali Doederlein's Blog at java.net</ref> ====Class data sharing==== Line 126 ⟶ 131: ==History of performance improvements== {{Update\|section\|date=~~January~~April ~~2022~~2023\|reason=The most recently mentioned version in this section, Java 7, is over a decade old; as of writing, Java 1720 is the current version}} {{Further\|Java version history}} Apart from the improvements listed here, each release of Java introduced many performance improvements in the JVM and Java [[application programming interface]] (API). JDK 1.1.6: First [[just-in-time compilation]] ([[NortonLifeLock\|Symantec]]'s JIT-compiler)<ref name="~~symantec.com~~Symantec"/><ref name="symantec compiler">{{Cite web\|last1=Mckay\|first1=Niali\|url=http://linxdigital.ca/java-four-times-faster-symantec-compiler.html\|title=Java gets four times faster with new Symantec just-in-time compiler}}</ref> J2SE 1.2: Use of a [[Garbage collection (computer science)#Generational GC (aka Ephemeral GC)\|generational collector]]. Line 157 ⟶ 162: \| publisher=Sun Microsystems \|last=Haase\|first=Chet \| quote=''At the OS level, all of these megabytes have to be read from disk, which is a very slow operation. Actually, it's the seek time of the disk that's the killer; reading large files sequentially is relatively fast, but seeking the bits that we actually need is not. So even though we only need a small fraction of the data in these large files for any particular application, the fact that we're seeking all over within the files means that there is plenty of disk activity.'' \|date= May 2007\| access-date=July 27, 2007}}</ref> Parts of the platform needed to execute an application accessed from the web when JRE is not installed are now downloaded first. The full JRE is 12 MB, a typical Swing application only needs to download 4 MB to start. The remaining parts are then downloaded in the background.<ref>{{Cite web Line 223 ⟶ 228: ==Comparison to other languages== Objectively comparing the performance of a Java program and an equivalent one written in another language such as [[C++]] needs a carefully and thoughtfully constructed benchmark which compares programs completing identical tasks. The target [[Platform (computing)\|platform]] of Java's [[bytecode]] compiler is the [[Java platform]], and the bytecode is either interpreted or compiled into machine code by the JVM. Other compilers almost always target a specific hardware and software platform, producing machine code that will stay virtually unchanged during execution{{citation needed\|reason=What are the real world, non-theoretical implications of this?\|date=May 2016}}. Very different and hard-to-compare scenarios arise from these two different approaches: static vs. [[dynamic compilation]]s and [[Dynamic recompilation\|recompilations]], the availability of precise information about the runtime environment and others. Java is often [[Just-in-time compilation\|compiled just-in-time]] at runtime by the Java [[virtual machine]], but may also be [[Ahead-of-time compilation\|compiled ahead-of-time]], as is C++. When compiled just-in-time, the micro-benchmarks of [[The Computer Language Benchmarks Game]] indicate the following about its performance:<ref> Line 295 ⟶ 300: Results for [[Benchmark (computing)\|microbenchmarks]] between Java and C++ highly depend on which operations are compared. For example, when comparing with Java 5.0: 32- and 64 -bit arithmetic operations,<ref>{{Cite web \| url=http://www.ddj.com/java/184401976?pgno=2 \| title=Microbenchmarking C++, C#, and Java: 32-bit integer arithmetic Line 305 ⟶ 310: \| publisher=[[Dr. Dobb's Journal]] \| date=July 1, 2005 \| access-date=January 18, 2011}}</ref> [[Input/output\|~~File~~file Iinput/Ooutput]], <ref>{{Cite web \| url=http://www.ddj.com/java/184401976?pgno=15 \| title=Microbenchmarking C++, C#, and Java: File I/O \| publisher=[[Dr. Dobb's Journal]] \| date=July 1, 2005 \| access-date=January 18, 2011}}</ref> and [[~~Exception~~exception handling]],<ref>{{Cite web \| url=http://www.ddj.com/java/184401976?pgno=17 \| title=Microbenchmarking C++, C#, and Java: Exception Line 316 ⟶ 321: \| date=July 1, 2005 \| access-date=January 18, 2011}}</ref> have a similar performance to comparable C++ programs Operations on [[Array data type\|~~Array~~array]]s<ref>{{Cite web \| url=http://www.ddj.com/java/184401976?pgno=19 \| title=Microbenchmarking C++, C#, and Java: Array \| publisher=[[Dr. Dobb's Journal]] \| date=July 1, 2005 \| access-date=January 18, 2011}}</ref> ~~operations~~have better performance ~~are better~~ in C. The performance of [[~~Trigonometric~~trigonometric functions]] ~~performance~~ is much better in C.<ref>{{Cite web \| url=http://www.ddj.com/java/184401976?pgno=19 \| title=Microbenchmarking C++, C#, and Java: Trigonometric functions Line 334 ⟶ 339: ===Multi-core performance=== The scalability and performance of Java applications on multi-core systems is limited by the object allocation rate. This effect is sometimes called an "allocation wall".<ref>Yi Zhao, Jin Shi, Kai Zheng, Haichuan Wang, Haibo Lin and Ling Shao, [http://portal.acm.org/citation.cfm?id=1640116 Allocation wall: a limiting factor of Java applications on emerging multi-core platforms], Proceedings of the 24th ACM SIGPLAN conference on Object oriented programming systems languages and applications, 2009.</ref> However, in practice, modern garbage collector algorithms use multiple cores to perform garbage collection, which to some degree alleviates this problem. Some garbage collectors are reported to sustain allocation rates of over a gigabyte per second,<ref>[{{Cite web \|url=http://www.azulsystems.com/sites/default/files/images/c4_paper_acm_0.pdf \|title=C4: The Continuously Concurrent Compacting Collector] \|access-date=October 29, 2013 \|archive-date=August 9, 2014 \|archive-url=https://web.archive.org/web/20140809222603/http://www.azulsystems.com/sites/default/files/images/c4_paper_acm_0.pdf \|url-status=dead }}</ref> and there exist Java-based systems that have no problems scaling to several hundreds of CPU cores and heaps sized several hundreds of GB.<ref>[https://www.theregister.co.uk/2007/06/15/azul_releases_7200_systems/ Azul bullies Java with 768 core machine]</ref> Automatic memory management in Java allows for efficient use of lockless and immutable data structures that are extremely hard or sometimes impossible to implement without some kind of a garbage collection.{{citation needed\|date=September 2018}} Java offers a number of such high-level structures in its standard library in the java.util.concurrent package, while many languages historically used for high performance systems like C or C++ are still lacking them.{{citation needed\|date=September 2017}} Line 401 ⟶ 406: \| url=http://www.osnews.com/story/5602&page=3 \| title=Nine Language Performance Round-up: Benchmarking Math & File I/O \| last=W. Cowell-Shah \| first=Christopher W. \| date=January 8, 2004 \| access-date=June 8, 2008 Line 420 ⟶ 425: \| access-date=February 15, 2008}} </ref><ref>{{Cite web \|url = ~~http~~https://janet-project.sourceforge.net/papers/jnibench.pdf \|title = Efficient Cooperation between Java and Native Codes - JNI Performance Benchmark \|last = Kurzyniec Line 430 ⟶ 435: \|archive-date = February 14, 2005 \|df = dmy-all }}</ref>{{sfn\|Bloch\|2018\|loc=Chapter §11 Item 66: Use native methods judiciously\|p=285}} [[Java Native Access]] (JNA) provides [[Java (programming language)\|Java]] programs easy access to native [[Shared library\|shared libraries]] ([[dynamic-link library]] (DLLs) on Windows) via Java code only, with no JNI or native code. This functionality is comparable to Windows' Platform/Invoke and [[Python (programming language)\|Python's]] ctypes. Access is dynamic at runtime without code generation. But it has a cost, and JNA is usually slower than JNI.<ref>{{Cite web \|url = https://jna.dev.java.net/#performance \|title = How does JNA performance compare to custom JNI? Line 494 ⟶ 499: \|author1=Chris Nyberg \|author2=Mehul Shah \| access-date=November 30, 2010 }}</ref><ref name=googlemapreduce>{{cite web \| url=~~http~~https://googleblog.blogspot.com/2008/11/sorting-1pb-with-mapreduce.html \| title=Sorting 1PB with MapReduce \| date=November 21, 2008 Line 513 ⟶ 518: [[Java ConcurrentMap]] ==~~References~~Citations== {{Reflist\|2\|refs= <ref name=cnet1998> Line 520 ⟶ 525: \| date= May 12, 1998 \|access-date= November 15, 2015}}</ref> }} ==References== {{cite book \|last=Bloch\| first=Joshua\| title= "Effective Java: Programming Language Guide" \| publisher=Addison-Wesley \| edition=third \| isbn=978-0134685991\| date=2018}} ==External links==