Using the debugger¶
PyCharm has a built-in debugger; however, there are some tricky problems described below.
Breaking on Exception¶
Breaking on exceptions is one of the most important debugger features. However, there are some problems with PyCharm's exception debugging.
Break on Termination vs. on Raise¶
By default, the program breaks on termination (unhandled exception). This is usually the correct configuration. However, in Jupyter, all exceptions are caught so that they do not break Jupyter itself. Therefore, in Jupyter, all exceptions are ignored by the debugger if the breakpoints are set to break on termination.
To break on exceptions in Jupyter, we have to break on raise. With this setting, however, we stop even on expected/handled exceptions, potentially stopping at hundreds of breakpoints in library code.
Another issue is with the setting itself. To propagate a change between breaking on raise and breaking on termination, we have to deactivate and then reactivate the exception breakpoints; otherwise, the setting is ignored.
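A minimal, hypothetical illustration of why break-on-raise is noisy: the function below handles its exception internally, so break-on-termination never stops, but break-on-raise halts the debugger at the raise site on every bad input.

```python
def parse_int(text):
    # Library-style code: raising and catching exceptions is normal control flow here.
    try:
        return int(text)
    except ValueError:
        # The exception is handled; break-on-termination ignores it,
        # but break-on-raise stops the debugger at the raise site above.
        return None

values = [parse_int(s) for s in ["1", "x", "3"]]
```

In real library code such handled exceptions can occur hundreds of times per run, which is exactly the problem described above.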
Profiling¶
Profiling in Python is surprisingly difficult compared to other languages. There is no all-in-one profiler for Python, and most of the profilers fail to analyze complex code and thus are unreliable. The table below summarizes the situation:
| Profiler | Reliable | Granularity | Call Graph | Mode | Code Modif. Required | Script Must Finish | Notes |
|---|---|---|---|---|---|---|---|
| cProfile | Yes | function-level | No | Det. | No | Yes | Included in the standard library. Mostly useful as a backend for other profilers |
| PyCharm | Partially | function-level | incomplete and confusing | Det. | No | No | Built-in profiler in PyCharm. By default, it uses cProfile as a backend. |
| line_profiler | Yes | line-level | No | Det. | Yes | Yes | |
| pyinstrument | Yes | function-level | Yes | Sampling | No | Yes | Both command line and HTML GUI |
| py-spy | ? | function-level | No | Sampling | No | Yes | Broken on Windows |
| Scalene | No | line-level | No | Sampling | No | Yes | Absolutely random results |
Typically, the best profiling approach is:
- find the functions that are taking the most time using pyinstrument
- If we need to find specific lines of a function, use line_profiler for just that specific function identified as a bottleneck using pyinstrument
pyinstrument¶
The pyinstrument profiler is the most reliable profiler that provides a call tree. Usage:
- Run the profiler: `pyinstrument <script> <script arguments>`. The results are displayed immediately in the terminal.
- Explore the results: `pyinstrument --load-prev <previous profile name>`. Parameters:
    - `--show-all`: show all functions, including library functions
    - `-r`: render variant. Possible values:
        - `html`: HTML output
Line Profiler¶
The line_profiler package provides line-by-line profiling of the code. Unfortunately, the corresponding PyCharm plugin is currently broken and does not work with the latest PyCharm version (2025.2), so we have to use the command-line interface.
The usage is:
- Decorate functions of interest with the `@profile` decorator
- Run the profiler: `kernprof -l -v <script> <script arguments>`
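A minimal sketch of a script prepared for kernprof (the file name `example.py` is hypothetical). kernprof injects `profile` into builtins at runtime; the fallback below is a common pattern that lets the same script also run without kernprof:

```python
# example.py -- run with: kernprof -l -v example.py
import builtins

if not hasattr(builtins, "profile"):
    # Fallback no-op decorator so the script also runs without kernprof.
    def profile(func):
        return func

@profile  # line_profiler reports per-line timings inside decorated functions
def slow_sum(n):
    total = 0
    for i in range(n):  # the per-line report will show this loop dominating
        total += i * i
    return total

if __name__ == "__main__":
    print(slow_sum(1000))
```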
PyCharm Profiler¶
PyCharm has a built-in profiler that is very simple to use:
- Select the three dots next to the debug button and click on `Profile <script name>`
- Wait some time for the program to warm up and get to the important parts of the code
- In the left panel, select `Create snapshot`
- Explore the results
Each snapshot contains the following tabs:
- `Flame Graph`: frequently broken, with an absolutely incorrect visualization.
- `Call Tree`: again, frequently broken.
- `Method List`: can be broken as well.
- `Statistics`: this view comes directly from the cProfile backend and is the only reliable view in PyCharm.
- `Call Graph`: typically both incorrect and totally visually broken.
The built-in profiler provides only aggregated results for each function. To get line-by-line results, we need the line_profiler package.
cProfile¶
Usage: `python -m cProfile <script> <script arguments>`
This way, everything is output to the terminal, which is not very convenient, as the list of functions may be too long. One option is to redirect the output to a file and then read the top. But there is also a GUI option using the snakeviz package:
```
pip install snakeviz
python -m cProfile -o profile.prof <script> <script arguments>
snakeviz profile.prof
```
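For quick checks, cProfile can also be driven programmatically via the standard library's `pstats` module, sorting by `tottime` as recommended below (the `work` function is just a stand-in workload):

```python
import cProfile
import io
import pstats

def work():
    # Stand-in workload for something we want to profile.
    return sum(i * i for i in range(100_000))

profiler = cProfile.Profile()
profiler.enable()
work()
profiler.disable()

# Sort by tottime (time spent in the function itself) and print the top entries.
stream = io.StringIO()
stats = pstats.Stats(profiler, stream=stream).sort_stats("tottime")
stats.print_stats(5)
print(stream.getvalue())
```

This avoids rerunning the whole script under `python -m cProfile` when only one code path is of interest.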
Individual output columns:
- `ncalls`: number of calls
- `tottime`: total time spent in the function itself. The most important column to look at.
- `percall`: time spent in the function per call
- `cumtime`: total time spent in the function, including the time spent in the called functions. This is the column by which the list of functions is sorted.
- `percall`: time spent in the function per call, including the time spent in the called functions
- `filename:lineno(function)`: filename, line number, and function name
Scalene¶
Scalene is a profiler that allegedly provides CPU, memory, and GPU profiling, all with minimal performance impact. In reality, however, the profiler results are quite random: it does not show all the function calls at all and totally fails to identify the bottlenecks.
Usage: `scalene run <script> --- <script arguments>`. Important parameters:
- `--cpu-only`: only CPU profiling
- `--profile-all`: profile all files. By default, only the main script is profiled (even more useless).

To display the results, use `scalene view`.