Oberon/ETH Oberon/stability
This document was originally hosted at the ETHZ. It remains under the ETH license and is in the WayBack archive.
Stability is a prominent highlight of Oberon. There exist however a number of ways to produce an "unrecoverable" system freeze and they are discussed below.
- Using the unsafe SYSTEM features or unsafe interfaces of low-level modules (e.g. Kernel) incorrectly.
- When a stack overflow occurs. Native Oberon and Bluebottle solve this problem by leaving the bottom page of the stack area in virtual memory unmapped. This causes a page fault when the stack overflows. This is reliable, even for large local variables, since stack initialization is done from top to bottom.
A stack overflow may happen when passing very large arrays as value parameters in procedure headings. Increasing the stack size (discussed at the end) is a way out, but the best solution is to use VAR formal parameters or pointers.
As a concrete example, try the following command on your Oberon system to see how it handles stack overflow induced by a recursive Trap call (warning: it may crash):
Here are the results of test conducted on the four ETH Oberon systems (latest versions):
- Native Oberon:
- Oberon for Bluebottle:
- ETH PlugIn Oberon for Windows (14.5.2001) running on Windows 2000 Version 5.0.2195 Service Pack 2:
- Oberon for Linux x86 crashes (terminates the Oberon process). This bug was analyzed by Günter Feldmann. He found no way to handle a segmentation violation caused by stack overflow in Linux. In other Unix versions known to him, stack overflow handling is not a problem as they support an alternate stack for signal (trap) handling. In the Solaris ports of Oberon, Temp.Trap yields two segmentation violation traps. The second (recursive) trap view contains a notice which informs the user that the first trap was probably caused by stack overflow. When he looked for the alternate signal stack in Linux last time (Kernel 2.2, x, glibc 2.2.1), he finally found the procedure "signalstack". But calling this procedure crashed the program. He hopes that in the near future signal stack will work in Linux too.
Stack size - Default size and how to adjust it
- Native Oberon: use the configuration string "StackSize" to specify how many bytes are allocated for the stack. The default value is 131072, i.e. 128K.
- PlugIn Oberon: The default size is 1MB. Two possibilities to adjust it are offered (effectively tested):
- allocate more space to the Thread executing the critical code: in this case the Oberon loop. In the procedure Oberon.Loop, specify the stack size (in bytes) in the Threads.Start call. The disadvantage of this approach is that the stack space is immediately allocated, even if it is not (immediately) used.
- allocate more space to all the Threads of the Oberon process. In the file Win32.Oberon.Link, search for the line HEAPSIZE and add a line STACKSIZE with an appropriate size right after it.
In both cases oberon.exe must be linked again using PELinker.Link Win32.Oberon.Link ~ (Oberon.exe is the implied destination). For a stacksize <= 1 MB adjust the size to a multiple of 4096 (Page size) and beyond 1 MB to a multiple of 1048576 (1 MB).
Notes for implementers:
- PELinker is part of the Developer package which must be installed (cfr. Packages.Tool).
- As long as the key of the module Oberon is not changed, it is enough to link the new Oberon.exe. Do not forget to copy the latter to the right place.
- Save the old Oberon.exe in a subdirectory, such as /Obj, as backup in case the new version fails.
- Bluebottle (not tested): In AosMemory.Mod, adjust the value MaxUserStackSize, compile the module and relink it. The default value is 128K but in reality the size is 124 KB because some space is reserved.
- Unloading a module whereby some procedure variable reference to it still exists. The visual gadget case is a common manifestation of that, as can be demonstrated with this sequence of operations:
Set the caret in a viewer
Execute Gadgets.Insert BasicGadgets.NewButton ~
Execute System.Free BasicGadgets ~ Explanation: The Button's Handler was "stolen". When freeing a Module, Oberon checks if there are still other Modules depending on that module but it cannot currently check if a procedure variable reference to it still exists (e.g. to the Handler of the visual gadget). Workaround: Do not System.Free a Module when its Objects are still displayed, since the Handler is called any time a Display.Broadcast (or update message) is sent (which is the case when new Viewers like a TRAP viewer are opened). Solution: The aim is to make the system to trap gracefully and not crash. One way to solve the problem is to have a termination handler for each module. This might be true in most cases, e.g. if you have installed a task, or some other upcall that can be uninstalled again. In the case of visual gadgets, this is difficult, because the gadgets are not linked anywhere except in the display space (by design). Even broadcasting a "remove yourself" message to the gadgets of the module will not always work, since the gadget may be in a covered track. This situation is handled correctly in Native Oberon. The pragmatic solution is:- Where possible, the module termination handler (installed with Modules.InstallTermHandler) cleans up so that this problem does not occur.
- When a frame handler traps, the frame is closed. This avoids recursive traps and stack overflows, which thus avoids system crashes. The code for this is a bit of a hack, but it seems very effective. It is implemented in Viewers.Broadcast, Viewers.Close and System.Trap. In some cases it produces false positives, e.g., sometimes when a command linked to a gadget traps, the (innocent) frame containing the gadget is closed. System.Recall can be used to recall it.
- When a "Volume full" situation is encountered - cfr. Partition size considerations.
- When Oberon.Text is corrupted, Oberon start-up can end abruptly. This situation can be caused by a typing error during the edition of Oberon.Text. To recover from this situation, the partition hosting the corrupted Oberon.Text must be edited by mounting the file system from another Oberon system. The other system could reside on the same machine, on an Oberon-0 boot diskette or on CD-ROM. Pay particular attention to unmatched "{" and "}" parentheses used for nesting sections in the text. Bluebottle also has an AosConfig.XML file which directs the start-up process and it is essential that it is syntactically well formed. A slight mistake could cause the start-up to derail. To enhance the system stability, the system is prepared to switch automatically to an alternate file Save.AosConfig.XML which is supposedly intact, thus giving the user a possibility to mend an error in AosConfig.XML. If the alternate file is corrupted, the same solution as described for Native Oberon must be applied. Both files are identical in the distribution.
11 Jun 2003 - Copyright © 2003 ETH Zürich. All rights reserved.
E-Mail: oberon at lists.inf.ethz.ch
Homepage: http://www.ethoberon.ethz.ch/