Optimizing JavaScript and solving the halting problem: Part II

Aug 18 2017

Optimizing JavaScript and solving the halting problemPart II

time to read 3 min | 544 words

In the previous post I talked about the issues we had with wanting to run untrusted code and wanting to use Jurassic to do so. The problem is that when the IL code is generated, it is then JITed and run on the machine directly, we have no control over what it is doing. And that is a pretty scary thing to have. A simple mistake causing an infinite loop is going to take down a thread, which requires us to Abort it, which means that we are now in a funny state. I don’t like funny states in production systems.

So we were stuck, and then I realized that Jurrasic is actually generating IL for the scripts. If it generates the IL, maybe we can put something in the IL? Indeed we can, here is the relevant PR for Jurrasic code. Here is the most important piece of code here:

The key here is that we can provide Jurrasic with a delegate that will be called at specific locations in the code. Basically, we do that whenever you evaluate a loop condition or just before you jump using goto. That means that our code can be called, which means that we can then check whatever limits such as time / number of iterations / memory used have been exceeded and abort.

Now, the code is written in a pretty funny way. Instead of invoking the delegate we gave to the engine, we are actually embedding the invocation directly in each loop prolog. Why do we do that? Calling a delegate is something that has a fair amount of cost when you are doing that a lot. There is a bit of indirection and pointer chasing that you must do.

The code above, however, is actually statically embedding the call to our methods (providing the instance pointer if needed). If we are careful, we can take advantage of that. For example, if we mark our method with aggressive inlining, that will be taken into account. That means that the JavaScript code will turn into IL and then into machine code and our watchdog routine will be there as well. Here is an example of such a method:

We register the watchdog method with the engine. You’ll notice that the code smells funny. Why do we have a watchdog and then a stage 2 watchdog? The reason for that is that we want to inline the watchdog method, so we want it to be small. In the case above, here are the assembly instructions that will be added to a loop prolog.

These are 6 machine instructions that are going to have great branch prediction and in general be extremely lightweight. Once in a while we’ll do a more expensive check. For example, we can check the time or use GC.GetAllocatedBytesForCurrentThread() to see that we didn’t allocate too much. The stage 2 check is still expected to be fast, but by making it only rarely, it means that we can actually afford to do this in each and every place and we don’t have to worry about the performance of that.

This means that our road is opened to now move to Jurassic fully, since the major limiting factor, trusting the scripts, is now handled.

Tweet Share Share 7 comments

Tags:

More posts in "Optimizing JavaScript and solving the halting problem" series:

(18 Aug 2017) Part II
(17 Aug 2017) Part I

Comments

18 Aug 2017
13:15 PM

Rafal

Question: Nil.JS or Jurassic as application scripting tool - did anyone compare these?

20 Aug 2017
12:42 PM

Oren Eini

Rafal, We looked at them, Jurassic seems better in terms of the integration abilities.

21 Aug 2017
04:43 AM

Jeremy Danyow

Would Edge.js be applicable here?

21 Aug 2017
06:06 AM

Oren Eini

Jeremy, We actually don't want to allow any modules at all. We want the code to access only what we give it. It also doesn't support the CoreCLR, which we require

23 Aug 2017
08:00 AM

Steve Wagner

The sad thing for both engines is that they stuck with a (now finally) pretty old version of the ES spec.

23 Aug 2017
10:19 AM

Oren Eini

Steve, Jint, at least, is using ES6, that is pretty modern, no?

25 Aug 2017
22:48 PM

Sebastien

Jint is not based on ES6, just ES5.1. There is a functional branch with the full ES6 parser but almost none of the new stuff is supported. Only Symbols so far. And I haven't been able to continue since December. https://github.com/sebastienros/jint/issues/343

Transpiling to ES5.1 is the only workaround right now.

Comment preview

Comments have been closed on this topic.

Oren Eini

Oren Eini

CEO of RavenDB