The Lounge - CodeProject

Re: There are many gotos, but these ones are mine

k505412-May-24 4:00

12-May-24 4:00

Ravi Bhavnani wrote:
And most (I suspect all) modern compilers won't allow specifying the target of a goto into another block

Not exactly sure what you mean here, but C/C++ certainly allows you to goto into a contained block, or to a label in another code block within the same containing block e.g.

void f(int n)
{
    if(n > 1)
      goto foo;
      // do stuff
     if( ... )
     {
        // some stuff
        if ( ... )
            goto foo;     // jumps down to inside next if block
        // other stuff
     } 
     if(  ... )
     {
       foo:
       // do more stuff
     }
}

Will compile just fine. You can get a warnings if you have an initialization before label foo, with the right warning options (gcc/clang, at least), but a quick perusal of the warning options doesn't seem to suggest that there's a warning for a goto int a contained block.

But maybe you meant that you can't jump from one block to another, like

C++

void foo()
{
   foo_label:
   // ...
}

void bar()
{
   // ...
   goto foo_label;  // invalid non-local goto
}

But for that you have setjmp/longjmp, which, of course, should be avoided like the plague.

"A little song, a little dance, a little seltzer down your pants"
Chuckles the clown

Re: There are many gotos, but these ones are mine

Ravi Bhavnani12-May-24 6:05

Ravi Bhavnani

12-May-24 6:05

k5054 wrote:
C/C++ certainly allows you to goto into a contained block, ...

Thanks for the correction. The C# compiler doesn't permit that.

/ravi

My new year resolution: 2048 x 1536
Home | Articles | My .NET bits | Freeware
ravib(at)ravib(dot)com

Re: There are many gotos, but these ones are mine

honey the codewitch12-May-24 8:37

honey the codewitch

12-May-24 8:37

C++ is painfully permissive at times, but not nearly so bad as C. Smile | :)

C# brought a bit of sense to the mess, even if it sacrifices the ability to do some of the black magic.

Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix

Re: There are many gotos, but these ones are mine

jmaida12-May-24 14:46

jmaida

12-May-24 14:46

totally agree, Ravi. Well said.

"A little time, a little trouble, your better day"
Badfinger

Re: There are many gotos, but these ones are mine

Daniel Will13-May-24 17:09

Daniel Will

13-May-24 17:09

Break, continue, and return are basically goto, when translated to low level machine codes Thumbs Up | :thumbsup:

Also for-loop, if-else, while-do, switch, etc.

Gotos are frowned because some people used it badly. Maybe they caused infinite loop or something. Maybe they forgot to free the allocated memory. Also it shouldn't be used when your high level language provides more explanatory keywords above. The reason is obviously, for maintainability and readibility purpose.

Re: There are many gotos, but these ones are mine

Amarnath S11-May-24 17:02

Amarnath S

11-May-24 17:02

One of my professors was the GOTO person for many of us students, in terms of resolving technical doubts and even giving career advice.

We need more such GOTOs, i feel.

Re: There are many gotos, but these ones are mine

trønderen11-May-24 17:40

trønderen

11-May-24 17:40

If I were given the responsibility for a state machine implementation like that, I would immediately run to my boss asking for permission to rewrite the whole thing as a table driven machine.

There is no way, with code like that, that I could guarantee that all inputs/events are properly handled in all cases (or given the proper error treatment). I would have to make a huge effort if I were to report a complete set of normal (non-error) ways to go from a given state to another, and which inputs/events would lead to which error states.

I've never written any CP article, but code like this makes my fingers itch to compose an article about proper table driven state machine implementation! Maybe I some day get around to do it Smile | :)

Religious freedom is the freedom to say that two plus two make five.

Re: There are many gotos, but these ones are mine

honey the codewitch11-May-24 18:01

honey the codewitch

11-May-24 18:01

Fair enough. I generated that code using Visual FA. It's slightly faster than the table driven variety, which Visual FA can also generate.

Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix

Re: There are many gotos, but these ones are mine

trønderen12-May-24 8:15

trønderen

12-May-24 8:15

Compilers will generate jump statements. I don't mind, as long as I don't have to trace them, and maintain the code at that level.

Other generators may generate source format goto statements. I don't mind, as long as I don't have to trace them, and maintain the code at that level.

If the gotos and labels you presented are created by a generator, and you never will have to trace them and maintain the code at that level, I do not consider them "your" gotos. Not any more than I consider the conditional and unconditional jumps generated from your source code to be "your" jmp instructions.

I'd be curious to also see your input to Visual FA to generate this code, as well as the table driven code generated by Visual FA!

If you have any reason at all to relate to the generated code: Trading readability and maintainability for "slightly faster code" is generally a bad move. Besides: From the snippet you presented, I am surprised that a table driven variety generated from the same input can be even "slightly" slower. I really wonder what that generator generates! (That is why I'd like to see it.)

I have never been using Visual FA, but from what I gather from a quick net search, it looks like you are not at all using SM as a development too. You are just generating code for different virtual machines, one with a state oriented execution model, one with a C/C++ oriented execution model. A comparison between them is like compiling some (arbitrary language) source code for x64 and for AArch64 and observing that the x64 is "slightly faster".

To me, the SM table is not the result delivered by a generator - it is a modeling tool for the human developer. The driver is typically a score code statements, independent of the model (a.k.a. transition table). I certainly agree that an editor tailored to transition table editing is a great thing to have. I have tried to maintain a compile time initialized C++ array for a transition table, using Np++. Even for trivial SMs, that is almost impossible (unless you just copy the table from e.g. a standard document and it will be carved in stone).

Religious freedom is the freedom to say that two plus two make five.

Re: There are many gotos, but these ones are mine

honey the codewitch12-May-24 8:28

honey the codewitch

12-May-24 8:28

trønderen wrote:
If you have any reason at all to relate to the generated code. Trading readability and maintainability for "slightly faster code" is generally a bad move.

. I generally agree with you. However, as we both know there are exceptions, which is why you used the word "generally" I'm sure. This is one of those cases, as lexing is always in a critical code path, and a generalized lexer must be able to handle bulk input as efficiently as possible.

My input to visual fa is one or more regular expressions. Literally just that. Here is the full input for that generated lexer, in my .rl Rolex lexer format, but it should be easy enough to discern the grammar below without knowing the format.

Object = "{"
ObjectEnd = "}"
Array = "["
ArrayEnd = "]"
FieldSeparator = ":"
Comma = ","
Number = '-?(?:0|[1-9][0-9]*)(?:\.[0-9]+)?(?:[eE][+-]?[0-9]+)?'
Boolean = 'true|false'
Null = "null"
String = '"([^\n"\\]|\\([btrnf"\\/]|(u[0-9A-Fa-f]{4})))*"'
WhiteSpace = '[ \t\r\n]+'

The table driven code is run on a flat array of integers. It might be more efficient to unflatten it in this case - maybe? I used to run a more complicated array of structs for this, and I don't remember there being a performance difference. But anyway, there is also an array of int arrays for a feature called block ends, which simulate lazy matching on a DFA. (I have the details of all of it documented in my Visual FA series). It's also simpler in operation than it looks. I do actually use gotos in a couple of places here to restart the state machine. It was much less complicated than orchestrating a while with breaks. I should state that I didn't comment the code here because it wouldn't help me. It may help others, but I didn't really care about that. This pattern is burned into my brain after writing more than half a dozen lexers that follow the same. It honestly would just clutter it for me, as the code makes immediate sense to me despite how it looks, and I didn't write it for a team.

private FAMatch _NextImpl(
#if FALIB_SPANS
	ReadOnlySpan<char> s
#else
	string s
#endif
	)
{
	int tlen;
	int tto;
	int prlen;
	int pmin;
	int pmax;
	int i;
	int j;
	int state = 0;
	int acc;
	if (position == -1)
	{
		// first read
		++position;
	}
	int len = 0;
	long cursor_pos = position;
	int line = this.line;
	int column = this.column;
	int ch = -1;
	Advance(s, ref ch, ref len, true);
	start_dfa:
	acc = _dfa[state];
	++state;
	tlen = _dfa[state];
	++state;
	for (i = 0; i < tlen; ++i)
	{
		tto = _dfa[state];
		++state;
		prlen = _dfa[state];
		++state;
		for (j = 0; j < prlen; ++j)
		{
			pmin = _dfa[state];
			++state;
			pmax = _dfa[state];
			++state;
			if (ch < pmin)
			{
				state += ((prlen - (j + 1)) * 2);
				j = prlen;
			}
			else if (ch <= pmax)
			{
				Advance(s, ref ch, ref len, false);
				state = tto;
				goto start_dfa;
			}
		}
	}
	if (acc != -1)
	{
		int sym = acc;
		int[] be = (_blockEnds != null && _blockEnds.Length > acc) ? _blockEnds[acc] : null;
		if (be != null)
		{
			state = 0;
			start_be_dfa:
			acc = be[state];
			++state;
			tlen = be[state];
			++state;
			for (i = 0; i < tlen; ++i)
			{
				tto = be[state];
				++state;
				prlen = be[state];
				++state;
				for (j = 0; j < prlen; ++j)
				{
					pmin = be[state];
					++state;
					pmax = be[state];
					++state;
					if (ch < pmin)
					{
						state += ((prlen - (j + 1)) * 2);
						j = prlen;
					}
					else if (ch <= pmax)
					{
						Advance(s, ref ch, ref len, false);
						state = tto;
						goto start_be_dfa;
					}
				}
			}
			if (acc != -1)
			{
				return FAMatch.Create(sym,
#if FALIB_SPANS
					s.Slice(unchecked((int)cursor_pos), len).ToString()
#else
					s.Substring(unchecked((int)cursor_pos), len)
#endif
					, cursor_pos, line, column);
			}
			if (ch == -1)
			{
				return FAMatch.Create(-1,
#if FALIB_SPANS
					s.Slice(unchecked((int)cursor_pos), len).ToString()
#else
					s.Substring(unchecked((int)cursor_pos), len)
#endif
					, cursor_pos, line, column);
			}
			Advance(s, ref ch, ref len, false);
			state = 0;
			goto start_be_dfa;
		}
		return FAMatch.Create(acc,
#if FALIB_SPANS
					s.Slice(unchecked((int)cursor_pos), len).ToString()
#else
					s.Substring(unchecked((int)cursor_pos), len)
#endif
			, cursor_pos, line, column);
	}
	// error. keep trying until we find a potential transition.
	while (ch != -1)
	{
		var moved = false;
		state = 1;
		tlen = _dfa[state];
		++state;
		for (i = 0; i < tlen; ++i)
		{
			++state;
			prlen = _dfa[state];
			++state;
			for (j = 0; j < prlen; ++j)
			{
				pmin = _dfa[state];
				++state;
				pmax = _dfa[state];
				++state;
				if (ch < pmin)
				{
					state += ((prlen - (j + 1)) * 2);
					j = prlen;
				}
				else if (ch <= pmax)
				{
					moved = true;
				}
			}
		}
		if (moved)
		{
			break;
		}
		Advance(s, ref ch, ref len, false);
	}
	if (len == 0)
	{
		return FAMatch.Create(-2, null, 0, 0, 0);
	}
	return FAMatch.Create(-1,
#if FALIB_SPANS
					s.Slice(unchecked((int)cursor_pos), len).ToString()
#else
					s.Substring(unchecked((int)cursor_pos), len)
#endif
		, cursor_pos, line, column);
}

Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix

Re: There are many gotos, but these ones are mine

honey the codewitch12-May-24 13:56

honey the codewitch

12-May-24 13:56

trønderen wrote:
To me, the SM table is not the result delivered by a generator - it is a modeling tool for the human developer. The driver is typically a score code statements, independent of the model (a.k.a. transition table). I certainly agree that an editor tailored to transition table editing is a great thing to have. I have tried to maintain a compile time initialized C++ array for a transition table, using Np++. Even for trivial SMs, that is almost impossible (unless you just copy the table from e.g. a standard document and it will be carved in stone).

I didn't address this part of your post. I should. I don't expect transition tables to be especially readable.

Visual FA is called "Visual" because it can produce images - directed graphs of the state machines that match 1:1 with the generated tables and code. For example, q0 in the graph corresponds to the q0: label in the goto table. It's perspicuous enough and yet concise enough that I've used the graphs as a guide to hand roll lexers when i needed the lexer to perform additional work beyond what a strict DFA could provide.

I've also used them to debug. Since the correlation is 1:1 between the graphs and the compiled code, it's easier to follow along with than the tables, but both can be followed with the graphs.

The graphs effectively become in part, documentation for the state machine, and for that they work pretty well.

Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix

Re: There are many gotos, but these ones are mine

den2k8812-May-24 23:08

den2k88

12-May-24 23:08

Generally you are right, usually the codewitch works on small embedded systems where performances and code footprint can be extremely stringent.

I found myself doing things I would have abhorred only a few scant years ago...

GCS/GE d--(d) s-/+ a C+++ U+++ P-- L+@ E-- W+++ N+ o+ K- w+++ O? M-- V? PS+ PE Y+ PGP t+ 5? X R+++ tv-- b+(+++) DI+++ D++ G e++ h--- r+++ y+++* Weapons extension: ma- k++ F+2 X

The shortest horror story: On Error Resume Next

Re: There are many gotos, but these ones are mine

trønderen13-May-24 0:17

trønderen

13-May-24 0:17

Table driven implementations usually reduces the control code to typically a couple hundred bytes (or even less), at the expense of data space for the table.

By using data elements no larger than required in the transition table entries, each entry can be kept to a very moderate size.

One possible issue is the number of states and events. It takes some experience to control both, to keep the table size (#states * #events) under control. A common trick is to introduce 'state variables'.
Some times you can use 2+ small tables rather than a huge one, e.g. if you implement a communication protocol: One table for the connect phase, one for the data transfer phase.

Many transition tables are rather sparse anyway, but a lot of methods for space efficient storage of sparse matrices are basic data structure knowledge. E.g. non-empty entries may be factored out to a packed, linear array, and the large table contains indexes to this array. Often, several transitions are identical (typically in one state, for different events, or for one event in several different states) - then a linear table need to hold only a single copy.

Certainly, really old embedded processors (such as 8051) had very little data space; expanding code space was far easier (e.g. through banking hardware). While we would usually call the transition table 'data', it is 100% read-only, and may very well be burnt in ROM (ok, call it 'flash' nowadays) together with the driver code.

If you consider CLR for an embedded CPU (don't try that on an 8051!), then you definitely can fit a packed transition table. My guess is that the total code+data size would be significantly smaller than an equivalent implementation with switch cases and/or if/else-sequences. And faster, even if a packed table will lead to a couple more indirections.

I will maintain that table driven state machines can be a very good solution for embedded processors.

Religious freedom is the freedom to say that two plus two make five.

Re: There are many gotos, but these ones are mine

giulicard13-May-24 6:59

giulicard

13-May-24 6:59

trønderen wrote:
If I were given the responsibility for a state machine implementation like that, I would immediately run to my boss asking for permission to rewrite the whole thing as a table driven machine.

... or as a state machine that returns function pointers instead of using tables and state variables:

C++

#include <stdio.h>
#include <conio.h>

// Fn ptrs defs
typedef  void (*RT)( int input );
typedef  RT (*TER)( int input );

// Forward declarations
extern TER state1( int input );
extern TER state2( int input );
extern TER state3( int input );

// First state
TER state1( int input )
{
    printf( "one\t" );
    return input < 10 ? (TER)&state2 : (TER)NULL;
}

// Second state
TER state2( int input )
{
	printf( "two\t" );
	return (TER)&state3;
}

// Third state
TER state3( int input )
{
    printf( "three\t" );
    return (TER)&state1;
}

int main(int argc, char* argv[])
{
    int n;

    // Set Start state
    TER state = (TER)&state1;

    // Exercises the state machine. Ends when state == NULL
    for ( n = 0 ; state ; ++n ) {
        // Executes the current state (state variable) then goes to the next state
        state = (TER)( state( n ) );
    }

    printf( "\n\nPress any key\n" );
    getch();

	return 0;
}

Type casts are useful because in C it's impossible to declare function pointers that return function pointers that return function pointers that return function pointers... Smile | :)

Regards

Re: There are many gotos, but these ones are mine

honey the codewitch13-May-24 9:21

honey the codewitch

13-May-24 9:21

I hate function pointer dispatch code in general.

Because at some point you'll have to debug and maintain it, and you end up with impossible to follow pointer arrays hiding the flow of your app.

Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix

Re: There are many gotos, but these ones are mine

giulicard13-May-24 10:03

giulicard

13-May-24 10:03

honey the codewitch wrote:
with impossible to follow pointer arrays

There are no pointer arrays in my code.

Re: There are many gotos, but these ones are mine

honey the codewitch13-May-24 10:04

honey the codewitch

13-May-24 10:04

Sorry, I was speaking generally about dispatch function pointers. Your statement just remind me of it. Sorry I wasn't clear. I just woke up when I wrote that. Smile | :)

Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix

Re: There are many gotos, but these ones are mine

giulicard13-May-24 10:07

giulicard

13-May-24 10:07

No problem. I'm not a native English speaker so I always fear being misunderstood.

Re: There are many gotos, but these ones are mine

trønderen14-May-24 5:10

trønderen

14-May-24 5:10

honey the codewitch wrote:
I hate function pointer dispatch code in general.

Do you refuse to use delegates at all, or don't you consider those to be function pointers? (In other words: Are function pointers OK as long as they are called delegates?)

No, when you have generated your code, you do not "at some time have to debug and maintain" the generated code. You debug and maintain your source, not the compilation result. Not even if you can, sort of, read it. Executable binaries can also be disassembled into "readable" code - the readability is no argument for random peek and poke.

You send your code through a generator/compiler, and want to patch up the complied result ("The compiled ones can be augmented in a way that the table driven ones cannot"), or complain about the instructions generated by the compiler - I haven't heard anyone saying any such thing in earnest for a decade or two. Some people still believe that they can do smarter heap management than the standard heap manager, rejecting automated garbage collection and smart pointers, but for the most part, compilers became smarter than human coders in the last millennium.

You will see a lot of function pointer dispatch code in the generated code from a plain C++ compiler. Do you hate that as well? If you accept it from a C++ compiler, why do you have problems accepting it from other compilers?

(The first C++ compiler I used didn't produce binary code - it was a machine independent compiler producing K&R C to be fed into a machine specific compiler. So we had full access to the C code for patching it up before passing it on to cc. We did not. I would not do it with any generated code, whether the compiler is called C++ or Visual FA.)

Religious freedom is the freedom to say that two plus two make five.

Re: There are many gotos, but these ones are mine

honey the codewitch14-May-24 6:21

honey the codewitch

14-May-24 6:21

I was going to respond, but I think I answered all this in the post you responded to

Because at some point you'll have to debug and maintain it, and you end up with impossible to follow pointer arrays hiding the flow of your app.

Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix

Re: There are many gotos, but these ones are mine

trønderen14-May-24 13:12

trønderen

14-May-24 13:12

The real issue is:

honey the codewitch wrote:
at some point you'll have to debug and maintain it

Does that apply to the code generated by your C, C++ or C# compiler as well?
When are you going to start trusting your tools to do at least as good a job as the one you are doing yourself?

I think: If you don't trust your tools to do a good enough job, throw them away and do the job yourself!

Religious freedom is the freedom to say that two plus two make five.

Re: There are many gotos, but these ones are mine

honey the codewitch14-May-24 13:14

honey the codewitch

14-May-24 13:14

It does not typically apply to generated code because the maintenance of that is moved to the generated code's input specification - in other words, whatever document or resource it uses to generate the code from. THAT is what needs to be maintained.

It does not apply to compiled code either, for exactly the same reason (the compiler being yet another code generator)

Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix

Re: There are many gotos, but these ones are mine

trønderen14-May-24 14:12

trønderen

14-May-24 14:12

But if the code is generated by Visual FA rather than cc, then you will do peek and poke on the generated code.

Well, that is choice. I think you are on the wrong track. In the 1980s, I worked in a company distributing OS patches as Poke instructions. I wouldn't condone that practice today.

Religious freedom is the freedom to say that two plus two make five.

Re: There are many gotos, but these ones are mine

honey the codewitch14-May-24 14:13

honey the codewitch

14-May-24 14:13

then you will do peek and poke on the generated code.

I will? That's news to me.

Hell, with VisualFA.SourceGenerator you don't even see the generated code. It's hidden by visual studio.

Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix

Re: There are many gotos, but these ones are mine

trønderen14-May-24 14:25

trønderen

14-May-24 14:25

I took that from "at some point you'll have to debug and maintain it", along with your general focus on gotos being "yours" when they are generated by Visual FA - you clearly relate to the generated Visual FA code as something you have a right to (modify).

If you really are as ignorant of the code generated by Visual FA as you are of the code generated by the gcc compiler, why do you post it here?

Religious freedom is the freedom to say that two plus two make five.

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.

Welcome to the Lounge