Curious. Are there examples of programming languages that allow spaces in identi...

pwg · on Feb 6, 2016

> Curious. Are there examples of programming languages that allow spaces in identifiers? Obviously, it would need to be designed for that.

At least Tcl allows spaces in identifiers, but to 'use' such identifiers one does have to add a bit of extra 'sugar' to prevent the code parser from interpreting the spaces as token separators:

Example of spaces in a variable name:

    $ rlwrap tclsh 
    % set "var name with spaces" "contents of the variable"
    contents of the variable
    % puts ${var name with spaces}
    contents of the variable
    % set "var name with spaces"
    contents of the variable

Example of spaces in a procedure (function) name (the first line defines the procedure):

    % proc {my space proc} {string} {puts "'my space proc' called with string='$string'"}

    % {my space proc} "hello how are you"
    'my space proc' called with string='hello how are you'
    % "my space proc" "the quick brown fox"
    'my space proc' called with string='the quick brown fox'
    % set pn "my space proc"
    my space proc
    % $pn "this that and the other"
    'my space proc' called with string='this that and the other'

So there you have at least one example.

jejones3141 · on Feb 6, 2016

Algol 68 allows spaces in identifiers. That means one has to use one of the "stropping" techniques to distinguish keywords from identifiers--case stropping (IF p THEN foo ELSE bar FI), quote stropping a la the old IBM Algol F Algol 60 compiler ('if' p 'then' foo 'else' bar 'fi'), and at least one other I don't remember offhand.
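To make the case-stropping idea concrete, here is a rough Python sketch (not how any real Algol 68 compiler does it). It assumes that all-upper-case words are keywords and that a run of adjacent lower-case words forms a single identifier; the `tokenize` function and the token kinds are made up for illustration.

```python
import re

def tokenize(source):
    """Split case-stropped source into (kind, text) pairs.

    Assumptions of this sketch: an all-upper-case word is a keyword, and a
    run of adjacent lower-case words is one identifier whose internal
    spaces are kept only for display.
    """
    tokens = []
    pending = []  # lower-case words collected for the current identifier

    def flush():
        # Emit the identifier collected so far, if any.
        if pending:
            tokens.append(("ident", " ".join(pending)))
            pending.clear()

    for word in re.findall(r"[A-Za-z][A-Za-z0-9]*|\d+|\S", source):
        if word[0].isalpha() and word.islower():
            pending.append(word)
        elif word[0].isalpha() and word.isupper():
            flush()
            tokens.append(("keyword", word))
        else:
            flush()
            tokens.append(("symbol", word))
    flush()
    return tokens

for token in tokenize("IF total count > limit THEN reset counter FI"):
    print(token)
```

Because keywords are marked by case, `total count` and `reset counter` can each be read as one name without any quoting.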

legulere · on Feb 6, 2016

Case-sensitivity makes sense because it enforces uniformity of code. Case insensitivity only matters when you want to write identifiers differently at different locations. The only place where I see this could be useful is when you use a library that uses a different convention.

IMO the better way to solve this is to set a convention for your programming language and enforce it with the compiler (at least with warnings).

xjay · on Feb 7, 2016

As you say, it's about conventions, but with a case-sensitive system you are more likely to HAVE TO enforce naming conventions, because there's a distinction now. Otherwise you'd write "MidiPort"/"MIDIPort"/"midiPort" or what have you.

Keep in mind we wouldn't need to care about enforcing case in a style guide, if case didn't matter, because there are no distinctions, and you'd be more inclined to write it the natural way; no camelCase rules needed to settle whether "MIDI Port" becomes midiPort, MidiPort, MIDIport, etc.

Case-sensitivity only creates unnecessary dissonance, and leads to clever uses of that system, adding even more choice; and as we know from The Matrix, the problem is choice. ;)

So if we keep it closer to how we would normally read and write words, I think there would be less dissonance about that aspect of programming, or naming files for that matter.

legulere · on Feb 7, 2016

> Keep in mind we wouldn't need to care about enforcing case in a style guide, if case didn't matter, because there are no distinctions

You would still need it to get uniform code. Spaces vs. tabs don't matter either, but the choice is still in almost all style guides.

Uncompetative · on Feb 7, 2016

I approve of case-sensitivity where an initial Capital letter indicates that Something is Publicly accessible and it really doesn't matter what happens after that. Hence, we could have:

  Newton-Raphson Runge-Kutta Fast-Fourier-Transform

  Class-ID IO-Channel MIDI-port 

  Freudian-Id Io-Channel

In contrast to this, I feel it makes sense to have lowercase mean that something is private. Hence, no camelCase:

  x y z variable longer-variable-name

I've never been that comfortable about appending numerals to the end of identifiers to disambiguate them as I feel that this is a sign that they ideally ought to be subscripted and implemented as arrays. I much prefer hyphens to underscores but would ideally like to use individual words separated by spaces. This can only work if you have an IDE that hides all the underscores (which are incredibly ugly and serve no useful purpose in printed material these days) as you input them and outputs NBSPs instead and then uses similarly suppressed prefix sigils to style your raw input text into an output which conforms to traditional Mathematical notation. Hence, we could have:

  /foo_bar + /bar_qux

become:

foo bar + bar qux

Similarly, the following is not a problem if you take advantage of the syntax rule that requires at least one space on either side of an operator. Hence, we could have:

  /foo_bar / /bar-qux

become:

foo bar / bar-qux

i.e. the / sign isn't echoed when you initially type it as it is expecting a letter, but when the IDE receives whitespace it belatedly echoes it as the operator symbol as it is now sure that it isn't a suppressed sigil.
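A minimal Python sketch of that display rule, just to show it is mechanical. It assumes a `/` immediately followed by a letter is a suppressed sigil and a `/` followed by whitespace is the division operator; the `render` function and the regex are hypothetical, not taken from any existing IDE.

```python
import re

NBSP = "\u00a0"  # non-breaking space shown in place of each underscore

# A '/' directly followed by a letter is treated as a suppressed sigil;
# a '/' followed by whitespace does not match and stays as the operator.
SIGILED_NAME = re.compile(r"/([A-Za-z][A-Za-z0-9_-]*)")

def render(raw):
    """Turn raw input such as '/foo_bar' into the display text 'foo bar'."""
    return SIGILED_NAME.sub(lambda m: m.group(1).replace("_", NBSP), raw)

print(render("/foo_bar + /bar_qux"))
print(render("/foo_bar / /bar-qux"))
```

The hyphen in `bar-qux` survives because only underscores are swapped for NBSPs, and the lone `/` survives because it is followed by a space.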

clouddrover · on Feb 7, 2016

> Case-sensitivity makes sense because it enforces uniformity of code.

I don't see that case-sensitivity helps to achieve uniformity of code that much. Factors like code structure, common design patterns, and source code formatting are more important. The approach to the structure and design of an application or library is something that each individual development group decides for themselves. Source code formatting can (and should) be enforced by formatting tools.

Having used a case-insensitive language for a while (Object Pascal), I find that developers tend to follow the case convention of a given software project anyway, and if they don't, the case typos aren't an issue. They don't make the code harder to understand and it all compiles.

im_down_w_otp · on Feb 7, 2016

Reading Erlang is really nice partially because all variables are capitalized (enforced by the compiler). You know immediately which parts of the code are what.

It actually drives me a little nuts that I can't do the same thing in Elixir (compiler enforced lowercase) because so much of the code looks the same.

GregBuchholz · on Feb 7, 2016

Common Lisp allows spaces in identifiers and symbol names, but you have to quote them with vertical bars:

    (let ((|This variable has spaces| 1))
      (print |This variable has spaces|))

6502nerdface · on Feb 7, 2016

Similarly, R (which is basically a Lisp with C-like syntax) allows spaces in identifiers, though you'll have to construct usages of such identifiers using quote() or backticks.

WalterBright · on Feb 6, 2016

Fortran was case-insensitive because it used early character encodings that had no lowercase letters, so there was no case to be sensitive to.

xjay · on Feb 7, 2016

Yes, I alluded to that by way of the sixth bit, I just had no reference to whether it was something they chose to do, or had no other choice at the time.

heimp · on Feb 7, 2016

I think Pogoscript, Argile and Zinc allow spaces.

http://pogoscript.org/guide/variables.html http://www.nongnu.org/argile/ http://tibleiz.net/zinc/identifiers.html

xjay · on Feb 7, 2016

Thanks for the suggestions!

I found some more tidbits in these Stack Exchange answers:

http://programmers.stackexchange.com/questions/145751/has-wh...

sordina · on Feb 6, 2016

Agda allows for "mixfix" function names with spaces. It's very interesting.

kevin_thibedeau · on Feb 7, 2016

VHDL allows arbitrary text in its extended identifiers. The feature was added for easier interop with other tools that have less restrictive rules than VHDL's normal identifiers.