feature request: unicode in source

# Introduction

Some languages now support Unicode (mostly UTF8) for writing source code. It would be great if one could also use Unicode in Stan source. (Note that _comments_ in UTF8, or any superset that embeds ASCII, are already supported in the sense the parser just ignores them.)

Broadly, there are two possible levels of support:
1. in variable and function names (eg `ϕ`), and
2. in operators (eg `≤`), which provide synonyms for existing ones (eg `<=`)
# Example

This is how the [8 schools](https://github.com/stan-dev/example-models/blob/master/misc/eight_schools/eight_schools.stan) example would look like in unicode:

``` stan
data {
  int<lower=0> J;             // number of schools
  real y[J];                  // estimated treatment effect (school j)
  real<lower=0> σ[J];         // std err of effect estimate (school j)
}
parameters {
  real μ;
  real θ[J];
  real<lower=0> τ;
}
model {
  θ ~ normal(μ, τ); 
  y ~ normal(θ, σ);
}
```
# Possible benefits
1. more compact source code
2. better mapping to equations in papers
# Possible downsides
1. editor/entry support 
2. font support
3. possibly corrupted files

The first two are mitigated by the fact that ASCII is a subset of UTF8, so using the feature is optional.
# UTF8 support in various languages which have interfaces for Stan

| language | literals | identifiers | operators | would UTF8 variables work for interfacing with Stan? |
| --- | --- | --- | --- | --- |
| R | yes | yes | no | yes |
| Python | [yes](https://docs.python.org/2/howto/unicode.html#unicode-literals-in-python-source-code) | only from version 3 | no | yes, even in Python 2, as they are used as literal keys |
| Julia | yes | yes | yes | yes |
| Matlab | yes | yes, [but needs to be enabled](http://de.mathworks.com/matlabcentral/newsreader/view_thread/238995) | no | yes |
| Stata | yes | yes, [from version 14](http://blog.stata.com/tag/unicode/) | no | probably? |
# Editor support
## Emacs

See [this list](https://github.com/vspinu/math-symbol-lists) for various UTF8 implementations using autocomplete, company-mode, and quail.
# See also
- [discussion on stan-dev](https://groups.google.com/d/topic/stan-dev/fd76xNO8i20/discussion)
- [detailed description for Julia](http://docs.julialang.org/en/release-0.4/manual/variables/#allowed-variable-names)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feature request: unicode in source #1406

Introduction

Example

Possible benefits

Possible downsides

UTF8 support in various languages which have interfaces for Stan

Editor support

Emacs

See also

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

language	literals	identifiers	operators	would UTF8 variables work for interfacing with Stan?
R	yes	yes	no	yes
Python	yes	only from version 3	no	yes, even in Python 2, as they are used as literal keys
Julia	yes	yes	yes	yes
Matlab	yes	yes, but needs to be enabled	no	yes
Stata	yes	yes, from version 14	no	probably?

Uh oh!

feature request: unicode in source #1406

Description

Introduction

Example

Possible benefits

Possible downsides

UTF8 support in various languages which have interfaces for Stan

Editor support

Emacs

See also

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions