MUMPS

From Academic Kids

Template:Dablink

MUMPS, or simply M, is a programming language dedicated to building and managing databases. Whereas in most systems the database is the first-class citizen and a language is added on top, under M this is inverted, the language itself is the primary object, the database a "side effect" of one of the features of the language.

M contrasts strongly with most database systems, because the system is much "lower level". For instance, whereas most database systems will include a command to find all the records matching a particular pattern, on an M system you would have to write a program to do this search and collect up the results. As you might imagine, this makes even trivial tasks much more difficult, and has led to a number of M-based programs to act as a database management system and provide these features.

For people used to traditional database applications or database management systems, M can be difficult to understand at first. This is offset by its speed and flexibility in dealing with tasks that would cause problems under the relational database model. M has been called the best-kept secret in the IT industry. Much of this secrecy seems self-imposed however: finding good introductory information on M is difficult, and the commercial side of the M market is fractured.

Contents

History

MUMPS started life as the Massachusetts General Hospital Utility Multi-Programming System, developed in Octo Barnett's animal lab at Massachusetts General Hospital (MGH) in Boston in 1966 & 67. The original MUMPS system was built on a spare DEC PDP-7.

Octo Barnett and Neil Pappalardo were also involved with MGH's plans for a Hospital Information System, obtained a PDP-9 and began using MUMPS in the admissions cycle and laboratory test reporting. MUMPS is an interpreted language, and incorporates a hierarchical database file system, to standardize interaction with the data. The origins of MUMPS can be traced from Rand Corporation JOSS through BBN's TELCOMP and STRINGCOMP. The MUMPS team deliberately chose to write the new language with portability in mind. Another feature not widely supported in operating systems of the era was multitasking, which was also built into MUMPS itself.

MUMPS was soon ported to a PDP-15 where it lived for some time. Developed on a government grant, MUMPS was required to be released in the public domain (no longer a requirement for grants), and was soon ported to a number of other systems including the popular PDP-8, the Data General Nova and the PDP-11. Word of MUMPS spread mostly through the medical community, and by the early 1970s was in widespread use, often being locally modified for their own needs.

In 1972 various MUMPS users gathered in order to standardize the now fractured language, creating the MUMPS Users Group and MUMPS Development Committee. These efforts proved successful; a standard was complete by 1974, and by 1977 they had turned it into an ANSI standard. The group was later responsible for the adoption of an alternative name in 1990, after repeatedly seeing people reject the product out of hand due to its name. Over a decade later most people still refer to the product as MUMPS however.

The Veterans Administration (today known as the United States Department of Veterans Affairs) officially adopted MUMPS as the programming language to be used to implement an integrated laboratory / pharmacy / patient admission, tracking and discharge system in the early 1980s. The original version, the Decentralized Hospital Computer Program (DHCP) was delivered early and under budget! DHCP has been continuously extended in the years since, and is available at no cost in source code under the Freedom of Information Act. In order to implement DHCP the VA also wrote an intermediate layer known as FileMan in MUMPS to act as a database management system.

Today, DHCP is known as Veterans Health Information Systems and Technology Architecture (VistA). The Hardhats.org (http://www.hardhats.org) website is the center for the international community of VistA developers and users and also serves something of the same function for MUMPS generally.

Nearly the entire VA hospital system in the United States and the Indian Health Service, as well as major parts of the Department of Defense hospital system (Composite Health Care System (CHCS), different than the VA's for historical reasons) all still run the system for clinical data tracking.

M also gained a following in the financial sector, in this case due to its much higher performance compared to traditional SQL based systems. Given similar hardware, multidimensional databases like M are typically about six times faster than SQL for transaction processing, making them ideal for online systems like banking. They also range from as-good to hundreds of times faster on queries, with more complex queries always favoring the multidimensional approach. They are also particularly good at looking up related information in other data stores "for free", a task that requires an expensive JOIN operation in SQL systems.

During the late 1970s a number of vendors sprung up to market M based platforms. The two largest vendors were Digital Equipment Corporation with their DSM (DEC Standard MUMPS) product, and InterSystems with their ISM (InterSystems M) product (M/11+ on the PDP platform). Some other vendors to enter this space included Greystone Technology with a compiled version, DataTree MUMPS (DTM) with an Intel PC based product, Micronetics MUMPS (MSM) with a Intel PC based product (and later ported to IBM's VM operating system), and M-Global with a Mac OS based product. DSM was somewhat a de-facto standard on DEC machines, and was later ported to their DEC Alpha-based systems running both VMS and Unix. InterSystems was a more cross-platform product, supporting a variety of UNIX flavors in addition to VMS.

Later, InterSystems became the dominant player in the market with the purchase of several of the other players. In 1990 DSM was purchased by InterSystems during the time that DEC was starting to dissolve and eventually become acquired by Compaq. DataTree and Micronetics were also eventually acquired by InterSystems later in the 1990s. InterSystems then began to consolidate these products into a single product line, releasing them on a number of platforms as OpenM (although nothing about it was "open" in the 'open source' sense). Since then InterSystems has increasingly distanced itself from its M history, referring to its product as Caché and removing any mention of M from their literature. With continuing development of object and relational tools on top of the somewhat "traditional" M heirachical database, Caché is probably not recognizable as "M" to programmers more familiar with the legacy M platforms.

General use of M appears to be slowly disappearing. This appears to be a result of the industry's failure to provide a clear and compelling message comparing M with traditional SQL systems. There appears to be no M for SQL Programmers type introductory information available on the Internet, and the small number of books on M are difficult to find. To be truly usable, M systems require a "higher level" layer to act as a database manager, and while M is now well standardized, there is no such standard for these higher levels, adding to the confusion.

A recent release of the industrial-quality GT.M (Greystone Technology, now part of Fidelity Investments) under the GPL may help address this to some degree, by providing a single, free, target for the M community. Several database management layers are available for GT.M, and with a little effort a single suggested platform could evolve.


Description

M is typically an interpreted language, and shares basic syntax with common 1960s data processing languages, most notable those such as COBOL, however Dr. Kevin O'Kane has produced a compiler for it as well. Commands are listed one to a line with whitespace being important, and grouped into procedures (subroutines) in a fashion similar to most structured programming systems. Procedures are simply strings, so they can be easily stored in the underlying datastore, meaning that there is no need for "stored procedures" as there is in SQL – anything can be stored.

A notable curiosity of MUMPS syntax is that spaces are significant: there are contexts in which a pair of spaces is interpreted differently from a single space. As a result, and unlike most programming languages, spaces cannot be inserted freely for readability, except within comments.

A typical M procedure consists of several "blocks", each block separated by a label (known as a tag in M-speak) in the first column. Calling into the procedure with no tag results in the entire procedure being run, whereas calling with a tag skips to that point. This allows programmers to place interactive commands at the top of the procedure and then tag the actual start of the code itself, allowing the procedure to be used both interactively and as a function to be called from other code (look at the GRASS article for an example of an alternate way another language deals with similar issues). It is not possible to define command line macros in M like GRASS, but it is possible to type any line of M code and execute it immediately, including subroutines calls.

One main difference between M and most other languages is that M has only a single data type, the string, which it invisibly converts into common data types such as numbers or dates. Automated conversion of this sort is common to many scripting programming languages, but is generally considered a bad thing for most languages because it can all too easily lead to mistakes that are difficult to debug. In the case of M this would be hard to avoid however, as the underlying datastore would grow in complexity if it had to deal with different types. As you might expect, M includes a complete and powerful set of string manipulation commands, grouped into libraries.

The key to the M system is that all variables are automatically multi-dimensional. For instance, this command:

SET A="abc"

creates the variable A and sets its value to the string. The same variable can then be used to hold additional information:

SET A(1,2)="def"

Will place the string def into "slot" (1,2). Slots can also be designated with strings:

SET A("first_name")="Bob"
SET A("last_name")="Dobbs"

making the variables useful data stores on their own. Note that this example also demonstrates another feature of M, that assignments into variables do not erase other information already there. This makes it easy to "build up" a complex variable, using several assignments.

M variables work in a similar fashion as with other programming languages, in that when the program exits, the value will be lost. M comes onto its own with its concept of globals, variables which are automatically and invisibly stored to the datastore. Globals appear as normal variables with the caret character in front of the name. Modifying the earlier example thus:

SET ^A("first_name")="Bob"
SET ^A("last_name")="Dobbs"

will result in a new record being created and inserted in the datastore.

One difference between M and the traditional SQL model of a database is the "level" of the commands in the language. M is a general purpose language with a datastore, whereas SQL is a language dedicated to database functions. This might sound like a minor distinction, but it is rather important to understand it. For instance, SQL includes a search function:

SELECT * FROM user WHERE first_name like 'Bob%'

returns a list of matching records. M has no equivalent "high level" command like SELECT, instead the programmer must construct a small routine in order to collect up the matching records returned from its lower-level functions. It should also be noted that M does not include any transaction controls, all changes to globals happen instantly, and there is no logging in the basic system. Nor does M include any sort of user-based security.

For all of these reasons one of the most common M programs is a database management system, providing all of the classic ACID properties on top of a generic M implementation. FileMan is one such example. In the 1990s many of these layers were adapted to supporting SQL, turning most M systems into SQL systems. Although the user might be "fooled" (to some degree) into seeing the system as a SQL database, the system nevertheless retains the speed advantages of the M datastore (see below).

A side effect of the way M evolved is that the M system includes fairly complete support for multi-tasking, multi-user, multi-machine programming. The former two features are now commonplace on most operating systems, but the later is still not cleanly supported by most systems. To demonstrate the ease of multi-machine support, consider:

SET ^|DENVER|A("first_name")="Bob"
SET ^|DENVER|A("last_name")="Dobbs"

which sets up A as before, but this time on the remote machine called "DENVER". M programs are thus trivial to distribute over many machines, a feature that is still difficult on most SQL systems. This support also made it easy to expose the same sorts of distribution in the SQL (and other) layers with ease, and it's not uncommon for M systems to be a better distributed SQL solution than a "real" SQL system.

Another use of M in more recent times has been to create object databases. By "flattening" objects into a string representation, like XML, M systems can be used to store objects. An M program then converts back and forth between the "real" objects and the string representations under it, and is able to do so much faster than similar object-relational mapping systems running over relational databases. This should be expected, if M can be used to make a SQL that's faster than SQL, making an object database that does't require conversion to and from SQL in the middle is bound to be even faster.

The MUMPS datastore

In the relational model, datastores consist of a number of tables, each one holding records for some particular object (or "entity"). For instance, an address book application would typically contain tables for PEOPLE, ADDRESSES and PHONE_NUMBERS. The tables consist of a number of fixed-width columns holding one basic piece of data (like "first_name"), and each record is a row.

In this example any row in ADDRESSES is "for" a particular row in PEOPLE. SQL does not understand the concept of "ownership" however, and requires the user to collect this information back up. SQL supports this through procedure, using the concept of a foreign key; copying some unique bit of data found in the PEOPLE table into the ADDRESS table.

To re-create a single "real" record for the address book, the user must instruct the database to collect up the row in PEOPLE they are interested in, extract the key, and then search the ADDRESSES and PHONE_NUMBERS tables for all the rows containing this key.

The SQL syntax for joining tables may seem simpler to some programmers however:

SELECT * from PEOPLE p, ADDRESSES a, PHONE_NUMBERS n
 WHERE p.id = a.person_id
   AND p.id = n.person_id
   AND p.first_name = "Bob"

In this example the WHERE looks for all the a's and n's (addresses and phone numbers) that have the person's ID tag stored inside them, but only for p's named Bob.

This trivial example already requires three lookups in different tables to return the data, which, as you might expect, is very slow. In order to improve performance, the database administrator will place an index on heavily-searched columns, in this example the person_id columns. An index consists of a column containing the data to be found, and the record number of the matching row in the table. This is the reason that tables are fixed width, so that the database can easily figure out the physical location of the record given the location of the start of the table, the length of any row, and the number of rows to skip. Without this simplification, performance of the relational model would be unusable.

M's datastore stores only the physical locations. This means that records can be of any length, placed anywhere, and contain anything. Searching is not needed to find any record, a pointer directly to that record is easily retrieved and followed to the data in question. The physical data in an M is typically stored in a "blob" of strings, one after the other. This provides another advantage over the relational model, as empty cells do not take up room as they do in the fixed-length relational table. M databases are therefore smaller than relational ones, which is another reason for their increased performance (less disk operations).

So why doesn't the relational model do the same thing? Historical accident. At the time the difference in speed between storage and processor was much smaller than it is today, and the cost of having the CPU follow a pointer was expensive compared to the simple arithmetic needed for an index. Today the CPU's have grown many times faster than the storage, so this cost is effectively zero. This is the main reason why multidimensional datastores outperform relational ones today, something that was not true in the 1970s when the two models were in competition.

M globals are, in fact, indexes. Each node in the global contains a pointer to the data, just as an index does in the relational model. Unlike the relational model, where indexes are a special-purpose object included as a necessary evil used in some lookups, under M indexes are first-class citizens that are used for all data access. This is yet another reason for M's performance.

This makes M systems particularly well suited to looking up related data, as in the example above. The equivalent M statement would be something more akin to:

SELECT * from PEOPLE p, ADDRESSES a, PHONE_NUMBERS n
 WHERE p.first_name = "Bob"

Related information can be stored directly in the index, in p.addresses for example. In this case no lookup is needed, PEOPLE can point directly to the addresses and phone numbers.

The biggest consequence of this internal representation is that database operations are economical (in both disk space and execution time). M is extremely well suited to real world data, which is often 'sparse' (ie has missing fields). There is no penalty in storage space if a defined data value is not present. This is an extremely helpful feature in a clinical context.

It cannot be overemphasized that real-world applications really do make heavy and unlimited use of these persistent, disk-resident "global" arrays, which can be hundreds of megabytes in size. This is a notable difference between M and other languages (e.g. Perl) which provide "dictionaries," "maps," or other string-indexed dynamic arrays which are RAM-resident and hence limited in capacity. The ability to access string-subscripted multi-dimensional arrays, and to freely add and delete new elements, is not a theoretical capability. It is how the language is intended to be used. Surviving M implementations are the result of heavy marketplace competition which was largely based on the ability of vendors to provide efficient and robust implementations of these "globals."

M includes almost no operating system specific command syntax, very few file system interface commands, and no machine specific commands. (Idiomatic M applications store data within disk-resident globals; the fact that these globals reside in operating system files is invisible to the application code). It is thus quite portable. Additionally, database manipulation code is extremely brief. A M routine implementing a complex database interaction might be a page or two of code. The equivalent in a less high level language (C, Pascal, Fortran, ...) is likely to be an order of magnitude larger. M is a highly cost effective application programming tool.

Summary of key language features

This incomplete, informal sketch seeks to give programmers familiar with other languages a feeling for what M is like. Neither the language description and the descriptions of each feature are complete, and many significant features have been omitted for brevity. These notes reflect the language circa 1994.

Data types: one universal datatype, interpreted/converted to string, integer, or floating-point number as context requires. Like Visual BASIC "variant" type.

Booleans: In IF statements and other conditionals, any nonzero value is treated as True. a<b yields 1 if a is less than b, 0 otherwise.

Declarations: NONE. Everything dynamically created on first reference.

Lines: important synactic entities. Multiple statements per line are idiomatic. Scope of IF and FOR is "remainder of current line."

Case sensitivity: Commands and intrinsic functions are case-insensitive. Variable names and labels are case-sensitive. No specified meaning for upper vs. lower-case and no widely accepted conventions. Percent sign (%) is legal as first character of variables and labels.

Postconditionals: SET:N<10 A="FOO" sets A to "FOO" if N is less than 10; DO:N>100 PRINTERR performs PRINTERR if N is greater than 100. Provides a conditional whose scope is less than the full line.

Arrays: created dynamically, stored sparsely as B-trees, any number of subscripts, subscripts can be strings or integers. Always automatically stored in sorted order. $ORDER and $QUERY functions allow traversal.

       for i=10000:1:12345 set sqtable(i)=i*i
       set address("Smith","Daniel")="dpbsmith@world.std.com"

Local arrays: names not beginning with caret; stored in process space; private to your process; expire when process terminates; available storage depends on partition size but is typically small (32K)

Global arrays: ^abc, ^def. Stored on disk, available to all processes, persist when process terminates. Very large globals (hundreds of megabytes) are practical and efficient. This is M's main "database" mechanism. Used instead of files for internal, machine-readable recordkeeping.

Indirection: in many contexts, @VBL can be used and effectively substitutes the contents of VBL into the statement. SET XYZ="ABC" SET @XYZ=123 sets the variable ABC to 123. SET SUBROU="REPORT" DO @SUBROU performs the subroutine named REPORT. Operational equivalent of "pointers" in other languages.

Piece function: Treats variables as broken into pieces by a separator. $PIECE(STRINGVAR,"^",3) means the "third caret-separated piece of STRINGVAR." Can appear as an assignment target. After

       SET X="dpbsmith@world.std.com" 

$PIECE("world.std.com",".",2) yields "std". SET $P(X,"@",1)="office" causes X to become "office@world.std.com".

Order function

       Set stuff(6)="xyz",stuff(10)=26,stuff(15)=""  

$Order(stuff("")) yields 6, $Order(stuff(6)) yields 10, $Order(stuff(8)) yields 10, $Order(stuff(10)) yields 15, $Order(stuff(15)) yields "".

       Set i="" For  Set i=$O(stuff(i)) Quit:i=""  Write !,i,?10,stuff(i)

The argumentless For iterates until stopped by the Quit. Prints a table of i and stuff(i) where i is successively 6, 10, and 15.

Commands: may be abbreviated to the bolded characters, usually one letter in length, case-insensitive, with one space after a command unless otherwise noted. These are examples of commands, not showing the full syntax and capabilities.

  BREAK                                     Signal Debugging of current process
  CLOSE IOdevice                            Finish using the IO channel specified by local variable IOdevice,
                                                  releasing it to be used by another job or process
  DO XYZ                                    call subroutine at label XYZ in local routine
  DO PQR(arg1,arg2,arg3)                    call with parameter passing in local routine
  DO XYZ^ROU                                call subroutine at label XYZ in external routine named ROU
  DO PQR^ROU(arg1,arg2,arg3)                call with parameter passing  in external routine named ROU
  ELSE  command1 command2 ...               conditionally execute rest of line, on failure of last IF or timed read (note two spaces following ELSE)
  FOR  command1 command2 ...                repeat until a QUIT breaks you out (note two spaces following FOR)
  FOR i=1:2:100 command1 command2 ...       counted iteration, from initial value i=1 until i is greater than 100 with step size of 2
  FOR i=1,5,9,"A" command1 command2 ...     listed iteration with i equal to each value in list
  GOTO entryreference                       unconditionally transfer control to a label in local or external routine
  HALT                                      stop processing of this job, closing all IO channels and killing all local variables, and relinquishing all locks (note two spaces following HALT)
  HANG seconds                              temporarily stop processing of this job for the duration specified
  IF cnd command1 command2 ...              conditionally execute rest of line 
  JOB TAG^ROU                               start execution in a separate process, with an empty list of variables
                                                  processsing will start at the specified tag and routine
  KILL variable                             return variable to "undefined" state, applies to local and global variables.
  LOCK namespace                            Establish ownership of a namespace to allow synchronization of multiple jobs or processes
  MERGE variable1=variable2                 Copy entire array from variable2 into existing elements of variable1
  NEW variable1,variable2...                stack old local variables, create fresh variables with these names.
                                                  fresh variables are in "undefined" state.
                                                  Fresh variables are lost and old variables are restored on QUIT.
  OPEN IOdevice                             Start using the IO channel specified by local variable IOdevice,
                                                  preventing it to from being used by another job or process
  QUIT                                      return from current subroutine
  QUIT value                                return from programmed function 
  READ "Prompt:",x                          on current I/O stream, first write
                                                  "Prompt:", then read line into variable x  
  SET a=22,NAME="Dan"                       store value in variable
  SET (c,d)=0                               parallel variable assignment to more than one variable
  TCOMMIT                                   finalize current database transaction  (note two spaces after TCOMMIT)
  TRESTART                                  restart the current database transaction (note two spaces after TSTART)
  TROLLBACK                                 retry the current database transaction, rolling back all changes
  TSTART transid                            start a database transaction with a particular rollback identifier
  USE IOdevice                              switch current I/O stream to device identified by local variable IOdevice
  WRITE !,"x=",x                            output to current I/O stream. ! = new line
  XECUTE "set a=5 do xyz"                   execute arbitrary string data as M code

Operators: No precedence, executed left to right, parenthesize as desired. 2+3*10 yields 50.

  +  -  *  /                  sum, difference, product, quotient
  \                           integer division, 1234.9\10 yields 123
  #                           modulo 1.5#1 yields .5
  _                           concatenation, "nice"_2_"use" → "nice2use"
  & ! '                       logical and, logical or, logical not
  < >                         numeric less than, numeric greater than
  =                           string equality
  [                           string contains substring.  "ABCD"["BC" → 1 (true)
  ]                           string lexically follows. "Z"]"A" → 1 (true)
  ]]                          string sorts after. "1"]]"A"→ 0 (false)
  ?                           pattern match operator, (regular expression string matching)

Intrinsic (built-in) functions:

Important structural components of the language (not commonly found in other languages):

  $DATA(V)                    tests if a V is defined (has data) or not, and if V is an array or not.
  $GET(variable)              tests value of variable, and if not defined, returns empty string
  $ORDER, $QUERY              traverse arrays in sorted order
  $ORDER(a("abc"))            value v is the next subscript, following
                                    "abc", according to the M collating sequence,
                                    such that a(v) is defined.                   
  $PIECE(string,delimiter,position) see above
  $SELECT(c1:v1,c2:v2,1:v3)   if c1 is true yields v1, else if c2 is true
                              yields v2, otherwise yields v3          
  $STACK(stacklevel)          return information in deeper stack levels
  $TEXT(FOO+3^PGM)            returns text of source code at line FOO+3 in routine PGM

Convenience functions similar to library functions in other languages:

  $ASCII, $CHAR               text-to-ASCII-code and inverse
  $EXTRACT(string,5,10)       characters 5 through 10 of string; may be
                              assignment target
  $FIND(string,find,from)     substring search
  $FNUMBER()                  floating point formatting including numeric commas
  $JUSTIFY()                  general string formatting with limited numeric 
  $LENGTH(string)             returns a count of the number of bytes in string
  $RANDOM(100)                random number in range 0 to 99 inclusive
  $REVERSE(string)            reverse the characters in the string
  $TRANSLATE("abcd","ab","AB") character substitution; yields "ABcd"

Intrinsic (built-in) variables: Intrinic variables

  $DEVICE                     status variable for current device
  $ESTACK, $ECODE, $ETRAP     error handling variables
  $HOROLOG                    gives current date and time as day offsets from January 1, 1841 and seconds since midnight.
  $IO, $PRINCIPAL             the current and first I/O device for a job or process
  $JOB                        the unique identifier for the current job or process
  $KEY                        auxillary status information from previous READ command.
  $QUIT                       whether current subroutine was invoked as a function or subroutine.
  $STACK                      current depth of subroutine stack
  $STORAGE                    current amount of memory available to current job or process
  $SYSTEM                     the unique identifier of current system or domain of concurrent processes
  $TEST                       success and failure flag for most recent timeout or IF condition evaluation.
  $TLEVEL                     current database transaction depth
  $TRESTART                   count of restarts that have occurred on current database transaction.
  $X, $Y                      current Column and line number on current device

Relationships between MUMPS, Caché, MIIS, MAGIC, InterSystems and Meditech

Early development of MUMPS is credited to A. Neil Pappalardo and Octo Barnett of Massachusetts General Hospital.

MUMPS evolution took two major directions: MUMPS proper and MIIS. MUMPS became an ANSI and ISO-standard language, and has been available for nearly two decades from a number of vendors. Gradually, one vendor, InterSystems, came to dominate. The result is that as of 2004 commercial MUMPS is today primarily hidden as the foundation of the InterSystems-proprietary system, Caché. Caché is available for multiple operating systems, including OpenVMS, Unix, Sun Solaris, Linux, Microsoft Windows, Unix, and Mac OS X under several licenses, including no-cost non-commercial use. InterSystems does not sell applications itself, but supplies Caché and other development tools to companies which use them to build applications.

Independent of InterSystems is the GT.M implementation of MUMPS. It is supported commercially for many platforms such as OpenVMS, Sun Solaris, Microsoft Windows, Unix, and Mac OS X with the Linux version for x86 processors open source.

Another line of development was led by Pappalardo, founder of Meditech, a successful vendor of medical applications and complete medical data processing systems. Within Meditech, an early version of MUMPS evolved into a language named MIIS (Meditech Interactive Information System), and a successor named MAGIC (not to be confused with a different PC database product of the same name). In contrast to InterSystems business strategy, MIIS and MAGIC are used only within Meditech and are not marketed to outside application developers.

"MUMPS" vs. "M"

While of little interest to outsiders, this topic was and is contentious within the MUMPS/M community.

All of the following opinions can and have been supported by knowledgeable people at various times:

  • The name became M in 1993 when the M Technology Association adopted it.
  • The name became M on December 8th, 1995 with the approval of ANSI X11.1-1995
  • Both M and MUMPS are officially accepted names.
  • M is only an "alternate name" or "nickname" for the language, and MUMPS is still the official name.

Some of the contention arose in response to strong advocacy for the name M on the part of one particular commercial interest, InterSystems, whose CEO disliked the name MUMPS and felt that it represented a serious marketing obstacle for InterSystems. Thus advocacy for the name M to some extent became identified as alignment with InterSystems. The dispute also reflected rivalry between organizations (the M Technology Association, the MUMPS development committee, the ANSI and ISO standards committee) as to who determines the "official" name of the language. Some writers have attempted to defuse the issue by referring to the language as M[UMPS], square brackets being the customary notation for optional syntax elements.

Since the M Technology Association dissolved in 2002 and the standards were administratively withdrawn, it can be argued that as of 2004 the language has no official name.

The December 31, 1840 epoch

In M, the current date and time is contained in a special system variable, $H (for "HOROLOG"). The format is a pair of integers separated by a comma, e.g. "54321,12345" The first number is the number of days since December 31st, 1840, i.e. day number 1 is January 1st, 1841; the second is the number of seconds since midnight.

The reason for this choice of epoch is a bit of MUMPS trivia that illustrates the degree to which the design of MUMPS was driven by practical considerations. James M. Poitras has written that he chose this epoch for the date and time routines in a package developed by his group at MGH in 1969: "I remembered reading of the oldest (one of the oldest?) U.S. citizen, a Civil War veteran, who was 121 years old at the time. Since I wanted to be able to represent dates in a Julian-type form so that age could be easily calculated and to be able to represent any birth date in the numeric range selected, I decided that a starting date in the early 1840s would be 'safe.' Since my algorithm worked most logically when every fourth year was a leap year, the first year was taken as 1841. The zero point was then December 31, 1840.... I wasn't party to the MDC negotiations, but I did explain the logic of my choice to members of the Committee."

(More colorful versions have circulated in the folklore, suggesting, for example, that December 30th, 1840 was the exact date of the first entry in the MGH records, but these appear to be legends).

For those who care about such things, $HOROLOG hit 60000 on April 10th, 2005; will be 70000 on August 26th, 2032; 80000 on January 12th, 2060; 90000 on May 30th, 2087; and 100000 on October 16th, 2114.

References

  • Walters, Richard (1997). M Programming: A Comprehensive Guide. Digital Press. ISBN 1555581676.

External links

Template:Major programming languages smallcs:MUMPS da:MUMPS es:MUMPS eo:MUMPS Komputillingvo fr:MUMPS ko:MUMPS it:MUMPS he:MUMPS lt:MUMPS hu:MUMPS programozási nyelv nl:MUMPS ja:MUMPS pl:MUMPS pt:MUMPS sv:MUMPS tr:MUMPS zh:MUMPS

Navigation

Academic Kids Menu

  • Art and Cultures
    • Art (http://www.academickids.com/encyclopedia/index.php/Art)
    • Architecture (http://www.academickids.com/encyclopedia/index.php/Architecture)
    • Cultures (http://www.academickids.com/encyclopedia/index.php/Cultures)
    • Music (http://www.academickids.com/encyclopedia/index.php/Music)
    • Musical Instruments (http://academickids.com/encyclopedia/index.php/List_of_musical_instruments)
  • Biographies (http://www.academickids.com/encyclopedia/index.php/Biographies)
  • Clipart (http://www.academickids.com/encyclopedia/index.php/Clipart)
  • Geography (http://www.academickids.com/encyclopedia/index.php/Geography)
    • Countries of the World (http://www.academickids.com/encyclopedia/index.php/Countries)
    • Maps (http://www.academickids.com/encyclopedia/index.php/Maps)
    • Flags (http://www.academickids.com/encyclopedia/index.php/Flags)
    • Continents (http://www.academickids.com/encyclopedia/index.php/Continents)
  • History (http://www.academickids.com/encyclopedia/index.php/History)
    • Ancient Civilizations (http://www.academickids.com/encyclopedia/index.php/Ancient_Civilizations)
    • Industrial Revolution (http://www.academickids.com/encyclopedia/index.php/Industrial_Revolution)
    • Middle Ages (http://www.academickids.com/encyclopedia/index.php/Middle_Ages)
    • Prehistory (http://www.academickids.com/encyclopedia/index.php/Prehistory)
    • Renaissance (http://www.academickids.com/encyclopedia/index.php/Renaissance)
    • Timelines (http://www.academickids.com/encyclopedia/index.php/Timelines)
    • United States (http://www.academickids.com/encyclopedia/index.php/United_States)
    • Wars (http://www.academickids.com/encyclopedia/index.php/Wars)
    • World History (http://www.academickids.com/encyclopedia/index.php/History_of_the_world)
  • Human Body (http://www.academickids.com/encyclopedia/index.php/Human_Body)
  • Mathematics (http://www.academickids.com/encyclopedia/index.php/Mathematics)
  • Reference (http://www.academickids.com/encyclopedia/index.php/Reference)
  • Science (http://www.academickids.com/encyclopedia/index.php/Science)
    • Animals (http://www.academickids.com/encyclopedia/index.php/Animals)
    • Aviation (http://www.academickids.com/encyclopedia/index.php/Aviation)
    • Dinosaurs (http://www.academickids.com/encyclopedia/index.php/Dinosaurs)
    • Earth (http://www.academickids.com/encyclopedia/index.php/Earth)
    • Inventions (http://www.academickids.com/encyclopedia/index.php/Inventions)
    • Physical Science (http://www.academickids.com/encyclopedia/index.php/Physical_Science)
    • Plants (http://www.academickids.com/encyclopedia/index.php/Plants)
    • Scientists (http://www.academickids.com/encyclopedia/index.php/Scientists)
  • Social Studies (http://www.academickids.com/encyclopedia/index.php/Social_Studies)
    • Anthropology (http://www.academickids.com/encyclopedia/index.php/Anthropology)
    • Economics (http://www.academickids.com/encyclopedia/index.php/Economics)
    • Government (http://www.academickids.com/encyclopedia/index.php/Government)
    • Religion (http://www.academickids.com/encyclopedia/index.php/Religion)
    • Holidays (http://www.academickids.com/encyclopedia/index.php/Holidays)
  • Space and Astronomy
    • Solar System (http://www.academickids.com/encyclopedia/index.php/Solar_System)
    • Planets (http://www.academickids.com/encyclopedia/index.php/Planets)
  • Sports (http://www.academickids.com/encyclopedia/index.php/Sports)
  • Timelines (http://www.academickids.com/encyclopedia/index.php/Timelines)
  • Weather (http://www.academickids.com/encyclopedia/index.php/Weather)
  • US States (http://www.academickids.com/encyclopedia/index.php/US_States)

Information

  • Home Page (http://academickids.com/encyclopedia/index.php)
  • Contact Us (http://www.academickids.com/encyclopedia/index.php/Contactus)

  • Clip Art (http://classroomclipart.com)
Toolbox
Personal tools