coursera · Nov 26, 2014
.gitignore (+7)
LICENSE (+13)
MANIFEST.in (+1)
README.md (-2)
README.rst (+105)
docs/Makefile (+153)
docs/conf.py (+261)
docs/index.rst (+93)
dplyr-comparison.html (+1,209)
ply/__init__.py (+2)
ply/methods.py (+206)
ply/symbolic.py (+202)
setup.py (+15)
tests/test_all.sh (+3)
tests/test_methods.py (+196)
tests/test_symbolic.py (+248)
@@ -0,0 +1,7 @@
+*.pyc
+*.egg-info
+*.egg
+/MANIFEST
+/dist/
+/docs/_build
+/build/
@@ -0,0 +1,13 @@
+Copyright 2014 Coursera Inc.
+
+Licensed under the Apache License, Version 2.0 (the "License");
+you may not use this file except in compliance with the License.
+You may obtain a copy of the License at
+
+    http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
@@ -0,0 +1 @@
+include README.rst LICENSE
@@ -0,0 +1,105 @@
+**pandas-ply**: functional data manipulation for pandas
+=======================================================
+
+**pandas-ply** is a thin layer which makes it easier to manipulate data with `pandas <http://pandas.pydata.org/>`_. In particular, it provides elegant, functional, chainable syntax in cases where **pandas** would require mutation, saved intermediate values, or other awkward constructions. In this way, it aims to move **pandas** closer to the "grammar of data manipulation" provided by the `dplyr <http://cran.r-project.org/web/packages/dplyr/index.html>`_ package for R.
+
+For example, take the **dplyr** code below:
+
+.. code:: r
+
+  flights %>%
+    group_by(year, month, day) %>%
+    summarise(
+      arr = mean(arr_delay, na.rm = TRUE),
+      dep = mean(dep_delay, na.rm = TRUE)
+    ) %>%
+    filter(arr > 30 & dep > 30)
+
+The most common way to express this in **pandas** is probably:
+
+.. code:: python
+
+  grouped_flights = flights.groupby(['year', 'month', 'day'])
+  output = pd.DataFrame()
+  output['arr'] = grouped_flights.arr_delay.mean()
+  output['dep'] = grouped_flights.dep_delay.mean()
+  filtered_output = output[(output.arr > 30) & (output.dep > 30)]
+
+**pandas-ply** lets you instead write:
+
+.. code:: python
+
+  (flights
+    .groupby(['year', 'month', 'day'])
+    .ply_select(
+      arr = X.arr_delay.mean(),
+      dep = X.dep_delay.mean())
+    .ply_where(X.arr > 30, X.dep > 30))
+
+In our opinion, this **pandas-ply** code is cleaner, more expressive, more readable, more concise, and less error-prone than the original **pandas** code.
+
+Explanatory notes on the **pandas-ply** code sample above:
+
+* **pandas-ply**'s methods (like ``ply_select`` and ``ply_where`` above) are attached directly to **pandas** objects and can be used immediately, without any wrapping or redirection. They start with a ``ply_`` prefix to distinguish them from built-in **pandas** methods.
+* **pandas-ply**'s methods are named for (and modelled after) SQL's operators. (But keep in mind that these operators will not always appear in the same order as they do in a SQL statement: ``SELECT a FROM b WHERE c GROUP BY d`` probably maps to ``b.ply_where(c).groupby(d).ply_select(a)``.)
+* **pandas-ply** includes a simple system for building "symbolic expressions" to provide as arguments to its methods. ``X`` above is an instance of ``ply.symbolic.Symbol``. Operations on this symbol produce larger compound symbolic expressions. When **pandas-ply** receives a symbolic expression as an argument, it converts it into a function. So, for instance, ``X.arr > 30`` in the above code could have instead been provided as ``lambda x: x.arr > 30``. Use of symbolic expressions allows the ``lambda x:`` to be left off, resulting in less cluttered code.
+
+Warning
+-------
+
+**pandas-ply** is new, and in an experimental stage of its development. The API is not yet stable. Expect the unexpected.
+
+(Pull requests are welcome. Feel free to contact us at pandas-ply@coursera.org.)
+
+Using **pandas-ply**
+--------------------
+
+Install **pandas-ply** with:
+
+::
+
+  $ pip install pandas-ply
+
+
+Typical use of **pandas-ply** starts with:
+
+.. code:: python
+
+  import pandas as pd
+  from ply import install_ply, X, sym_call
+
+  install_ply(pd)
+
+After calling ``install_ply``, all **pandas** objects have **pandas-ply**'s methods attached.
+
+API reference
+-------------
+
+Full API reference is available at `<http://pythonhosted.org/pandas-ply/>`_.
+
+Possible TODOs
+--------------
+
+* Extend ``pandas``' native ``groupby`` to support symbolic expressions?
+* Extend ``pandas``' native ``apply`` to support symbolic expressions?
+* Add ``.ply_call`` to ``pandas`` objects to extend chainability?
+* Version of ``ply_select`` which supports later computed columns relying on earlier computed columns?
+* Version of ``ply_select`` which supports careful column ordering?
+* Better handling of indices?
+
+License
+-------
+
+Copyright 2014 Coursera Inc.
+
+Licensed under the Apache License, Version 2.0 (the "License");
+you may not use this file except in compliance with the License.
+You may obtain a copy of the License at
+
+    http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
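
Review note on the README hunk above: it states that a symbolic expression such as ``X.arr > 30`` is converted into a function equivalent to ``lambda x: x.arr > 30``. The following is a minimal sketch of that idea, written for Python 3 with only the standard library. The names `Expr`, `to_function`, and `row` are hypothetical and invented for this note; they are not the classes in ply/symbolic.py, and the real implementation may work differently.

```python
from types import SimpleNamespace


class Expr:
    """Hypothetical deferred expression; NOT pandas-ply's actual class."""

    def __init__(self, evaluate, text):
        self._evaluate = evaluate  # function: context value -> result
        self._text = text          # readable form, used by repr()

    def __getattr__(self, name):
        # Only reached for attributes that are not set on the instance.
        if name.startswith('_'):
            raise AttributeError(name)
        return Expr(lambda ctx: getattr(self._evaluate(ctx), name),
                    '%s.%s' % (self._text, name))

    def __gt__(self, other):
        # Record the comparison instead of performing it right away.
        return Expr(lambda ctx: self._evaluate(ctx) > other,
                    '%s > %r' % (self._text, other))

    def __repr__(self):
        return self._text


def to_function(expr_or_callable):
    """Turn a deferred expression into a plain function; pass callables through."""
    if isinstance(expr_or_callable, Expr):
        return expr_or_callable._evaluate
    return expr_or_callable


X = Expr(lambda ctx: ctx, 'X')

# Made-up sample record standing in for a dataframe.
row = SimpleNamespace(arr=45, dep=10)

symbolic_check = to_function(X.arr > 30)
lambda_check = to_function(lambda x: x.arr > 30)
print(repr(X.arr > 30), symbolic_check(row), lambda_check(row))
```

Both forms give the same answer for the same input; the symbolic form only saves writing ``lambda x:``.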
@@ -0,0 +1,153 @@
+# Makefile for Sphinx documentation
+#
+
+# You can set these variables from the command line.
+SPHINXOPTS    =
+SPHINXBUILD   = sphinx-build
+PAPER         =
+BUILDDIR      = _build
+
+# Internal variables.
+PAPEROPT_a4     = -D latex_paper_size=a4
+PAPEROPT_letter = -D latex_paper_size=letter
+ALLSPHINXOPTS   = -d $(BUILDDIR)/doctrees $(PAPEROPT_$(PAPER)) $(SPHINXOPTS) .
+# the i18n builder cannot share the environment and doctrees with the others
+I18NSPHINXOPTS  = $(PAPEROPT_$(PAPER)) $(SPHINXOPTS) .
+
+.PHONY: help clean html dirhtml singlehtml pickle json htmlhelp qthelp devhelp epub latex latexpdf text man changes linkcheck doctest gettext
+
+help:
+	@echo "Please use \`make <target>' where <target> is one of"
+	@echo "  html       to make standalone HTML files"
+	@echo "  dirhtml    to make HTML files named index.html in directories"
+	@echo "  singlehtml to make a single large HTML file"
+	@echo "  pickle     to make pickle files"
+	@echo "  json       to make JSON files"
+	@echo "  htmlhelp   to make HTML files and a HTML help project"
+	@echo "  qthelp     to make HTML files and a qthelp project"
+	@echo "  devhelp    to make HTML files and a Devhelp project"
+	@echo "  epub       to make an epub"
+	@echo "  latex      to make LaTeX files, you can set PAPER=a4 or PAPER=letter"
+	@echo "  latexpdf   to make LaTeX files and run them through pdflatex"
+	@echo "  text       to make text files"
+	@echo "  man        to make manual pages"
+	@echo "  texinfo    to make Texinfo files"
+	@echo "  info       to make Texinfo files and run them through makeinfo"
+	@echo "  gettext    to make PO message catalogs"
+	@echo "  changes    to make an overview of all changed/added/deprecated items"
+	@echo "  linkcheck  to check all external links for integrity"
+	@echo "  doctest    to run all doctests embedded in the documentation (if enabled)"
+
+clean:
+	-rm -rf $(BUILDDIR)/*
+
+html:
+	$(SPHINXBUILD) -b html $(ALLSPHINXOPTS) $(BUILDDIR)/html
+	@echo
+	@echo "Build finished. The HTML pages are in $(BUILDDIR)/html."
+
+dirhtml:
+	$(SPHINXBUILD) -b dirhtml $(ALLSPHINXOPTS) $(BUILDDIR)/dirhtml
+	@echo
+	@echo "Build finished. The HTML pages are in $(BUILDDIR)/dirhtml."
+
+singlehtml:
+	$(SPHINXBUILD) -b singlehtml $(ALLSPHINXOPTS) $(BUILDDIR)/singlehtml
+	@echo
+	@echo "Build finished. The HTML page is in $(BUILDDIR)/singlehtml."
+
+pickle:
+	$(SPHINXBUILD) -b pickle $(ALLSPHINXOPTS) $(BUILDDIR)/pickle
+	@echo
+	@echo "Build finished; now you can process the pickle files."
+
+json:
+	$(SPHINXBUILD) -b json $(ALLSPHINXOPTS) $(BUILDDIR)/json
+	@echo
+	@echo "Build finished; now you can process the JSON files."
+
+htmlhelp:
+	$(SPHINXBUILD) -b htmlhelp $(ALLSPHINXOPTS) $(BUILDDIR)/htmlhelp
+	@echo
+	@echo "Build finished; now you can run HTML Help Workshop with the" \
+	      ".hhp project file in $(BUILDDIR)/htmlhelp."
+
+qthelp:
+	$(SPHINXBUILD) -b qthelp $(ALLSPHINXOPTS) $(BUILDDIR)/qthelp
+	@echo
+	@echo "Build finished; now you can run "qcollectiongenerator" with the" \
+	      ".qhcp project file in $(BUILDDIR)/qthelp, like this:"
+	@echo "# qcollectiongenerator $(BUILDDIR)/qthelp/pandas-ply.qhcp"
+	@echo "To view the help file:"
+	@echo "# assistant -collectionFile $(BUILDDIR)/qthelp/pandas-ply.qhc"
+
+devhelp:
+	$(SPHINXBUILD) -b devhelp $(ALLSPHINXOPTS) $(BUILDDIR)/devhelp
+	@echo
+	@echo "Build finished."
+	@echo "To view the help file:"
+	@echo "# mkdir -p $$HOME/.local/share/devhelp/pandas-ply"
+	@echo "# ln -s $(BUILDDIR)/devhelp $$HOME/.local/share/devhelp/pandas-ply"
+	@echo "# devhelp"
+
+epub:
+	$(SPHINXBUILD) -b epub $(ALLSPHINXOPTS) $(BUILDDIR)/epub
+	@echo
+	@echo "Build finished. The epub file is in $(BUILDDIR)/epub."
+
+latex:
+	$(SPHINXBUILD) -b latex $(ALLSPHINXOPTS) $(BUILDDIR)/latex
+	@echo
+	@echo "Build finished; the LaTeX files are in $(BUILDDIR)/latex."
+	@echo "Run \`make' in that directory to run these through (pdf)latex" \
+	      "(use \`make latexpdf' here to do that automatically)."
+
+latexpdf:
+	$(SPHINXBUILD) -b latex $(ALLSPHINXOPTS) $(BUILDDIR)/latex
+	@echo "Running LaTeX files through pdflatex..."
+	$(MAKE) -C $(BUILDDIR)/latex all-pdf
+	@echo "pdflatex finished; the PDF files are in $(BUILDDIR)/latex."
+
+text:
+	$(SPHINXBUILD) -b text $(ALLSPHINXOPTS) $(BUILDDIR)/text
+	@echo
+	@echo "Build finished. The text files are in $(BUILDDIR)/text."
+
+man:
+	$(SPHINXBUILD) -b man $(ALLSPHINXOPTS) $(BUILDDIR)/man
+	@echo
+	@echo "Build finished. The manual pages are in $(BUILDDIR)/man."
+
+texinfo:
+	$(SPHINXBUILD) -b texinfo $(ALLSPHINXOPTS) $(BUILDDIR)/texinfo
+	@echo
+	@echo "Build finished. The Texinfo files are in $(BUILDDIR)/texinfo."
+	@echo "Run \`make' in that directory to run these through makeinfo" \
+	      "(use \`make info' here to do that automatically)."
+
+info:
+	$(SPHINXBUILD) -b texinfo $(ALLSPHINXOPTS) $(BUILDDIR)/texinfo
+	@echo "Running Texinfo files through makeinfo..."
+	$(MAKE) -C $(BUILDDIR)/texinfo info
+	@echo "makeinfo finished; the Info files are in $(BUILDDIR)/texinfo."
+
+gettext:
+	$(SPHINXBUILD) -b gettext $(I18NSPHINXOPTS) $(BUILDDIR)/locale
+	@echo
+	@echo "Build finished. The message catalogs are in $(BUILDDIR)/locale."
+
+changes:
+	$(SPHINXBUILD) -b changes $(ALLSPHINXOPTS) $(BUILDDIR)/changes
+	@echo
+	@echo "The overview file is in $(BUILDDIR)/changes."
+
+linkcheck:
+	$(SPHINXBUILD) -b linkcheck $(ALLSPHINXOPTS) $(BUILDDIR)/linkcheck
+	@echo
+	@echo "Link check complete; look for any errors in the above output " \
+	      "or in $(BUILDDIR)/linkcheck/output.txt."
+
+doctest:
+	$(SPHINXBUILD) -b doctest $(ALLSPHINXOPTS) $(BUILDDIR)/doctest
+	@echo "Testing of doctests in the sources finished, look at the " \
+	      "results in $(BUILDDIR)/doctest/output.txt."
@@ -0,0 +1,261 @@
+# -*- coding: utf-8 -*-
+#
+# pandas-ply documentation build configuration file, created by
+# sphinx-quickstart on Tue Nov 18 19:40:12 2014.
+#
+# This file is execfile()d with the current directory set to its containing dir.
+#
+# Note that not all possible configuration values are present in this
+# autogenerated file.
+#
+# All configuration values have a default; values that are commented out
+# serve to show the default.
+
+import sys, os
+import sphinx_rtd_theme
+
+# If extensions (or modules to document with autodoc) are in another directory,
+# add these directories to sys.path here. If the directory is relative to the
+# documentation root, use os.path.abspath to make it absolute, like shown here.
+sys.path.insert(0, os.path.abspath('..'))
+
+# -- General configuration -----------------------------------------------------
+
+# If your documentation needs a minimal Sphinx version, state it here.
+#needs_sphinx = '1.0'
+
+# Add any Sphinx extension module names here, as strings. They can be extensions
+# coming with Sphinx (named 'sphinx.ext.*') or your custom ones.
+extensions = [
+    'sphinx.ext.autodoc',
+    'sphinx.ext.doctest',
+    'sphinx.ext.coverage',
+    'sphinxcontrib.napoleon'
+]
+
+# Napoleon settings
+napoleon_google_docstring = True
+napoleon_numpy_docstring = True
+napoleon_include_private_with_doc = False
+napoleon_include_special_with_doc = True
+napoleon_use_admonition_for_examples = False
+napoleon_use_admonition_for_notes = False
+napoleon_use_admonition_for_references = False
+napoleon_use_ivar = False
+napoleon_use_param = True
+napoleon_use_rtype = True
+autodoc_member_order = 'bysource'
+
+# Add any paths that contain templates here, relative to this directory.
+templates_path = ['_templates']
+
+# The suffix of source filenames.
+source_suffix = '.rst'
+
+# The encoding of source files.
+#source_encoding = 'utf-8-sig'
+
+# The master toctree document.
+master_doc = 'index'
+
+# General information about the project.
+project = u'pandas-ply'
+copyright = u'2014, Coursera'
+
+# The version info for the project you're documenting, acts as replacement for
+# |version| and |release|, also used in various other places throughout the
+# built documents.
+#
+# The short X.Y version.
+version = '0.1.0'
+# The full version, including alpha/beta/rc tags.
+release = '0.1.0'
+
+# The language for content autogenerated by Sphinx. Refer to documentation
+# for a list of supported languages.
+#language = None
+
+# There are two options for replacing |today|: either, you set today to some
+# non-false value, then it is used:
+#today = ''
+# Else, today_fmt is used as the format for a strftime call.
+#today_fmt = '%B %d, %Y'
+
+# List of patterns, relative to source directory, that match files and
+# directories to ignore when looking for source files.
+exclude_patterns = ['_build']
+
+# The reST default role (used for this markup: `text`) to use for all documents.
+#default_role = None
+
+# If true, '()' will be appended to :func: etc. cross-reference text.
+#add_function_parentheses = True
+
+# If true, the current module name will be prepended to all description
+# unit titles (such as .. function::).
+#add_module_names = True
+
+# If true, sectionauthor and moduleauthor directives will be shown in the
+# output. They are ignored by default.
+#show_authors = False
+
+# The name of the Pygments (syntax highlighting) style to use.
+pygments_style = 'sphinx'
+
+# A list of ignored prefixes for module index sorting.
+#modindex_common_prefix = []
+
+
+# -- Options for HTML output ---------------------------------------------------
+
+# The theme to use for HTML and HTML Help pages.  See the documentation for
+# a list of builtin themes.
+html_theme = 'sphinx_rtd_theme'
+
+# Theme options are theme-specific and customize the look and feel of a theme
+# further.  For a list of options available for each theme, see the
+# documentation.
+#html_theme_options = {}
+
+# Add any paths that contain custom themes here, relative to this directory.
+html_theme_path = [sphinx_rtd_theme.get_html_theme_path()]
+
+# The name for this set of Sphinx documents.  If None, it defaults to
+# "<project> v<release> documentation".
+#html_title = None
+
+# A shorter title for the navigation bar.  Default is the same as html_title.
+#html_short_title = None
+
+# The name of an image file (relative to this directory) to place at the top
+# of the sidebar.
+#html_logo = None
+
+# The name of an image file (within the static path) to use as favicon of the
+# docs.  This file should be a Windows icon file (.ico) being 16x16 or 32x32
+# pixels large.
+#html_favicon = None
+
+# Add any paths that contain custom static files (such as style sheets) here,
+# relative to this directory. They are copied after the builtin static files,
+# so a file named "default.css" will overwrite the builtin "default.css".
+#html_static_path = ['_static']
+
+# If not '', a 'Last updated on:' timestamp is inserted at every page bottom,
+# using the given strftime format.
+#html_last_updated_fmt = '%b %d, %Y'
+
+# If true, SmartyPants will be used to convert quotes and dashes to
+# typographically correct entities.
+#html_use_smartypants = True
+
+# Custom sidebar templates, maps document names to template names.
+#html_sidebars = {}
+
+# Additional templates that should be rendered to pages, maps page names to
+# template names.
+#html_additional_pages = {}
+
+# If false, no module index is generated.
+#html_domain_indices = True
+
+# If false, no index is generated.
+#html_use_index = True
+
+# If true, the index is split into individual pages for each letter.
+#html_split_index = False
+
+# If true, links to the reST sources are added to the pages.
+#html_show_sourcelink = True
+
+# If true, "Created using Sphinx" is shown in the HTML footer. Default is True.
+#html_show_sphinx = True
+
+# If true, "(C) Copyright ..." is shown in the HTML footer. Default is True.
+#html_show_copyright = True
+
+# If true, an OpenSearch description file will be output, and all pages will
+# contain a <link> tag referring to it.  The value of this option must be the
+# base URL from which the finished HTML is served.
+#html_use_opensearch = ''
+
+# This is the file name suffix for HTML files (e.g. ".xhtml").
+#html_file_suffix = None
+
+# Output file base name for HTML help builder.
+#htmlhelp_basename = 'pandas-plydoc'
+
+
+# -- Options for LaTeX output --------------------------------------------------
+
+latex_elements = {
+# The paper size ('letterpaper' or 'a4paper').
+#'papersize': 'letterpaper',
+
+# The font size ('10pt', '11pt' or '12pt').
+#'pointsize': '10pt',
+
+# Additional stuff for the LaTeX preamble.
+#'preamble': '',
+}
+
+# Grouping the document tree into LaTeX files. List of tuples
+# (source start file, target name, title, author, documentclass [howto/manual]).
+latex_documents = [
+  ('index', 'pandas-ply.tex', u'pandas-ply Documentation',
+   u'Coursera', 'manual'),
+]
+
+# The name of an image file (relative to this directory) to place at the top of
+# the title page.
+#latex_logo = None
+
+# For "manual" documents, if this is true, then toplevel headings are parts,
+# not chapters.
+#latex_use_parts = False
+
+# If true, show page references after internal links.
+#latex_show_pagerefs = False
+
+# If true, show URL addresses after external links.
+#latex_show_urls = False
+
+# Documents to append as an appendix to all manuals.
+#latex_appendices = []
+
+# If false, no module index is generated.
+#latex_domain_indices = True
+
+
+# -- Options for manual page output --------------------------------------------
+
+# One entry per manual page. List of tuples
+# (source start file, name, description, authors, manual section).
+man_pages = [
+    ('index', 'pandas-ply', u'pandas-ply Documentation',
+     [u'Coursera'], 1)
+]
+
+# If true, show URL addresses after external links.
+#man_show_urls = False
+
+
+# -- Options for Texinfo output ------------------------------------------------
+
+# Grouping the document tree into Texinfo files. List of tuples
+# (source start file, target name, title, author,
+#  dir menu entry, description, category)
+texinfo_documents = [
+  ('index', 'pandas-ply', u'pandas-ply Documentation',
+   u'Coursera', 'pandas-ply', 'functional data manipulation for pandas',
+   'Miscellaneous'),
+]
+
+# Documents to append as an appendix to all manuals.
+#texinfo_appendices = []
+
+# If false, no module index is generated.
+#texinfo_domain_indices = True
+
+# How to display URL addresses: 'footnote', 'no', or 'inline'.
+#texinfo_show_urls = 'footnote'
@@ -0,0 +1,93 @@
+**pandas-ply**: functional data manipulation for pandas
+=======================================================
+
+**pandas-ply** is a thin layer which makes it easier to manipulate data with `pandas <http://pandas.pydata.org/>`_. In particular, it provides elegant, functional, chainable syntax in cases where **pandas** would require mutation, saved intermediate values, or other awkward constructions. In this way, it aims to move **pandas** closer to the "grammar of data manipulation" provided by the `dplyr <http://cran.r-project.org/web/packages/dplyr/index.html>`_ package for R.
+
+For example, take the **dplyr** code below:
+
+.. code:: r
+
+  flights %>%
+    group_by(year, month, day) %>%
+    summarise(
+      arr = mean(arr_delay, na.rm = TRUE),
+      dep = mean(dep_delay, na.rm = TRUE)
+    ) %>%
+    filter(arr > 30 & dep > 30)
+
+The most common way to express this in **pandas** is probably:
+
+.. code:: python
+
+  grouped_flights = flights.groupby(['year', 'month', 'day'])
+  output = pd.DataFrame()
+  output['arr'] = grouped_flights.arr_delay.mean()
+  output['dep'] = grouped_flights.dep_delay.mean()
+  filtered_output = output[(output.arr > 30) & (output.dep > 30)]
+
+**pandas-ply** lets you instead write:
+
+.. code:: python
+
+  (flights
+    .groupby(['year', 'month', 'day'])
+    .ply_select(
+      arr = X.arr_delay.mean(),
+      dep = X.dep_delay.mean())
+    .ply_where(X.arr > 30, X.dep > 30))
+
+In our opinion, this **pandas-ply** code is cleaner, more expressive, more readable, more concise, and less error-prone than the original **pandas** code.
+
+Explanatory notes on the **pandas-ply** code sample above:
+
+* **pandas-ply**'s methods (like ``ply_select`` and ``ply_where`` above) are attached directly to **pandas** objects and can be used immediately, without any wrapping or redirection. They start with a ``ply_`` prefix to distinguish them from built-in **pandas** methods.
+* **pandas-ply**'s methods are named for (and modelled after) SQL's operators. (But keep in mind that these operators will not always appear in the same order as they do in a SQL statement: ``SELECT a FROM b WHERE c GROUP BY d`` probably maps to ``b.ply_where(c).groupby(d).ply_select(a)``.)
+* **pandas-ply** includes a simple system for building "symbolic expressions" to provide as arguments to its methods. ``X`` above is an instance of ``ply.symbolic.Symbol``. Operations on this symbol produce larger compound symbolic expressions. When **pandas-ply** receives a symbolic expression as an argument, it converts it into a function. So, for instance, ``X.arr > 30`` in the above code could have instead been provided as ``lambda x: x.arr > 30``. Use of symbolic expressions allows the ``lambda x:`` to be left off, resulting in less cluttered code.
+
+Warning
+-------
+
+**pandas-ply** is new, and in an experimental stage of its development. The API is not yet stable. Expect the unexpected.
+
+(Pull requests are welcome. Feel free to contact us at pandas-ply@coursera.org.)
+
+Using **pandas-ply**
+--------------------
+
+Install **pandas-ply** with:
+
+::
+
+  $ pip install pandas-ply
+
+
+Typical use of **pandas-ply** starts with:
+
+.. code:: python
+
+  import pandas as pd
+  from ply import install_ply, X, sym_call
+
+  install_ply(pd)
+
+After calling ``install_ply``, all **pandas** objects have **pandas-ply**'s methods attached.
+
+API reference
+-------------
+
+pandas extensions
+~~~~~~~~~~~~~~~~~
+
+.. automodule:: ply.methods
+    :members:
+    :undoc-members:
+    :show-inheritance:
+
+`ply.symbolic`
+~~~~~~~~~~~~~~
+
+.. automodule:: ply.symbolic
+    :members:
+    :undoc-members:
+    :private-members:
+    :show-inheritance:
@@ -0,0 +1,2 @@
+from methods import install_ply
+from symbolic import X, sym_call
@@ -0,0 +1,206 @@
+"""This module contains the **pandas-ply** methods which are designed to be
+added to pandas objects. The methods in this module should not be used directly.
+Instead, the function `install_ply` should be used to attach them to the pandas
+classes."""
+
+import symbolic
+
+pandas = None
+
+def install_ply(pandas_to_use):
+    """Install `pandas-ply` onto the objects in a copy of `pandas`."""
+
+    global pandas
+    pandas = pandas_to_use
+
+    pandas.DataFrame.ply_where = _ply_where
+    pandas.DataFrame.ply_select = _ply_select
+
+    pandas.Series.ply_where = _ply_where
+
+    pandas.core.groupby.DataFrameGroupBy.ply_select = _ply_select_for_groups
+
+    pandas.core.groupby.SeriesGroupBy.ply_select = _ply_select_for_groups
+
+
+def _ply_where(self, *conditions):
+    """Filter a dataframe/series to only include rows/entries satisfying a
+    given set of conditions.
+
+    Analogous to SQL's ``WHERE``, or dplyr's ``filter``.
+
+    Args:
+        `*conditions`: Each should be a dataframe/series of booleans, a
+            function returning such an object when run on the input dataframe,
+            or a symbolic expression yielding such an object when evaluated
+            with Symbol(0) mapped to the input dataframe. The input dataframe
+            will be filtered by the AND of all the conditions.
+
+    Example:
+        >>> flights.ply_where(X.month == 1, X.day == 1)
+        [ same result as `flights[(flights.month == 1) & (flights.day == 1)]` ]
+    """
+
+    if not conditions:
+        return self
+
+    evalled_conditions = [symbolic.to_callable(condition)(self)
+                          for condition in conditions]
+    anded_evalled_conditions = reduce(
+        lambda x, y: x & y, evalled_conditions)
+    return self[anded_evalled_conditions]
+
+
+def _ply_select(self, *args, **kwargs):
+    """Transform a dataframe by selecting old columns and new (computed)
+    columns.
+
+    Analogous to SQL's ``SELECT``, or dplyr's ``select`` / ``rename`` /
+    ``mutate`` / ``transmute``.
+
+    Args:
+        `*args`: Each should be one of:
+
+            ``'*'``
+                says that all columns in the input dataframe should be
+                included
+            ``'column_name'``
+                says that `column_name` in the input dataframe should be
+                included
+            ``'-column_name'``
+                says that `column_name` in the input dataframe should be
+                excluded.
+
+            If any `'-column_name'` is present, then `'*'` should be
+            present, and if `'*'` is present, no `'column_name'` should be
+            present. Column-includes and column-excludes should not overlap.
+        `**kwargs`: Each argument name will be the name of a new column in the
+            output dataframe, with the column's contents determined by the
+            argument contents. These contents can be given as a dataframe, a
+            function (taking the input dataframe as its single argument), or a
+            symbolic expression (taking the input dataframe as ``Symbol(0)``).
+            kwarg-provided columns override arg-provided columns.
+
+    Example:
+        >>> flights.ply_select('*',
+        ...     gain = X.arr_delay - X.dep_delay,
+        ...     speed = X.distance / X.air_time * 60)
+        [ original dataframe, with two new computed columns added ]
+    """
+
+    input_columns = set(self.columns)
+
+    has_star = False
+    include_columns = []
+    exclude_columns = []
+    for arg in args:
+        if arg == '*':
+            if has_star:
+                raise ValueError('ply_select received repeated stars')
+            has_star = True
+        elif arg in input_columns:
+            if arg in include_columns:
+                raise ValueError(
+                    'ply_select received a repeated column-include (%s)' %
+                    arg)
+            include_columns.append(arg)
+        elif arg[0] == '-' and arg[1:] in input_columns:
+            if arg[1:] in exclude_columns:
+                raise ValueError(
+                    'ply_select received a repeated column-exclude (%s)' %
+                    arg[1:])
+            exclude_columns.append(arg[1:])
+        else:
+            raise ValueError(
+                'ply_select received a strange argument (%s)' %
+                arg)
+    if exclude_columns and not has_star:
+        raise ValueError(
+            'ply_select received column-excludes without a star')
+    if has_star and include_columns:
+        raise ValueError(
+            'ply_select received both a star and column-includes')
+    if set(include_columns) & set(exclude_columns):
+        raise ValueError(
+            'ply_select received overlapping column-includes and ' +
+            'column-excludes')
+
+    include_columns_inc_star = self.columns if has_star else include_columns
+
+    output_columns = [col for col in include_columns_inc_star
+                      if col not in exclude_columns]
+
+    # Note: This maintains self's index even if output_columns is [].
+    to_return = self[output_columns]
+
+    # Temporarily disable SettingWithCopyWarning, as setting columns on a
+    # copy (`to_return`) is intended here.
+    old_chained_assignment = pandas.options.mode.chained_assignment
+    pandas.options.mode.chained_assignment = None
+
+    for column_name, column_value in kwargs.iteritems():
+        evaluated_value = symbolic.to_callable(column_value)(self)
+        # TODO: verify that evaluated_value is a series!
+        if column_name == 'index':
+            to_return.index = evaluated_value
+        else:
+            to_return[column_name] = evaluated_value
+
+    pandas.options.mode.chained_assignment = old_chained_assignment
+
+    return to_return
+
+
+# TODO: Ensure that an empty ply_select on a groupby returns a large dataframe
+def _ply_select_for_groups(self, **kwargs):
+    """Summarize a grouped dataframe or series.
+
+    Analogous to SQL's ``SELECT`` (when a ``GROUP BY`` is present), or dplyr's
+    ``summarise``.
+
+    Args:
+        `**kwargs`: Each argument name will be the name of a new column in the
+            output dataframe, with the column's contents determined by the
+            argument contents. These contents can be given as a dataframe, a
+            function (taking the input grouped dataframe as its single
+            argument), or a symbolic expression (taking the input grouped
+            dataframe as `Symbol(0)`).
+    """
+
+    to_return = pandas.DataFrame()
+
+    for column_name, column_value in kwargs.iteritems():
+        evaluated_value = symbolic.to_callable(column_value)(self)
+        if column_name == 'index':
+            to_return.index = evaluated_value
+        else:
+            to_return[column_name] = evaluated_value
+
+    return to_return
+
+
+class PlyDataFrame:
+    """The following methods are added to `pandas.DataFrame`:"""
+
+    ply_where = _ply_where
+    ply_select = _ply_select
+
+
+class PlySeries:
+    """The following methods are added to `pandas.Series`:"""
+
+    ply_where = _ply_where
+
+
+class PlyDataFrameGroupBy:
+    """The following methods are added to
+    `pandas.core.groupby.DataFrameGroupBy`:"""
+
+    ply_select = _ply_select_for_groups
+
+
+class PlySeriesGroupBy:
+    """The following methods are added to
+    `pandas.core.groupby.SeriesGroupBy`:"""
+
+    ply_select = _ply_select_for_groups
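
Review note on the ply/methods.py hunk above: the `_ply_select` docstring describes how the string arguments `'*'`, `'column_name'`, and `'-column_name'` combine. The sketch below restates those rules as a standalone Python 3 function so they can be tried without pandas. `resolve_columns` and the sample `columns` list are hypothetical names made up for this note, not part of **pandas-ply**, and the error messages are simplified.

```python
def resolve_columns(input_columns, *args):
    """Return the output column names chosen by ply_select-style string args.

    Hypothetical helper restating the documented rules; not pandas-ply code.
    """
    has_star = False
    includes = []
    excludes = []
    for arg in args:
        if arg == '*':
            if has_star:
                raise ValueError('repeated star')
            has_star = True
        elif arg in input_columns:
            if arg in includes:
                raise ValueError('repeated column-include (%s)' % arg)
            includes.append(arg)
        elif arg.startswith('-') and arg[1:] in input_columns:
            # Compare the bare column name, since that is what gets stored.
            if arg[1:] in excludes:
                raise ValueError('repeated column-exclude (%s)' % arg[1:])
            excludes.append(arg[1:])
        else:
            raise ValueError('unrecognised argument (%s)' % arg)
    if excludes and not has_star:
        raise ValueError('column-excludes require a star')
    if has_star and includes:
        raise ValueError('a star cannot be combined with column-includes')
    # A star means "all input columns, in their original order".
    base = list(input_columns) if has_star else includes
    return [col for col in base if col not in excludes]


columns = ['year', 'month', 'day', 'arr_delay', 'dep_delay']
everything_but_day = resolve_columns(columns, '*', '-day')
just_delays = resolve_columns(columns, 'dep_delay', 'arr_delay')
print(everything_but_day)
print(just_delays)
```

Explicit includes keep the order in which they were passed, while a star keeps the input order. Because excludes require a star and a star forbids includes, the two lists can never overlap in this sketch.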
@@ -0,0 +1,202 @@
+"""`ply.symbolic` is a simple system for building "symbolic expressions" to
+provide as arguments to **pandas-ply**'s methods (in place of lambda
+expressions)."""
+
+
+class Expression:
+    """`Expression` is the (abstract) base class for symbolic expressions.
+    Symbolic expressions are encoded representations of Python expressions,
+    kept on ice until you are ready to evaluate them. Operations on
+    symbolic expressions (like `my_expr.some_attr` or `my_expr(some_arg)` or
+    `my_expr + 7`) are automatically turned into symbolic representations
+    thereof -- nothing is actually done until the special evaluation method
+    `_eval` is called.
+    """
+
+    def _eval(self, context, **options):
+        """Evaluate a symbolic expression.
+
+        Args:
+            context: The context object for evaluation. Currently, this is a
+                dictionary mapping symbol names to values.
+            `**options`: Options for evaluation. Currently, the only option is
+                `log`, which results in some debug output during evaluation if
+                it is set to `True`.
+
+        Returns:
+            anything
+        """
+        raise NotImplementedError
+
+    def __repr__(self):
+        raise NotImplementedError
+
+    def __coerce__(self, other):
+        return None
+
+    def __getattr__(self, name):
+        """Construct a symbolic representation of `getattr(self, name)`."""
+        return GetAttr(self, name)
+
+    def __call__(self, *args, **kwargs):
+        """Construct a symbolic representation of `self(*args, **kwargs)`."""
+        return Call(self, args=args, kwargs=kwargs)
+
+
+# Here are the varieties of atomic / compound Expression.
+
+
+class Symbol(Expression):
+    """`Symbol(name)` is an atomic symbolic expression, labelled with an
+    arbitrary `name`."""
+
+    def __init__(self, name):
+        self._name = name
+
+    def _eval(self, context, **options):
+        if options.get('log'):
+            print 'Symbol._eval', repr(self)
+        result = context[self._name]
+        if options.get('log'):
+            print 'Returning', repr(self), '=>', repr(result)
+        return result
+
+    def __repr__(self):
+        return 'Symbol(%s)' % repr(self._name)
+
+
+class GetAttr(Expression):
+    """`GetAttr(obj, name)` is a symbolic expression representing the result of
+    `getattr(obj, name)`. (`obj` and `name` can themselves be symbolic.)"""
+
+    def __init__(self, obj, name):
+        self._obj = obj
+        self._name = name
+
+    def _eval(self, context, **options):
+        if options.get('log'):
+            print 'GetAttr._eval', repr(self)
+        evaled_obj = eval_if_symbolic(self._obj, context, **options)
+        result = getattr(evaled_obj, self._name)
+        if options.get('log'):
+            print 'Returning', repr(self), '=>', repr(result)
+        return result
+
+    def __repr__(self):
+        return 'getattr(%s, %s)' % (repr(self._obj), repr(self._name))
+
+
+class Call(Expression):
+    """`Call(func, args, kwargs)` is a symbolic expression representing the
+    result of `func(*args, **kwargs)`. (`func`, each member of the `args`
+    iterable, and each value in the `kwargs` dictionary can themselves be
+    symbolic)."""
+
+    def __init__(self, func, args=(), kwargs=None):
+        self._func = func
+        self._args = args
+        self._kwargs = {} if kwargs is None else kwargs
+
+    def _eval(self, context, **options):
+        if options.get('log'):
+            print 'Call._eval', repr(self)
+        evaled_func = eval_if_symbolic(self._func, context, **options)
+        evaled_args = [eval_if_symbolic(v, context, **options)
+                       for v in self._args]
+        evaled_kwargs = {k: eval_if_symbolic(v, context, **options)
+                         for k, v in self._kwargs.iteritems()}
+        result = evaled_func(*evaled_args, **evaled_kwargs)
+        if options.get('log'):
+            print 'Returning', repr(self), '=>', repr(result)
+        return result
+
+    def __repr__(self):
+        return '{func}(*{args}, **{kwargs})'.format(
+            func=repr(self._func),
+            args=repr(self._args),
+            kwargs=repr(self._kwargs))
+
+
+def eval_if_symbolic(obj, context, **options):
+    """Evaluate an object if it is a symbolic expression, or otherwise just
+    return it unchanged.
+
+    Args:
+        obj: Either a symbolic expression, or anything else (in which case this
+            is a noop).
+        context: Passed as an argument to `obj._eval` if `obj` is symbolic.
+        `**options`: Passed as arguments to `obj._eval` if `obj` is symbolic.
+
+    Returns:
+        anything
+
+    Examples:
+        >>> eval_if_symbolic(Symbol('x'), {'x': 10})
+        10
+        >>> eval_if_symbolic(7, {'x': 10})
+        7
+    """
+    return obj._eval(context, **options) if hasattr(obj, '_eval') else obj
+
+
+def to_callable(obj):
+    """Turn an object into a callable.
+
+    Args:
+        obj: This can be
+
+            * **a symbolic expression**, in which case the output callable
+              evaluates the expression with symbols taking values from the
+              callable's arguments (listed arguments named according to their
+              numerical index, keyword arguments named according to their
+              string keys),
+            * **a callable**, in which case the output callable is just the
+              input object, or
+            * **anything else**, in which case the output callable is a
+              constant function which always returns the input object.
+
+    Returns:
+        callable
+
+    Examples:
+        >>> to_callable(Symbol(0) + Symbol('x'))(3, x=4)
+        7
+        >>> to_callable(lambda x: x + 1)(10)
+        11
+        >>> to_callable(12)(3, x=4)
+        12
+    """
+    if hasattr(obj, '_eval'):
+        return lambda *args, **kwargs: obj._eval(dict(enumerate(args), **kwargs))
+    elif callable(obj):
+        return obj
+    else:
+        return lambda *args, **kwargs: obj
+
+
+def sym_call(func, *args, **kwargs):
+    """Construct a symbolic representation of `func(*args, **kwargs)`.
+
+    This is necessary because `func(symbolic)` will not (ordinarily) know to
+    construct a symbolic expression when it receives the symbolic
+    expression `symbolic` as a parameter (if `func` is not itself symbolic).
+    So instead, we write `sym_call(func, symbolic)`.
+
+    Args:
+        func: Function to call on evaluation (can be symbolic).
+        `*args`: Arguments to provide to `func` on evaluation (can be symbolic).
+        `**kwargs`: Keyword arguments to provide to `func` on evaluation (can be
+            symbolic).
+
+    Returns:
+        `ply.symbolic.Expression`
+
+    Example:
+        >>> sym_call(math.sqrt, Symbol('x'))._eval({'x': 16})
+        4.0
+    """
+
+    return Call(func, args=args, kwargs=kwargs)
+
+X = Symbol(0)
+"""A Symbol for "the first argument" (for convenience)."""
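The module above targets Python 2, where `Expression` is an old-style class and `__getattr__` also intercepts operator lookups such as `__add__` (which is why `X + 1` builds a `GetAttr`/`Call` pair). Python 3 looks operators up on the type and bypasses `__getattr__`, so the same idea needs explicit operator methods. The following is a minimal Python 3 sketch of the deferred-evaluation approach under that assumption; `Expr`, `Sym`, and `evaluate` are illustrative names, attribute access is modelled as a deferred `getattr` call rather than a separate `GetAttr` class, and only a few operators are covered.

```python
import math
import operator


class Expr:
    """Base class: using an expression builds a bigger one; nothing runs yet."""

    def __getattr__(self, name):
        # Refuse dunder names so protocol probes (copy, pickle) fail cleanly.
        if name.startswith('__'):
            raise AttributeError(name)
        return Call(getattr, (self, name))

    def __call__(self, *args, **kwargs):
        return Call(self, args, kwargs)

    # Python 3 looks operators up on the type, so each one is spelled out.
    def __add__(self, other):
        return Call(operator.add, (self, other))

    def __radd__(self, other):
        return Call(operator.add, (other, self))

    def __pow__(self, other):
        return Call(operator.pow, (self, other))


class Sym(Expr):
    """A named placeholder, looked up in the context at evaluation time."""

    def __init__(self, name):
        self._name = name

    def _eval(self, context):
        return context[self._name]


class Call(Expr):
    """A deferred `func(*args, **kwargs)`; any part may itself be an Expr."""

    def __init__(self, func, args=(), kwargs=None):
        self._func = func
        self._args = args
        self._kwargs = kwargs or {}

    def _eval(self, context):
        func = evaluate(self._func, context)
        args = [evaluate(a, context) for a in self._args]
        kwargs = {k: evaluate(v, context) for k, v in self._kwargs.items()}
        return func(*args, **kwargs)


def evaluate(obj, context):
    """Evaluate Expr instances; pass every other object through unchanged."""
    return obj._eval(context) if isinstance(obj, Expr) else obj


def to_callable(expr):
    """Positional arguments become symbols 0, 1, ...; keywords keep their names."""
    return lambda *args, **kwargs: expr._eval(dict(enumerate(args), **kwargs))


hypot = to_callable(Call(math.sqrt, (Sym('x') ** 2 + Sym('y') ** 2,)))
print(hypot(x=3, y=4))  # 5.0
total = to_callable(Sym(0) + Sym('x'))
print(total(3, x=4))  # 7
```

Routing operators through the `operator` module instead of a deferred `__add__` attribute lookup also lets Python's usual reflected-operand fallback apply at evaluation time, at the cost of a `repr` that no longer mirrors the original expression.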
@@ -0,0 +1,15 @@
+from distutils.core import setup
+setup(
+    name = 'pandas-ply',
+    version = '0.1.0',
+    author = 'Coursera Inc.',
+    author_email = 'pandas-ply@coursera.org',
+    packages = [
+        'ply',
+    ],
+    description = 'functional data manipulation for pandas',
+    long_description = open('README.rst').read(),
+    license = 'Apache License 2.0',
+    url = 'https://github.com/coursera/pandas-ply',
+    classifiers = [],
+)
@@ -0,0 +1,3 @@
+#!/bin/bash
+
+ls test_*.py | xargs -n 1 python
@@ -0,0 +1,196 @@
+import unittest
+
+from pandas.util.testing import assert_frame_equal
+from pandas.util.testing import assert_series_equal
+from ply.methods import install_ply
+from ply.symbolic import X
+import pandas as pd
+
+install_ply(pd)
+
+
+def assert_frame_equiv(df1, df2, **kwargs):
+    """ Assert that two dataframes are equal, ignoring ordering of columns.
+
+    See http://stackoverflow.com/questions/14224172/equality-in-pandas-
+        dataframes-column-order-matters
+    """
+    return assert_frame_equal(
+        df1.sort(axis=1),
+        df2.sort(axis=1),
+        check_names=True, **kwargs)
+
+test_df = pd.DataFrame(
+    {'x': [1, 2, 3, 4], 'y': [4, 3, 2, 1]},
+    columns=['x', 'y'])
+test_series = pd.Series([1, 2, 3, 4])
+
+test_dfsq = pd.DataFrame(
+    {'x': [-2, -1, 0, 1, 2], 'xsq': [4, 1, 0, 1, 4]},
+    columns=['x', 'xsq'])
+
+
+class PlyWhereTest(unittest.TestCase):
+
+    def test_no_conditions(self):
+        assert_frame_equal(test_df.ply_where(), test_df)
+
+    def test_single_condition(self):
+        expected = pd.DataFrame(
+            {'x': [3, 4], 'y': [2, 1]},
+            index=[2, 3],
+            columns=['x', 'y'])
+
+        assert_frame_equal(test_df.ply_where(test_df.x > 2.5), expected)
+        assert_frame_equal(test_df.ply_where(lambda df: df.x > 2.5), expected)
+        assert_frame_equal(test_df.ply_where(X.x > 2.5), expected)
+
+    def test_multiple_conditions(self):
+        expected = pd.DataFrame(
+            {'x': [2, 3], 'y': [3, 2]},
+            index=[1, 2],
+            columns=['x', 'y'])
+
+        lo_df = test_df.x > 1.5
+        hi_df = test_df.x < 3.5
+        lo_func = lambda df: df.x > 1.5
+        hi_func = lambda df: df.x < 3.5
+        lo_sym = X.x > 1.5
+        hi_sym = X.x < 3.5
+
+        for lo in [lo_df, lo_func, lo_sym]:
+            for hi in [hi_df, hi_func, hi_sym]:
+                assert_frame_equal(test_df.ply_where(lo, hi), expected)
+
+
+class PlyWhereForSeriesTest(unittest.TestCase):
+
+    def test_no_conditions(self):
+        assert_series_equal(test_series.ply_where(), test_series)
+
+    def test_single_condition(self):
+        expected = pd.Series([3, 4], index=[2, 3])
+
+        assert_series_equal(test_series.ply_where(test_series > 2.5), expected)
+        assert_series_equal(test_series.ply_where(lambda s: s > 2.5), expected)
+        assert_series_equal(test_series.ply_where(X > 2.5), expected)
+
+    def test_multiple_conditions(self):
+        expected = pd.Series([2, 3], index=[1, 2])
+
+        assert_series_equal(
+            test_series.ply_where(test_series < 3.5, test_series > 1.5), expected)
+        assert_series_equal(
+            test_series.ply_where(test_series < 3.5, lambda s: s > 1.5), expected)
+        assert_series_equal(
+            test_series.ply_where(test_series < 3.5, X > 1.5), expected)
+        assert_series_equal(
+            test_series.ply_where(lambda s: s < 3.5, lambda s: s > 1.5), expected)
+        assert_series_equal(
+            test_series.ply_where(lambda s: s < 3.5, X > 1.5), expected)
+        assert_series_equal(
+            test_series.ply_where(X < 3.5, X > 1.5), expected)
+
+
+class PlySelectTest(unittest.TestCase):
+
+    def test_bad_arguments(self):
+        # Nonexistent column, include or exclude
+        with self.assertRaises(ValueError):
+            test_df.ply_select('z')
+        with self.assertRaises(ValueError):
+            test_df.ply_select('-z')
+
+        # Exclude without asterisk
+        with self.assertRaises(ValueError):
+            test_df.ply_select('-x')
+
+        # Include with asterisk
+        with self.assertRaises(ValueError):
+            test_df.ply_select('*', 'x')
+
+    def test_noops(self):
+        assert_frame_equal(test_df.ply_select('*'), test_df)
+        assert_frame_equal(test_df.ply_select('x', 'y'), test_df)
+        assert_frame_equiv(test_df.ply_select(x=X.x, y=X.y), test_df)
+
+    def test_reorder(self):
+        reordered = test_df.ply_select('y', 'x')
+        assert_frame_equiv(reordered, test_df[['y', 'x']])
+        self.assertEqual(list(reordered.columns), ['y', 'x'])
+
+    def test_subset_via_includes(self):
+        assert_frame_equal(test_df.ply_select('x'), test_df[['x']])
+        assert_frame_equal(test_df.ply_select('y'), test_df[['y']])
+
+    def test_subset_via_excludes(self):
+        assert_frame_equal(test_df.ply_select('*', '-y'), test_df[['x']])
+        assert_frame_equal(test_df.ply_select('*', '-x'), test_df[['y']])
+
+    def test_empty(self):
+        assert_frame_equal(test_df.ply_select(), test_df[[]])
+        assert_frame_equal(test_df.ply_select('*', '-x', '-y'), test_df[[]])
+
+    def test_ways_of_providing_new_columns(self):
+        # Value
+        assert_frame_equal(
+            test_df.ply_select(new=5),
+            pd.DataFrame({'new': [5, 5, 5, 5]}))
+
+        # Dataframe-like
+        assert_frame_equal(
+            test_df.ply_select(new=[5, 6, 7, 8]),
+            pd.DataFrame({'new': [5, 6, 7, 8]}))
+
+        # Function
+        assert_frame_equal(
+            test_df.ply_select(new=lambda df: df.x),
+            pd.DataFrame({'new': [1, 2, 3, 4]}))
+
+        # Symbolic expression
+        assert_frame_equal(
+            test_df.ply_select(new=X.x),
+            pd.DataFrame({'new': [1, 2, 3, 4]}))
+
+    def test_old_and_new_together(self):
+        assert_frame_equal(
+            test_df.ply_select('x', total=X.x + X.y),
+            pd.DataFrame(
+                {'x': [1, 2, 3, 4], 'total': [5, 5, 5, 5]},
+                columns=['x', 'total']))
+
+    def test_kwarg_overrides_asterisk(self):
+        assert_frame_equal(
+            test_df.ply_select('*', y=X.x),
+            pd.DataFrame({'x': [1, 2, 3, 4], 'y': [1, 2, 3, 4]}))
+
+    def test_kwarg_overrides_column_include(self):
+        assert_frame_equal(
+            test_df.ply_select('x', 'y', y=X.x),
+            pd.DataFrame({'x': [1, 2, 3, 4], 'y': [1, 2, 3, 4]}))
+
+    def test_new_index(self):
+        assert_frame_equal(
+            test_df.ply_select('x', index=X.y),
+            pd.DataFrame(
+                {'x': [1, 2, 3, 4]},
+                index=pd.Index([4, 3, 2, 1], name='y')))
+
+
+class PlySelectForGroupsTest(unittest.TestCase):
+
+    def test_simple(self):
+        grp = test_dfsq.groupby('xsq')
+        assert_frame_equal(
+            grp.ply_select(count=X.x.count()),
+            pd.DataFrame(
+                {'count': [1, 2, 2]},
+                index=pd.Index([0, 1, 4], name='xsq')))
+
+
+if __name__ == '__main__':
+    try:
+        from colour_runner.runner import ColourTextTestRunner
+        unittest.main(verbosity=2, testRunner=ColourTextTestRunner)
+    except ImportError:
+        unittest.main(verbosity=2)
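The `ply_where` tests above exercise two behaviours: a condition may be a precomputed mask or something callable on the data, and several conditions are combined with logical AND (no conditions keeps everything). The implementation of `_ply_where` is not part of this section, so the snippet below is only a sketch of those semantics on plain lists; `to_mask` and `where` are hypothetical helpers, not pandas-ply functions.

```python
def to_mask(condition, values):
    """Return one boolean per value; callables receive the whole list."""
    flags = condition(values) if callable(condition) else condition
    return [bool(flag) for flag in flags]


def where(values, *conditions):
    """Keep the values for which every condition holds (logical AND)."""
    masks = [to_mask(condition, values) for condition in conditions]
    return [value for i, value in enumerate(values)
            if all(mask[i] for mask in masks)]


data = [1, 2, 3, 4]
precomputed = [value > 1.5 for value in data]            # mask given directly
as_function = lambda values: [v < 3.5 for v in values]   # mask computed lazily

print(where(data, precomputed, as_function))  # [2, 3]
print(where(data))                            # [1, 2, 3, 4]
```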
@@ -0,0 +1,248 @@
+import unittest
+import mock
+
+from ply.symbolic import Call
+from ply.symbolic import GetAttr
+from ply.symbolic import Symbol
+from ply.symbolic import eval_if_symbolic
+from ply.symbolic import sym_call
+from ply.symbolic import to_callable
+
+
+class ExpressionTest(unittest.TestCase):
+
+    # These test whether operations on symbolic expressions correctly construct
+    # compound symbolic expressions:
+
+    def test_getattr(self):
+        expr = Symbol('some_symbol').some_attr
+        self.assertEqual(
+            repr(expr),
+            "getattr(Symbol('some_symbol'), 'some_attr')")
+
+    def test_call(self):
+        expr = Symbol('some_symbol')('arg1', 'arg2', kwarg_name='kwarg value')
+        self.assertEqual(
+            repr(expr),
+            "Symbol('some_symbol')(*('arg1', 'arg2'), " +
+            "**{'kwarg_name': 'kwarg value'})")
+
+    def test_ops(self):
+        expr = Symbol('some_symbol') + 1
+        self.assertEqual(
+            repr(expr),
+            "getattr(Symbol('some_symbol'), '__add__')(*(1,), **{})")
+
+        expr = 1 + Symbol('some_symbol')
+        self.assertEqual(
+            repr(expr),
+            "getattr(Symbol('some_symbol'), '__radd__')(*(1,), **{})")
+
+        expr = Symbol('some_symbol')['key']
+        self.assertEqual(
+            repr(expr),
+            "getattr(Symbol('some_symbol'), '__getitem__')(*('key',), **{})")
+
+
+class SymbolTest(unittest.TestCase):
+
+    def test_eval(self):
+        self.assertEqual(
+            Symbol('some_symbol')._eval({'some_symbol': 'value'}),
+            'value')
+        self.assertEqual(
+            Symbol('some_symbol')._eval(
+                {'some_symbol': 'value', 'other_symbol': 'irrelevant'}),
+            'value')
+        with self.assertRaises(KeyError):
+            Symbol('some_symbol')._eval({'other_symbol': 'irrelevant'})
+
+    def test_repr(self):
+        self.assertEqual(repr(Symbol('some_symbol')), "Symbol('some_symbol')")
+
+
+class GetAttrTest(unittest.TestCase):
+
+    def test_eval_with_nonsymbolic_object(self):
+        some_obj = mock.Mock()
+        del some_obj._eval
+        # Ensure constructing the expression does not access `.some_attr`.
+        del some_obj.some_attr
+
+        with self.assertRaises(AttributeError):
+            some_obj.some_attr
+        expr = GetAttr(some_obj, 'some_attr')
+
+        some_obj.some_attr = 'attribute value'
+
+        self.assertEqual(expr._eval({}), 'attribute value')
+
+    def test_eval_with_symbolic_object(self):
+        some_obj = mock.Mock()
+        del some_obj._eval
+        some_obj.some_attr = 'attribute value'
+
+        expr = GetAttr(Symbol('some_symbol'), 'some_attr')
+
+        self.assertEqual(
+            expr._eval({'some_symbol': some_obj}),
+            'attribute value')
+
+    def test_repr(self):
+        self.assertEqual(
+            repr(GetAttr('object', 'attrname')),
+            "getattr('object', 'attrname')")
+
+
+class CallTest(unittest.TestCase):
+
+    def test_eval_with_nonsymbolic_func(self):
+        func = mock.Mock(return_value='return value')
+        del func._eval  # So it doesn't pretend to be symbolic
+
+        expr = Call(func, ('arg1', 'arg2'), {'kwarg_name': 'kwarg value'})
+
+        # Ensure constructing the expression does not call the function
+        self.assertFalse(func.called)
+
+        result = expr._eval({})
+
+        func.assert_called_once_with('arg1', 'arg2', kwarg_name='kwarg value')
+        self.assertEqual(result, 'return value')
+
+    def test_eval_with_symbolic_func(self):
+        func = mock.Mock(return_value='return value')
+        del func._eval  # So it doesn't pretend to be symbolic
+
+        expr = Call(
+            Symbol('some_symbol'),
+            ('arg1', 'arg2'),
+            {'kwarg_name': 'kwarg value'})
+
+        result = expr._eval({'some_symbol': func})
+
+        func.assert_called_once_with('arg1', 'arg2', kwarg_name='kwarg value')
+        self.assertEqual(result, 'return value')
+
+    def test_eval_with_symbolic_arg(self):
+        func = mock.Mock(return_value='return value')
+        del func._eval  # So it doesn't pretend to be symbolic
+
+        expr = Call(
+            func,
+            (Symbol('some_symbol'), 'arg2'),
+            {'kwarg_name': 'kwarg value'})
+
+        result = expr._eval({'some_symbol': 'arg1'})
+
+        func.assert_called_once_with('arg1', 'arg2', kwarg_name='kwarg value')
+        self.assertEqual(result, 'return value')
+
+    def test_eval_with_symbol_kwarg(self):
+        func = mock.Mock(return_value='return value')
+        del func._eval  # So it doesn't pretend to be symbolic
+
+        expr = Call(
+            func,
+            ('arg1', 'arg2'),
+            {'kwarg_name': Symbol('some_symbol')})
+
+        result = expr._eval({'some_symbol': 'kwarg value'})
+
+        func.assert_called_once_with('arg1', 'arg2', kwarg_name='kwarg value')
+        self.assertEqual(result, 'return value')
+
+    def test_repr(self):
+        # One arg
+        self.assertEqual(
+            repr(Call('func', ('arg1',), {'kwarg_name': 'kwarg value'})),
+            "'func'(*('arg1',), **{'kwarg_name': 'kwarg value'})")
+
+        # Two args
+        self.assertEqual(
+            repr(Call(
+                'func',
+                ('arg1', 'arg2'),
+                {'kwarg_name': 'kwarg value'})),
+            "'func'(*('arg1', 'arg2'), **{'kwarg_name': 'kwarg value'})")
+
+
+class FunctionsTest(unittest.TestCase):
+
+    def test_eval_if_symbolic(self):
+        self.assertEqual(
+            eval_if_symbolic(
+                'nonsymbolic',
+                {'some_symbol': 'symbol_value'}),
+            'nonsymbolic')
+        self.assertEqual(
+            eval_if_symbolic(
+                Symbol('some_symbol'),
+                {'some_symbol': 'symbol_value'}),
+            'symbol_value')
+
+    def test_to_callable_from_nonsymbolic_noncallable(self):
+        test_callable = to_callable('nonsymbolic')
+        self.assertEqual(
+            test_callable('arg1', 'arg2', kwarg_name='kwarg value'),
+            'nonsymbolic')
+
+    def test_to_callable_from_nonsymbolic_callable(self):
+        func = mock.Mock(return_value='return value')
+        del func._eval  # So it doesn't pretend to be symbolic
+
+        test_callable = to_callable(func)
+
+        # Ensure running to_callable does not call the function
+        self.assertFalse(func.called)
+
+        result = test_callable('arg1', 'arg2', kwarg_name='kwarg value')
+
+        func.assert_called_once_with('arg1', 'arg2', kwarg_name='kwarg value')
+        self.assertEqual(result, 'return value')
+
+    def test_to_callable_from_symbolic(self):
+        mock_expr = mock.Mock()
+        mock_expr._eval.return_value = 'eval return value'
+
+        test_callable = to_callable(mock_expr)
+
+        # Ensure running to_callable does not evaluate the expression
+        self.assertFalse(mock_expr._eval.called)
+
+        result = test_callable('arg1', 'arg2', kwarg_name='kwarg value')
+
+        mock_expr._eval.assert_called_once_with(
+            {0: 'arg1', 1: 'arg2', 'kwarg_name': 'kwarg value'})
+        self.assertEqual(result, 'eval return value')
+
+    def test_sym_call(self):
+        expr = sym_call(
+            'func', Symbol('some_symbol'), 'arg1', 'arg2',
+            kwarg_name='kwarg value')
+        self.assertEqual(
+            repr(expr),
+            "'func'(*(Symbol('some_symbol'), 'arg1', 'arg2'), " +
+            "**{'kwarg_name': 'kwarg value'})")
+
+
+class IntegrationTest(unittest.TestCase):
+
+    def test_pythagoras(self):
+        from math import sqrt
+
+        X = Symbol('X')
+        Y = Symbol('Y')
+
+        expr = sym_call(sqrt, X ** 2 + Y ** 2)
+        func = to_callable(expr)
+
+        self.assertEqual(func(X=3, Y=4), 5)
+
+
+if __name__ == '__main__':
+    try:
+        from colour_runner.runner import ColourTextTestRunner
+        unittest.main(verbosity=2, testRunner=ColourTextTestRunner)
+    except ImportError:
+        unittest.main(verbosity=2)
@@ -0,0 +1,2 @@
+from methods import install_ply
+from symbolic import X, sym_call