Target releaseSummer 2015
Epic DM-1991 - Getting issue details... STATUS
Document statusDRAFT
Document owner

Yusra AlSayyad

AssigneeYusra AlSayyad
  
  

The original description of the problem is here: DM-740 - Getting issue details... STATUS  

The HSC implementation is simple enough that I don't see many modifications to the design needed to fit with LSST.

However, before finalizing a design and making a request for comments, I'd like to make sure I fully understand the scope and requirements. This interface will be used by many components we haven't written yet, and I would appreciate help completing this list of possible clients.

Goals

Design an abstract interface for 2D surface-modeling.  Refactor Approximate/Interpolate classes to inherit from a single interface so that they can be used interchangeably, regardless of internal representation of parameters.

Questions

Please take a look at the following lists to see if there is anything I haven't captured.

  • List of client code in the stack:
    • Current:
      • lsst.pipe.tasks.MatchBackgrounds
      • afw.math.BackgroundMI
      • afw.math.Background
    • Future:
      • Aperture Corrections
      • Zeropoint Scaling: Zeropoints vary spatially over a focal plane. We want a way to fit and store a model of the spatially varying zeropoint, along with the Calib.
      • Interpolate PSF across the focal plane
    • Notes: Currently the only implementations are Chebyshev polynomials, Splines which operate on gridded input data, and Gaussian Processes that operate on scattered data.

  • Domain terminology. Sharing a consistent terminology will simplify the design process. Ideas for describing these concepts:
    • General concept of a fit 2d surface that will inspire the name of the abstract base class:
      • Surface?
      • 2D Model?
      • Bounded Field? <-- from HSC

    • Positions of input points (two types): 
      • gridded vs. scattered

    • Noise handling.  How do we want to describe the difference between polynomial fitting  vs. interpolation through the exact values. Assumption is that a smoothed approximation would be twice differentiable.
      • smoothed vs. exact
        • Smoothed examples:
          • Chebyshev polynomial, bicubic spline, kriging/gaussian processes, radial basis functions
        • Exact examples:
          • nearest neighbor, linear interpolation (residuals = 0,  parameters are original input points)

  • What basic operations do we expect to perform on these 2D Models:
    • transformations
      • Affine
      • Scale
      • Rotation may be too specific. It is difficult on gridded interpolation for example.
    • Operations on images: (image +/-/*/+/ surface)
    • Operations with other surfaces (surface = surface + another surface)
    • fillImage(), evaluate(),  fit(), getResiduals()

  • Expected inputs:
    • Vectors or ndarrays of x1, x2, y, weights
    • Image
    • Masked Image

Assumptions

Requirements

#TitleUser StoryImportanceNotes
1Persistence

Aperture correction

needs to save surface fits

 

Must Have
  • DM-832 - Getting issue details... STATUS
2Gridded and Scattered input

Should use faster algorithms when input is gridded.

Interface should make it easy to get the right algorithm

  
32D-Model objects need same interface

Client code (background-matching task for example) will instantiate a 2D-Model object (whether polynomial or spline subclass will depend on the configuration - begs for a Factory). It will then call the same methods on it regardless of type.

Must have 
4...   

User Interaction

I would like consistency with the way that the similar objects are created and used in the lsst.afw.math. For example, many require the creation of a Control which gets passed to the constructor: 

statsCtrl = afwMath.StatisticsControl()
statsCtrl.setNumSigmaClip(self.config.sigmaClip)
statsCtrl.setNumIter(self.config.clipIter)
statsCtrl.setAndMask(self.getBadPixelMask())
statsCtrl.setNanSafe(True)
statObj = afwMath.makeStatistics(maskedImage.getVariance(), maskedImage.getMask(),
afwMath.MEANCLIP, statsCtrl)

I would also like consistency with APIs that other 2D-modelling code that astronomer users might be familiar with:

#Astropy:
from astropy.modeling import models, fitting
polynomialModel2D = models.Polynomial2D(degree=2)
fitter = fitting.LinearLSQFitter()
polynomial2D  = fitter(polynomialModel2D, x, y, z)
zNew = polynomial2D(xNew, yNew) #to evaluate
 
#Numpy/scipy:
from scipy import interpolate
f = interpolate.interp2d(x, y, z, kind='cubic')
zNew = f(xNew, yNew)
 
#Scikit-learn (1d-example)
from sklearn import GaussianProcess
gp = GaussianProcess(corr='squared_exponential', theta0=theta0...)
gp.fit(x, y)
zNew = gp.predict(xNew)
#This create then fit is consistent throughout sklearn. 
 

I like the consistency of the sci-kit learn API, but these objects are not are immutable once created (see first comment).

 

The prototype user interaction that was presented in RFC-58:

chebCtrl = lsst.afw.math.Model2DControl.makeControl('CHEBYSHEV', moreConfigs)
chebyshevModel2D = lsst.afw.math.Model2D.fit(x, y, z, bbox, chebCtrl)
chebyshevModel2D.fillImage(im)



interpCtrl = lsst.afw.math.Model2DControl.makeControl('INTERPOLATE', moreConfigs)
interpModel2D = lsst.afw.math.Model2D.fit(x, y, z, bbox, interpCtrl)
interpModel2D.fillImage(im)
 

Design

Prototype design that could would enable this type of interface:

 

 

 

 

 

 

Questions

QuestionOutcome

Is this refactor a candidate for rewriting the class in python?

  • There has been talk of redrawing the boundary between python and C++.

 

Not Doing

1 Comment

  1. I think we'll eventually want some way of dealing with transformations that aren't affine (i.e. arbitrary XYTransforms).  This may be something that only certain subclasses support, or something that converts one subclass to anther (and evaluates lazily).  It may not need to go into the design now, but it may be worth considering to make sure the original design doesn't rule it out.

     

    Two major (related) design questions here are whether the object knows its domain, and if so, whether that domain is always a box.  Both of those are true for the HSC BoundedField class, which has some advantages, but it may complicate the situation for transformations, which would not necessarily preserve the shape of the domain.  A hybrid scheme where the objects stores its maximum domain as a bounding box but reserves the right to throw an domain exception at some points within it might be the best way to go.

     

    We may ultimately want ways to compose these objects, and combine them in other ways (i.e. using multiple adjacent objects to cover a larger area).  I think that's just a matter of adding new derived classes with lazy evaluation, though.

     

    I think it's desirable that these be immutable, which probably necessitates related objects (like the factories you've proposed) for creating them.  I do think it's important to separate the interface for users of these spatial fields from their builders; different mathematical approaches may not have the same interface for the latter (and hence may not share a common base class), but they can and should share an interface for the former.  As a result, I'm okay with punting on the interface for construction, at least for now.