
Problem inverting Matrix, saving as errSVDArray_2453 #38

@adrabent

Dear all,

I am running the DDF-pipeline on a three-epoch observation. The data quality looks good, judging by the images produced so far. However, the pipeline crashes during the bootstrap step with the following error message:

 - 10:42:55 - ClassMultiScaleMachine       [16.2/18.9 30.6/33.4 27.7Gb] 48 scales and 7 scale functions in list
/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/ClassPSFServer.py:336: RuntimeWarning: invalid value encountered in double_scalars
  FreqBandsFluxRatio[iAlpha,iChannel]=np.sqrt(np.sum(BeamFactor*((ThisFreqs/RefFreq)**ThisAlpha)**2))/np.sqrt(np.sum(BeamFactorWeightSq))
 - 10:43:04 - DDFacet                      [16.8/18.9 31.2/33.4 28.2Gb] Traceback (most recent call last):
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Array/ModLinAlg.py", line 229, in invSVD
    u,s,v=np.linalg.svd(Ar)
  File "<__array_function__ internals>", line 5, in svd
  File "/usr/lib/python3/dist-packages/numpy/linalg/linalg.py", line 1661, in svd
    u, s, vh = gufunc(a, signature=signature, extobj=extobj)
ValueError: On entry to DLASCL parameter number 4 had an illegal value

The extended error message looks like this:

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/bin/DDF.py", line 461, in <module>
    main(OP, messages)
  File "/usr/local/bin/DDF.py", line 295, in main
    Imager.main()
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/ClassDeconvMachine.py", line 1222, in main
    repMinor, continue_deconv, update_model = self.DeconvMachine.Deconvolve()
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/SSD/ClassImageDeconvMachineSSD.py", line 410, in Deconvolve
    self.InitIslands()
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/SSD/ClassImageDeconvMachineSSD.py", line 323, in InitIslands
    self._init_InitMachine()
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/SSD/ClassImageDeconvMachineSSD.py", line 151, in _init_InitMachine
    self.InitMachine.Init(self.DicoVariablePSF, self.GridFreqs, self.DegridFreqs)
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/SSD/ClassInitSSDModelHMP.py", line 38, in Init
    self.InitMachine.Init(DicoVariablePSF, GridFreqs, DegridFreqs)
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/SSD/ClassInitSSDModelHMP.py", line 243, in Init
    self.DeconvMachine.Init(PSFVar=self.DicoVariablePSF,PSFAve=self.DicoVariablePSF["PSFSideLobes"],
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/MSMF/ClassImageDeconvMachineMSMF.py", line 199, in Init
    self.InitMSMF(approx=approx, cache=cache, facetcache=facetcache)
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/MSMF/ClassImageDeconvMachineMSMF.py", line 285, in InitMSMF
    self._initMSM_facet(centralFacet,
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/MSMF/ClassImageDeconvMachineMSMF.py", line 243, in _initMSM_facet
    MSMachine.MakeBasisMatrix()
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/MSMF/ClassMultiScaleMachine.py", line 727, in MakeBasisMatrix
    self.DicoBasisMatrix = self.GiveBasisMatrix()
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Imager/MSMF/ClassMultiScaleMachine.py", line 770, in GiveBasisMatrix
    DicoBasisMatrix["BMT_BM_inv"] = np.float32(ModLinAlg.invSVD(BMT_BM))
  File "/usr/local/lib/python3.9/dist-packages/DDFacet/Array/ModLinAlg.py", line 238, in invSVD
    u,s,v=np.linalg.svd(np.complex64(Ar))#+np.random.randn(*Ar.shape)*(1e-10*np.abs(Ar).max()))
  File "<__array_function__ internals>", line 5, in svd
  File "/usr/lib/python3/dist-packages/numpy/linalg/linalg.py", line 1661, in svd
    u, s, vh = gufunc(a, signature=signature, extobj=extobj)
ValueError: On entry to DLASCL parameter number 4 had an illegal value

and it finishes with:

 - 10:43:04 - DDFacet                      [14.6/18.9 29.1/33.4 28.2Gb] There was a problem after 3m7.5s; if you think this is a bug please open an issue, 
 - 10:43:04 - DDFacet                      [14.6/18.9 29.1/33.4 28.2Gb]   quote your version of DDFacet and attach your logfile.
 - 10:43:04 - DDFacet                      [14.6/18.9 29.1/33.4 28.2Gb] You are using DDFacet revision: 0.6.0.0
 - 10:43:04 - DDFacet                      [14.6/18.9 29.1/33.4 28.2Gb] Your logfile is available here: /data/LOFAR/HBA/A2319_merged/image_bootstrap_L823390.log
Problem inverting Matrix, saving as errSVDArray_2453
  will make it svd-able
 - 10:43:08 - ClearSHM                     | Clear shared memory
 - 10:43:08 - ClearSHM                     | Clear Semaphores
 - 10:43:08 - ClearSHM                     | Clear shared dictionaries

followed by tracebacks that just mention the occurrence of the Runtime Error. I have attached the logfile to this ticket:
image_bootstrap_L823390.log

What can I do to figure out why DDF fails? Has this problem been seen before?
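
One check that might narrow this down: the DLASCL "parameter number 4" error from `np.linalg.svd` is commonly seen when the input matrix contains NaN or Inf, which would fit the `RuntimeWarning: invalid value encountered` printed just before the crash. Below is a minimal sketch for inspecting a matrix for non-finite entries. The helper `summarize_nonfinite` is hypothetical (it is not part of DDFacet), and the assumption that the dumped `errSVDArray_2453` can be read with `np.load` is unverified, so that line is left as a comment.

```python
import numpy as np

def summarize_nonfinite(arr):
    """Count NaN/Inf entries in a 2-D array and list the rows that contain them.

    Hypothetical helper for illustration; not part of DDFacet.
    """
    arr = np.asarray(arr)
    nan_mask = np.isnan(arr)
    inf_mask = np.isinf(arr)
    bad = nan_mask | inf_mask
    # Row indices that hold at least one non-finite value.
    bad_rows = np.unique(np.nonzero(bad)[0])
    return {
        "shape": arr.shape,
        "n_nan": int(nan_mask.sum()),
        "n_inf": int(inf_mask.sum()),
        "bad_rows": bad_rows.tolist(),
    }

# Synthetic stand-in for the matrix handed to the SVD (not real pipeline data).
demo = np.eye(3)
demo[1, 2] = np.nan
report = summarize_nonfinite(demo)
print(report)

# Assumption: the dump written by DDFacet is a NumPy file; the exact file name
# and format are not confirmed by the log.
# dumped = np.load("errSVDArray_2453.npy")
# print(summarize_nonfinite(dumped))
```

If the dumped matrix does turn out to contain NaN, the rows affected might point to the scale or frequency band for which `FreqBandsFluxRatio` became invalid.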

With kind regards,
Alex
