skip to main content
10.1145/2746090.2746109acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicailConference Proceedingsconference-collections
extended-abstract

Transfer of predictive models for classification of statutory texts in multi-jurisdictional settings

Published:08 June 2015Publication History

ABSTRACT

In this paper we use statistical machine learning to classify statutory texts in terms of highly specific functional categories. We focus on regulatory provisions from multiple US state jurisdictions, all dealing with the same general topic of public health system emergency preparedness and response. In prior work we have established that one can improve classification performance on one jurisdiction's statutory texts using texts from another jurisdiction. Here we describe a framework facilitating transfer of predictive models for classification of statutory texts among multiple state jurisdictions. Our results show that the classification performance improves as we employ an increasing number of models trained on data coming from different states.

References

  1. I. Batal, C. Hong, and M. Hauskrecht. An efficient probabilistic framework for multi-dimensional classification. In CIKM, pages 2417--2422, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. C. Biagioli, E. Francesconi, A. Passerini, S. Montemagni, and C. Soria. Automatic semantics extraction in law documents. In ICAIL '05, pages 133--140. ACM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. G. Boella, L. D. Caro, L. Lesmo, D. Rispoli, and L. Robaldo. Multi-label classification of legislative text into eurovoc. In B. Schäfer, editor, JURIX 2012, pages 21--30. IOS Press, 2012.Google ScholarGoogle Scholar
  4. N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer. Smote: Synthetic minority over-sampling technique. J. Artif. Int. Res., 16: 321--357, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. V. Daudaravicius. Automatic multilingual annotation of eu legislation with eurovoc descriptors. In EEOP2012 Workshop Proceedings, 2012.Google ScholarGoogle Scholar
  6. E. de Maat, K. Krabben, and R. Winkels. Machine learning versus knowledge based classification of legal texts. In R. Winkels, editor, JURIX 2010, pages 87--96. IOS Press, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. E. de Maat. and R. Winkels. Categorisation of norms. In JURIX 2007, pages 79--88. IOS Press, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. T. Evgeniou and M. Pontil. Regularized multi--task learning. In KDD'04, pages 109--117, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. E. Francesconi. An approach to legal rules modelling and automatic learning. In G. Governatori, editor, JURIX 2009, pages 59--68. IOS Press, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. E. Francesconi, S. Montemagni, W. Peters, and D. Tiscornia. Semantic Processing of Legal Texts, chapter Integrating a Bottom-up and Top-Down Methodology for Building Semantic Resources for the Multilingual Legal Domain, pages 95--121. Number 6036 in LNAI. Springer, Berlin, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. E. Francesconi and A. Passerini. Automatic classification of provisions in legislative texts. AI and Law, 15: 1--17, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. E. Francesconi. and G. Peruginelli. Integrated access to legal literature through automated semantic classification. AI and Law, 17: 31--49, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. Grabmair, K. D. Ashley, R. Hwa, and P. M. Sweeney. Toward extracting information from public health statutes using text classification and machine learning. In K. M. Atkinson, editor, JURIX 2011, pages 73--82. IOS Press, 2011.Google ScholarGoogle Scholar
  14. N. Kakwani. On a class of poverty measures. Econometria, pages 437--446, 1980.Google ScholarGoogle ScholarCross RefCross Ref
  15. R. Opsomer, G. D. Meyer, C. Cornelis, and G. van Eetvelde. Exploiting properties of legislative texts to improve classification accuracy. In G. Governatori, editor, JURIX 2009, pages 136--145. IOS Press, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. J. Pan and Q. Yang. A survey on transfer learning. TKDE, 22(10): 1345--1359, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. B. Pouliquen, R. Steinberger, and C. Ignat. Automatic annotation of multi-lingual text collections with a conceptual thesaurus. arXiv preprint, 2006.Google ScholarGoogle Scholar
  18. R. Steinberger, M. Ebrahim, and C. Ignat. Jrc eurovoc indexer jex-a freely available multi-label categorisation tool. arXiv preprint, 2013.Google ScholarGoogle Scholar
  19. P. M. Sweeney, E. E. Bjerke, M. A. Potter, H. Guclu, C. R. Keane, K. D. Ashley, M. Grabmair, and R. Hwa. Network analysis of manually-encoded state laws and prospects for automation. In R. Winkels, N. Lattieri, and S. Faro, editors, Network Analysis in Law, pages 53--78, Napoli, 2014. Diritto Scienza Technologia.Google ScholarGoogle Scholar
  20. J. Šavelka, M. Grabmair, and K. D. Ashley. Mining information from statutory texts in multi-jurisdictional settings. In R. Hoekstra, editor, JURIX 2014, pages 133--142. IOS Press, 2014.Google ScholarGoogle Scholar
  21. R. Winkels and R. Hoekstra. Automatic extraction of legal concepts and definitions. In JURIX 2012, pages 157--166, 2012.Google ScholarGoogle Scholar
  22. A. Wyner and W. Peters. On rule extraction from regulations. In JURIX 2011, pages 113--122. IOS Press, 2011.Google ScholarGoogle Scholar

Index Terms

  1. Transfer of predictive models for classification of statutory texts in multi-jurisdictional settings

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICAIL '15: Proceedings of the 15th International Conference on Artificial Intelligence and Law
        June 2015
        246 pages
        ISBN:9781450335225
        DOI:10.1145/2746090
        • Conference Chair:
        • Ted Sichelman,
        • Program Chair:
        • Katie Atkinson

        Copyright © 2015 Owner/Author

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 8 June 2015

        Check for updates

        Qualifiers

        • extended-abstract

        Acceptance Rates

        ICAIL '15 Paper Acceptance Rate30of58submissions,52%Overall Acceptance Rate69of169submissions,41%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader