email:
PHONE: +1-206-543-5143
Snail mail:
Michael Gubanov
Paul Allen Center of Computer Science and Engineering
University of Washington
Box 352350
Seattle, WA 98195-2350

About

I'm a PhD candidate in the UW Paul Allen Center of Computer Science and Engineering. Broadly speaking, I'm interested in databases, large-scale data management, search, and natural language processing. Before coming to Seattle, I received an M.Sc. and a B.Sc. in Mathematics and Computer Science from St. Petersburg State University of IT, Mechanics, and Optics (ACM World Champions 2004, 2008, 2009). After that I spent a few wonderful years in the software industry in Germany and Russia in various roles. While in the PhD program here, I spent some time working on relevant projects at IBM Almaden Research Center and Google.

Recent research projects

  • UFOs: During my internship at Almaden and afterwards, I worked on the design of a new modular data management and integration paradigm.
  • TextDB: A new text database management system that can automatically extract a schema from natural language text and process structural queries.
  • GCET: During my internship at Google I worked on a highly scalable (hundreds of nodes) distributed machine learning infrastructure.
  • ULDisk: A new user-level (not kernel-mode) driver for UNIX block devices, applied as a P2P distributed virtual RAM disk (a graduate systems course project, together with Steve Gribble, Ed Lazowska).
  • SPAMDB: A new approach to SPAM filtering based on splitting the training corpus for SPAM classification by user group (a graduate networks course project, together with Chris Re, David Wetherall).
  • RondoRE: New restructuring, merging, and reverse-engineering operators added to Rondo (a graduate databases course project).

Selected publications

Policy
I do not provide copies of my papers due to copyright restrictions. If you cannot find them, let me know and I'll try to help. Thanks for your understanding.
    IBM UFO Repository: Object-oriented data integration [bib]
    Michael Gubanov, Lucian Popa, Howard Ho, Hamid Pirahesh, Jeng-Yih Chang, Shr-Chang Chen.
    Proceedings of the 35th International Conference on Very Large Data Bases, Lyon, France 2009

    Simplifying Information Integration: Object-based flow-of-mappings framework for integration. [bib]
    Bogdan Alexe, Michael Gubanov, Mauricio A. Hernandez, Howard Ho, Jen-Wei Huang, Yannis Katsis, Lucian Popa, Barna Saha, Ioana Stanoi
    Invited paper in the book Business Intelligence for the Real Time Enterprise, Springer, pp. 108-121, 2009

    Group-based SPAM Fighting [bib]
    Michael Gubanov, Christopher Re.
    University of Washington Technical Report 2009

    User-level disks - building disk services at the block level [bib]
    Alexander Moshchuk, Michael Gubanov
    University of Washington Technical Report 2009

    Simplifying Information Integration: Object-based flow-of-mappings framework for integration. [bib]
    Bogdan Alexe, Michael Gubanov, Mauricio A. Hernandez, Howard Ho, Jen-Wei Huang, Yannis Katsis, Lucian Popa.
    Proceedings of the 2nd International Workshop on Business Intelligence for the Real Time Enterprise, Auckland, New Zealand 2008

    Structural text search and comparison using automatically extracted schema [bib]
    Michael Gubanov, Philip A. Bernstein
    Proceedings of the 9th International Workshop on the Web and Databases, Chicago, Illinois 2006

    Distributed component architecture and a library for Web XML applications
    Michael Gubanov
    Proceedings of the 9th International conference on the Web and distributed computing, St. Petersburg, Russia 2002

    Design, modeling, and implementation techniques for Turbo-, Viterbi-, and Reed-Solomon-codecs for Hard Disks
    Michael Gubanov
    Proceedings of the 9th International conference on the codec design, St. Petersburg, Russia 2001

    Load balancing and fault tolerance for clearing interbanking transaction processing
    Michael Gubanov
    M.Sc. thesis, St. Petersburg State University of IT, Mechanics, and Optics, St. Petersburg, Russia 2001

    Sparse matrices and advanced data storage techniques
    Michael Gubanov
    Proceedings of the 4th International conference on new mathematical models and simulations, St. Petersburg, Russia 1997

    Multidimensional integrating with Haar Functions
    Michael Gubanov
    Proceedings of the 4th International conference on new mathematical models and simulations, St. Petersburg, Russia 1997

© Copyright 2009-2010 Michael Gubanov. All rights reserved.