Description: Penn State Official Sheild


Prasenjit Mitra,
Associate Professor


College of Information Sciences and Technology

Department of Computer Science and Engineering        (Graduate Faculty)
Department of Industrial and Manufacturing Engineering   (Affiliate Faculty)

Students Publications Projects Curriculum Vita

Biographical Sketch:

Prasenjit Mitra is an Associate Professor in the College of Information Sciences and Technology; he serves on the graduate faculty of the Department of Computer Sciences and Engineering and is an affiliate faculty member of the Department of Industrial and Manufacturing Engineering at The Pennsylvania State University.  His major research interests are in exploring issues in information extraction, information integration and information visualization.  His research is being supported by the NSF CAREER Award.  Additionally, his research has been supported by the NSF, Microsoft Corporation, DoD, DHS, DoE, NGA, and DTRA.   He obtained his Ph.D. in Electrical Engineering from Stanford University in 2004.  Prior to that, he obtained his M.S. in Computer Science from The University of Texas at Austin in 1994 and his B. Tech. (Hons.) from the Indian Institute of Technology, Kharagpur in 1993.  From 1995 to 2000, he was a Senior Member of the Technical Staff at Oracle Corporation in the Oracle Parallel Server and Languages and Relational Technologies groups in the Server Technologies division.  He also serves in the Board of Advisors of Global IDs, Inc.  Mitra has co-authored over sixty articles at top conferences and journals. His work along (with his co-authors) resulted in a visual analytics system that was awarded the IEEE VAST '08 Grand Challenge award in the Data Integration area. He has served as the co-chair of three workshops including WIDMí09 and served on the PC of several conferences including SIGMOD, AAAI, IJCAI, WWW, CIKM, and ICDM.



Electrical Engineering

Stanford University



Computer Science

The University of Texas at Austin


B.Tech. (Hons.)

Computer Science and Engineering

Indian Institute of Technology, Kharagpur


Selected Awards:

         NSF CAREER Award, 2009-2014
         National Talent Search, 1989-1993


Intelligent Information Systems Laboratory
Cyber Security Laboratory
North-East Visualization and Analytics Center

Institute for CyberScience


Industrial Experience:

Global IDs: Chief Scientist, Member of the Board of Advisors, 2007 to present.

DBWizards: Senior Software Engineer, 2002-2003

Narus: Senior Software Engineer, 2000-2001

Oracle Corporation: Senior Member of Technical Staff (Server Technologies Division), 1995-2000.

Research Interests:

General Areas: Database Systems, Digital Libraries, Visual Analytics, Data Mining, Semantic Web, Information Retrieval.
Core Problems:   Information Extraction, Information Integration, Information Visualization

My primary research focus is on issues related to information extraction from documents especially documents retrieved from the World-Wide-Web.  Apart from extracting information from the web, we have started looking into extracting information from tables and images in digital documents automatically.  Of special interest to me is automated geo-spatial information extraction and visualization.

Some projects in which I am a principal investigator or a co-principal investigator follow:
  The DOES Project on DOcument-element Extraction and Search, NSF CAREER

  Semantic CiteSeerX, NSF
  ChemXSeer: An Integrated Digital Library and Data Repository, Dow Corporation (Past: NSF)
  VACCINE: Visual Analytics for Command, Control, and Interoperability Environments, DHS University Centre of Excellence

  Analysis and Intelligent Search for Cypriot Works of Art and Secreteriat Corpus, NSF


I am also a co-director of the Cancer Informatics Initiative (CANI) at Penn State.


IST 552: Data and Knowledge Management, Spring 2010-12
IST 512: Information Processing Technologies and Architectures, Spring 2007, 2008
IST 461: Database Systems Management and Administration, Fall 2006
IST 220: Computer Networks and Telecommunications, Spring 2004-06,2010 Fall 2007-11
IST 402: Emerging Topics in Database Systems, Fall 2004, 2005


Office: 313F IST Building,
The Pennsylvania State University,
University Park, PA 16802.
Office Phone: +1 (814) 865-4454

Fax : +1 (814) 865-6426
Email: pmitra AT

Other Interesting Links:

Some Maps